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Preface to the Eighth Edition 


For the past two decades, Dr. Gerald Karp has written Cell and 
Molecular Biology: Concepts and Experiments. During this time, 
he has maintained a consistent focus on combining rigor with 
accessibility, so that even students without prior training in cell 
biology, molecular biology, or biochemistry have been able to 
learn cell biology not just as a collection of facts but as a process 
of discovery. The value of this approach is that the lessons 
learned extend far beyond the field of cell biology, and provide a 
way for students to learn how science works, how new experi- 
ments can overturn previous dogmas, and how new techniques 
can lead to groundbreaking discovery. This approach makes cell 
biology come alive. 

After seven editions, Dr. Karp is ready to move on to other 
adventures. We are excited to take on the challenge of continuing 
Dr. Karp’s unique approach to teaching cell biology, while continu- 
ing to put students first. Our goal for this revision was to build 
upon Karp’s hallmark experimental approach by bringing in our 
own unique perspectives and harnessing today’s technology. With 
our new Experimental Walkthrough feature, available in 
WileyPLUS Learning Space, students can see first-hand how key 
experimental techniques are performed in the lab. These offer a 
mix of video, which show how researchers carry out experiments, 
and 3D animations that show a molecular-level view of how the 
experiments work. These Walkthroughs provide context and a vis- 
ual explanation that helps make these important experimental 
techniques more concrete. 

A solid understanding of quantitative concepts is becoming 
increasingly important within cell biology, but is an area that 
many students struggle with. To address this issue, we have also 
added another new video feature, called Quantitative Tutorials, 
to visually illustrate how to solve specific analytical questions at 
the end of each chapter. The Quantitative Tutorial provides an 
accessible, student-friendly review of basic mathematical concepts 
used within the context of a biological problem, and will expand 
the available resources for quantitative and physical concepts 
within this 8" edition. 

One key feature of the past editions was to highlight how cell 
biology impacts our daily lives, in terms of medicine and other 
areas of society. The Human Perspectives sections highlight 
human interest stories to reinforce and review basic cell biology, 
and also provide examples of how fundamental discoveries have 
progressed into clinical practice. We have expanded this feature 
so that now every chapter has at least one Human Perspectives 
section. As part of this feature we report on the latest clinical tri- 
als for various cell biology-based therapies and drugs, a feature 
that we hope will inspire students who are pursuing careers in 
health sciences fields. In addition to the full Human Perspectives 
sections, each chapter is now introduced with a short “chapter 
opener” designed to generate enthusiasm about the science in 
each chapter through provocative issues or questions. We hope 
that this will give our readers the opportunity to think more 
about the links between science, society, and our place in the 
universe. 


Working on the 8" edition side by side with Dr. Karp has given 
us renewed admiration for his writing and his ability to keep track 
of the cutting edge in the full range of topics that comprise cell and 
molecular biology. In this and future editions of Karp’ Cell and 
Molecular Biology: Concepts and Experiments, we are dedicated to 
carrying out Dr. Karp’s original mission of providing an interesting, 
modern and readable text that is grounded in the experimental 
approach. We welcome your ideas and feedback as we continue our 
work on this text, so please feel free to get in touch. 


Janet Iwasa (jiwasa@gmail.com) 
Wallace Marshall (Wallace. Marshall@ucsf.edu) 


WileyPLUS Learning Space 


WileyPLUS Learning Space connects the text to carefully-selected 
media examples such as video, animations, and diagrams, and 
provides students a multitude of tools and content for self-study 
and practice. Instructors can customize their course content for 
students, create online homework and quizzes, and have insight 
into student activity through data analytics and reporting features. 
To try it, visit http://www.wileypluslearningspace.com. Here are 
some of the resources available in WileyPLUS Learning Space: 


Experimental Walkthrough Videos 

Quantitative Tutorial Videos 

Cell View Animations 

Video Library 

Biology Basics Animations 

Instructor’s Manual by Joel Piperberg, Millersville University 


Clicker Questions by Leocadia Paliulis, Bucknell University 
and Omar Quintero, University of Richmond 


e Lecture PowerPoint Presentations by Edmund B. Rucker, 
University of Kentucky 


e Testbank and Answer Key by Robert Seiser, Roosevelt University 
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Andre M. Lwoff 
Jacques L. Monod 
1964 Dorothy C. Hodgkin Chemistry X-ray structure of complex biological molecules 717 
1963 John C. Eccles M&P Ionic basis of nerve membrane potentials 160 
Alan L. Hodgkin 
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1962 Francis H. C. Crick M&P Three-dimensional structure of DNA 374-377 
James D. Watson 
Maurice H. F. Wilkins 
John C. Kendrew Chemistry Three-dimensional structure of globular proteins 56 
Max F. Perutz 
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1960 F. MacFarlane Burnet M&P Clonal selection theory of antibody formation 666 
Peter B. Medawar 
1959 Arthur Kornberg M&P Synthesis of DNA and RNA 518, 523 
Severo Ochoa 
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Joshua Lederberg 
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Acquired immune deficiency syndrome. See 
AIDS 
Acute lymphoblastic leukemia (ALL), 644, 651, 
653, Chapter 16 
Acute myeloid leukemia (AML), 647-648, 651, 
652f, 653f, Chapter 16 
Adaptive (acquired) immune response, 665, 
Chapter 17 
Adenoviruses, 21HP, 23, 24f, 158HP, 420-421, 
631, 641, 643, Chapter 1, Chapter 4, 
Chapter 11, Chapter 16 
Adrenoleukodystrophy (ALD), 197HP, 
Chapter 5 
African populations, genomes of, 385-386, 
Chapter 10 
Agammaglobulinemia, 665, Chapter 17 
Aging 
Down syndrome (trisomy 21) and, 479HP, 
Chapter 14 
mitochondrial disorders and, 195HP, Chapter 5 
premature (progeria), 461, 536HP, Chapter 12, 
Chapter 13 
radicals and, 34-35HP, Chapter 2 
telomeres and, 477, Chapter 12 
AIDS (acquired immune deficiency syndrome) 
helper T cells and, 678, Chapter 17 
resistance, 595-596HP, 678, 684, 685, 
Chapter 15, Chapter 17 
resistance to drugs, 74, 101-102HP, Chapter 2; 
Chapter 3 
therapies for, 433HP, Chapter 11 
ALD (adrenoleukodystrophy), 197HP, 
Chapter 5 
ALL (Acute lymphoblastic leukemia), 644, 651, 
653, Chapter 16 
Alzheimer’s disease (AD), 64-67HP, 66HPf, 165, 
Chapter 2, Chapter 4 
AML (Acute myeloid leukemia), 647-648, 651, 
652f, 653f, Chapter 16 
Anesthetics, 161, Chapter 4 
Aneuploidy, 566, 577-578HP, 630, 630f, 643, 
645, Chapter 14, Chapter 16 
Antacid medications, 154, Chapter 4 
Antibiotics, 100-102HP 
modes of, in clinical use, 101HPt, Chapter 3 
penicillin, 99, 1OOHP, 101HPt, 102HP, 
Chapter 3 
Antidepressants, 165, Chapter 4 
Anti-inflammatory drugs, and cancer, 632, 
Chapter 16 
Antioxidants, 35HP, Chapter 2 
Appetite, 111, 165, Chapter 3, Chapter 4 
Arthritis, rheumatoid, 669HP, 670HP, 671HP, 
740, Chapter 11, Chapter 18 
Atherosclerosis, 302, Chapter 8 
Autoimmune diseases, 668-671HP, 
Chapter 17 


Graves’ disease and thyroiditis, 669HP, 
Chapter 17 

inflammatory bowel diseases (IBDs), 669HP, 
Chapter 17 

multiple sclerosis (MS), 669HP, Chapter 17 

rheumatoid arthritis, 669HP, Chapter 17 

systemic lupus erythematosus (SLE), 669HP, 
669HPf, Chapter 17 

treatment of, 670-671HP, Chapter 17 

type 1 diabetes (T1D), 669HP, Chapter 17 


Bacterial toxins, 590, Chapter 15 
Bacteriophage therapy, 25, Chapter 1 
Benign tumors, 638, 644, 650, Chapter 16 
Biofilms, 13, 157HP, 583, Chapter 1, Chapter 4, 
Chapter 15 
Blistering diseases, 237, 245, 338, 671, 
Chapter 7, Chapter 9, Chapter 17 
Blood-brain barrier, 247, Chapter 7 
Blood cell differentiation, 666f, Chapter 17 
Blood clots, 46, 49, 229, 231, 233, 233f, 281, 302, 
392, 586, Chapter 2, Chapter 7, Chapter 8, 
Chapter 10, Chapter 15 
Blood glucose, 599-602, 599f, Chapter 15 
Blood-group antigens, 122-123, 123f, 398, 684, 
Chapter 4, Chapter 10, Chapter 17 
Bone marrow, 18HP, 197HP, 231f, 235, 242HP, 
288HP, 402HP, 475, 476, 481, 662, 662f, 665, 
666f, 669HP, 686, Chapter 1, Chapter 5, 
Chapter 7, Chapter 8, Chapter 10, 
Chapter 12, Chapter 17 
Bone marrow transplantation, 18HP, 197HP, 
242HP, Chapter 1, Chapter 5, Chapter 7 
Booster shots, 671, Chapter 17 
Breast cancer 
BRCA1 and, 512, 645, Chapter 13, 
Chapter 16 
cause of, 631, Chapter 16 
gene-expression analysis, 652-653, 654f, 
Chapter 16 
genetic mutations in, 640, 642f, 644, 
Chapter 16 
genetics and, 636, Chapter 16 
immunotherapy for, 655, Chapter 16 
karyotype of cell from, 630f, Chapter 16 
new cases and deaths in US in 2015, 629f, 
Chapter 16 
PI3K pathway in, 651, Chapter 16 
Preventive mastectomy, 512, Chapter 13 
protein-inhibiting drugs for, 657, Chapter 16 
PSA determinations for detecting, 660, 
Chapter 16 
research efforts, 628, Chapter 16 
tumor-suppressor genes in, 640t, Chapter 16 
tyrosine phosphorylation in cells, 585f, 
Chapter 15 
Burkitt’s lymphoma, 631, 647, Chapter 16 


Calorie-restricted diet, life span and, 632, 
Chapter 16, 111-112HP, Chapter 3 
Cancer, 627-660, Chapter 16 
cancer genome, 649-651, Chapter 16 
causes of, 631-632, Chapter 16 
cells, properties of, 628-631, 629f, Chapter 16 
aneuploidy, 630, 630f, Chapter 16 
cells of origin of malignant tumors, 
636-637, 637f, Chapter 16 
effects of serum deprivation on growth of, 
629-630, 630f, Chapter 16 
growth rate, 629, 629f, Chapter 16 
metastasis, 628, 628f, Chapter 16 
combating, strategies for, 654-660, Chapter 16 
angiogenesis, 659-660, Chapter 16 
cancer stem cells, 658-659, Chapter 16 
chemotherapy, 628, 637, 644, 648, 653, 654, 
655, 659, Chapter 16 
immunotherapy, 654-656, Chapter 16 
inhibiting activity of cancer-promoting 
proteins, 656-658, Chapter 16 
targeted therapies, 654, Chapter 16 
diet and, 632, Chapter 16 
gene-expression analysis, 651-654, 652f, 653f, 
654f, Chapter 16 
as genetic disorder, 636-638, Chapter 16 
microRNAs, 649, Chapter 16 
multiple myeloma, 679, 680f, 681, 738-739, 
739f, Chapter 17 
mutator phenotype, 649, Chapter 16 
new cases and deaths in US in 2015, 628, 629f, 
Chapter 16 
oncogenes, 646-648, Chapter 16 
activation of proto-oncogene to, 638, 639, 
639f, Chapter 16 
discovery of, 633-636EP, Chapter 16 
overview of, 638-639, 638f, Chapter 16 
research efforts, 628, Chapter 16 
therapy, PLK1 as target for, 433HP, 433HPf, 
Chapter 11 
tumor-suppressor genes, 638-646, 638f, 640t, 
Chapter 16 
APC genes, 645, Chapter 16 
BRCA1/BRCA2 genes, 512, 645, Chapter 13, 
Chapter 16 
overview of, 638-639, Chapter 16 
PTEN gene, 646, Chapter 16 
RB gene, 640-641, 641f, Chapter 16 
TP53 gene, 642-645, 642f, Chapter 16 
Carcinogens, 631, 632f, 643, Chapter 16 
Cell-mediated immunity, 665, 685, Chapter 17 
Cell replacement therapy, 17-21HP, 
Chapter 1 
adult stem cells, 18HP, 18HPf, Chapter 1 
direct cell reprogramming, 21HP, Chapter 1 
embryonic stem cells, 18-19HP, 19HPf, 
Chapter 1 


induced pluripotent stem cells, 19-21HP, 
20HPf, Chapter 1 
Cervical cancer, 631, 632, 637, 637f, 660, 
Chapter 17 
Chemotherapy, 628, 637, 644, 648, 653, 654, 655, 
659, Chapter 16 
Cholera, 592-593EP, Chapter 15 
Cholesterol 
in cell membranes, 47, 47f, Chapter 2 
in membrane lipids, 120, 120f, Chapter 4 
in phospholipids, 120, 120f, Chapter 4 
simple sugars and, 42, Chapter 2 
as steroid, importance of, 47, Chapter 2 
structure of, 39, 40f, 47f, Chapter 2 
Chromosomes 
aberrations and human disorders, 478-479HP, 
Chapter 12 
as carriers of genetic information, 369-370, 
Chapter 10 
compaction, 544EP, 545EP, 552, 554, 574, 
Chapter 14 
premature, 542, Chapter 14 
defined, 368, Chapter 10 
discovery of, 368-369, Chapter 10 
fertilization, process of, 368-369, 369f, 
Chapter 10 
giant, 371-373, 372f, Chapter 10 
heterochromatin, 469-473, Chapter 12 
homologous, 369, 370f, 371, 371f, Chapter 10 
mitotic, structure of, 473-478, 474f, Chapter 12 
polytene, 373, Chapter 10 
X chromosome inactivation, 470, Chapter 12 
Chronic lymphocytic leukemia (CLL), 655-656, 
Chapter 16 
Chronic myelogenous leukemia (CML), 75, 75f, 
76, 649, 656, 659, Chapter 2, Chapter 16 
Ciliopathies, 333-334HP, Chapter 9 
CLL (chronic lymphocytic leukemia), 655-656, 
Chapter 16 
Clonal selection theory applied to B cells, 
666-668, 666f, 667f, 668f, Chapter 17 
Cloning of animals, 483-484, 484f, Chapter 12 
CML (chronic myelogenous leukemia), 75, 75f, 
76, 649, 656, 659, Chapter 2, Chapter 16 
Cockayne syndrome (CS), 536HP, Chapter 13 
Collagen, diseases of, 226-228, 227f, 228f, 
Chapter 7 
Colon cancer 
cause of, 631, 632, Chapter 16 
EGF receptor and, 655, Chapter 16 
genetic mutations in, 642f, 645, 650, 650f, 658, 
Chapter 16 
NSAIDs and, 632, Chapter 16 
research efforts, 628, Chapter 16 
tumor-suppressor genes, 640t, Chapter 16 
Color blindness, 470, 594HP, Chapter 12, 
Chapter 15 
Congenital Diseases of Glycosylation (CDGs), 
274, Chapter 8 
Creutzfeld-Jakob disease (CJD), 62-64HP, 
Chapter 2 
Cystic fibrosis (CF), 139, 157-158HP, 157HPf, 275, 
449, Chapter 4, Chapter 8, Chapter 11 


Deafness, myosin mutations and, 344, 
Chapter 9 
Diabetes 
congenital nephrogenic diabetes insipidus, 
142, Chapter 4 
Hemoglobin Alc and, 42, Chapter 2 
type 1, 613, 669HP, Chapter 15, Chapter 17 


type 2, 613, Chapter 15 

Diarrhea, 142, 593EP, Chapter 4, Chapter 15 

Diet, cancer and, 632, Chapter 16 

DNA fingerprinting, 385, 385f, Chapter 10 

DNA repair, 531-535, Chapter 13 

Down syndrome (trisomy 21), 479HP, 
Chapter 14 

Drug development, 75, 111, 432HP, Chapter 2, 
Chapter 3, Chapter 11 

Dwarfism, 227, 536HP, Chapter 7, Chapter 13 


Embryonic development 
cadherins in, 239-240, 240f, 362, Chapter 7, 
Chapter 9 
changes in cell shape during, 362, 363f 
Embryonic stem cells, 18-19HP, 19HPf, 
735-736, Chapter 1, Chapter 18 
Enzyme replacement therapy, 287HP, Chapter 8 
Epstein-Barr virus, 631, Chapter 16 
Exercise, 177-178HP, Chapter 5 
Fabry’s disease, 287HPt, Chapter 8 
Fragile X syndrome, 389HPf, Chapter 10 
Free radicals, aging and, 34-35HP, Chapter 2 


Gaucher’s disease, 287—288HP, 287HPt, 
Chapter 8 

Gene number, 389, 390f, 394, Chapter 10 

Gene therapy, 25, 158HP, 197HP, 732, Chapter 1, 
Chapter 4, Chapter 5, Chapter 18 

Genome analysis, 390, Chapter 10 

Gleevec, 75, 75f, 76, 99, 656-657, 657f, 659, 
Chapter 2, Chapter 3, Chapter 16 

Glycolipids, diseases of, 118f, 119-120, 119f, 120f, 
122, 286-288HP, Chapter 4, Chapter 8 

Graft rejection, 684, Chapter 17 

Graves’ disease and thyroiditis, 669HP, 
Chapter 17 


Heart attack/heart disease, 18HP, 112HP, 231, 
233, 241HP, 297EP, 302-303, 302f, 461, 
Chapter 1, Chapter 3, Chapter 7, Chapter 8, 
Chapter 12 

Heartburn, 20HP, 154, 402HP, Chapter 1, 
Chapter 4, Chapter 10 

Heart muscle, 76, 178HP, 249, 620, Chapter 2; 
Chapter 5, Chapter 7, Chapter 15 

Hemolytic anemias, 137, Chapter 4 

Hemophilia, 392, Chapter 10 

Herceptin, 655, Chapter 16 

Herpes virus (HHV-8), 631, Chapter 16 

HIV (human immunodeficiency virus), 23, 24f, 
25, 25f, 102HP, 433HP, Chapter1, Chapter 3, 
Chapter 11 

azidothymidine (AZT) and, 526, Chapter 13 
T,, cells and, 678, Chapter 17 

Human Genome Project, 402, 730, Chapter 10, 
Chapter 18 

Human immunodeficiency virus. see HIV 
(human immunodeficiency virus) 

Human papilloma virus (HPV), 631, Chapter 16 

Humoral immunity, 665, Chapter 17 

Huntington’s disease, 433HP, Chapter 10 

Hydrocephalus, 239, 334HP, Chapter 7, 

Chapter 9 

Hypertension, 99, Chapter 3 


I-cell disease, 287-288HP, Chapter 8 
Immune response, 661-691, Chapter 17 
antibodies, modular structure of, 678-681, 
Chapter 17 
autoimmune diseases, 668-671HP, Chapter 17 


clonal selection theory applied to B cells, 
666-668, 666f, Chapter 17 
distinguishing self from nonself, 686-688, 
Chapter 17 
DNA rearrangements producing genes 
encoding B- and T-cell antigen receptors, 
681-683, Chapter 17 
lymphocyte activation by cell-surface signals, 
689, Chapter 17 
major histocompatibility complex, 672-675HP, 
684-686, Chapter 17 
membrane-bound antigen receptor complexes, 
683-684, Chapter 17 
overview of, 662-665, Chapter 17 
adaptive immune responses, 665, 
Chapter 17 
immune system, 662-663, 662f, Chapter 17 
immunity, 662, Chapter 17 
innate immune responses, 663-665, 665f, 
Chapter 17 
signal transduction pathways in lymphocyte 
activation, 689-691, Chapter 17 
T cell activation and mechanism of action, 
675-678, Chapter 17 
vaccination, 671, Chapter 17 
Immune system, 662-663, 662f, Chapter 17 
Immunization, 671, Chapter 17 
Immunotherapy, 654-656, Chapter 16 
Inborn errors of metabolism, 405, Chapter 11 
Induced pluripotent stem cells (iPS cells), 
19-21HP, 20HPf, 489, Chapter 1, Chapter 12 
Infections 
adaptive immune responses, 665, Chapter 17 
cancer-causing, 623fn, Chapter 15 
innate immune responses, 663-665, 665f, 
Chapter 17 
lytic, 24, 25f, Chapter 1 
protective mechansims, 664, Chapter 17 
resistant bacterial, 102HP, Chapter 3 
viral, 24-25, 25f, Chapter 1 
Inflammation 
cell adhesion in, 241-243HP, 242HPf, 
Chapter 7 
as innate responses to invading pathogens, 
664, Chapter 17 
Inflammatory bowel diseases (IBDs), 669HP, 
Chapter 17 
Influenza, 23, Chapter 1, 74, Chapter 2 
Innate immune responses, 663-665, 665f, 
Chapter 17 
Insulin signaling, 611-613, Chapter 15 
Interferons (IFNs), 676, Chapter 17 
Interleukins (ILs), 676, Chapter 17 


Kaposi’s sarcoma, 594HP, 631, Chapter 15, 
Chapter 16 
Karyotypes, 473, 630f, 640, 645, 651, Chapter 12, 
Chapter 16 
Kidneys 
cancer, 640t, 642f, Chapter 16 
failure from diabetes, 224, Chapter 7 
polycystic kidney diseases, 334HP, 334HPf, 
Chapter 9 
Klinefelter syndrome, 578HP, Chapter 14 
Krabbe’s disease, 287HPt, Chapter 8 


Lactose tolerance, 402HP, Chapter 10 
Leukemias 
acute lymphoblastic, 644, 651, 653, Chapter 16 
acute myeloid, 647-648, 651, 652f, 653f, 
Chapter 16 


cancer genome and, 649, Chapter 16 
cancer stem cells, 659, Chapter 16 
chronic lymphocytic, 655-656, Chapter 16 
chronic myelogenous, 656, Chapter 16 
gene-expression analysis, 651-652, 653f, 
Chapter 16 
genetic mutations in, 644, Chapter 16 
genetics and, 636, Chapter 16 
immunotherapy for, 655, Chapter 16 
microRNAs and, 649, Chapter 16 
new cases and deaths in US in 2015, 629f, 
Chapter 16 
oncogenes and, 632-636EP, 647-648, 
Chapter 16 
protein-inhibiting drugs for, 656, Chapter 16 
tumor-suppressor genes and, 642, Chapter 16 
Leukocyte adhesion deficiency (LAD), 242HP, 
Chapter 7 
Listeria monocytogenes, 304, 358EP, Chapter 8, 
Chapter 9 
Longevity, 111-112HP, 477, 480, 507, 510, 
Chapter 3, Chapter 12 
Lysosomal storage disorders, 287HP, 287HPf, 
287HPt, Chapter 8 


Macular degeneration, 19-20HP, 401HP, 433HP, 
Chapter 1, Chapter 10, Chapter11 
Mad cow disease, 63HP, Chapter 2 
Malaria, 137, 631, 662, 685, Chapter 4, 
Chapter 16, Chapter 17 
Marijuana, 165, 211, Chapter 4, Chapter 6 
Marker chromosome, 477, Chapter 12 
Melanoma 
BRAF genes in, 647, 649, Chapter 16 
genetic mutations in, 642f, Chapter 16 
immunotherapy for, 655, Chapter 16 
new cases and deaths in US in 2015, 629f, 
Chapter 16 
protein-inhibiting drugs for, 657, 657£, 658, 
Chapter 16 
tumor-suppressor genes, 640t, Chapter 16 
ultraviolet radiation and, 535, 536-537HP, 631, 
Chapter 13, Chapter 16 
Metabolism, 103-105, Chapter 3 
capture and utilization of energy, 103-105, 
Chapter 3 
defined, 103, Chapter 3 
enzymes of, 6, 29EP, Chapter 1 
inborn errors of, 405, Chapter 11 
oxidation-reduction (redox) reactions, 103, 
Chapter 3 
photosynthetic, 202-203, Chapter 6 
stages of, 103, 104f, Chapter 3 
Metastasis, 628, 628f, 637, 645, Chapter 16 
cell adhesion in, 241-243HP, 243HPf, 
Chapter 7 
cell-surface properties of, 242HP, 
Chapter 7 
defined, 242HP, Chapter 7 
gap Junctions in, 251-253EP, Chapter 7 
miRNAs in, 649, Chapter 16 
oncogenes in, 638, 639, Chapter 16 
spread of, steps in, 243HPf, Chapter 7 
Methicillin resistant Staph aureus (MRSA), 
101, Chapter 3. 
Microbiome, 14-15, Chapter 1 
Mitochondrial diseases, 195-196HP, 196HPf, 
Chapter 5 
Multiple sclerosis (MS), 162, 669HP, Chapter 4, 
Chapter 17 
Muscle fiber, 346, 346f, Chapter 9 


Muscular dystrophy, 139, 338, 353, 449, 461, 
Chapter 4, Chapter 9, Chapter 11, 
Chapter 12 
Mutagenic agents, 371-372, Chapter 10 
Mutations, 370, Chapter 10 
altering structure of signaling proteins, 594HP, 
Chapter 15 
in DNA, 27EP, Chapter 1 
gain-of-function, 388HP, Chapter 10 
genetic 
in breast cancer, 640, 642f, 644, Chapter 16 
colon cancer in, 642f, 645, 650, 650f, 658, 
Chapter 16 
in leukemias, 644, Chapter 16 
in melanoma, 642f, Chapter 16 
in prostate cancer, 640, 642f, 644, Chapter 16 
in inherited disorders, 62, 64HP, 66HP, 67HP, 
Chapter 2 
in iPS cells, 20HP, Chapter 1 
J. D., 299EP, Chapter 8 
molecular structure of genes and, 5, Chapter 1 
myosin, deafness and, 344, Chapter 9 
nonsense, 448, Chapter 11 
somatic, 594HP, Chapter 15 
spontaneous rate of, 524, Chapter 13 
viral, 24, Chapter 1 


Nerve cells (neurons) 
function of, 159, Chapter 4 
postsynaptic, 148EPn, 164, 165, Chapter 4 
presynaptic, 163, 163f, 164, 165, Chapter 4 
structure of, 159f, Chapter 4 
synaptic transmission, 162-165, 163f, 164f, 
Chapter 4 
Nerve gas, 98, 114, 115, 165, Chapter 3, 
Chapter 4 
Nervous system disorders, 195-197HP, 338, 
387-389HP, 669HP, 671HP, Chapter 5, 
Chapter 9, Chapter 10, Chapter 16 
Neurofibrillary tangles (NFTs), 64HPf, 67HP, 
Chapter 2 
Nicotine addiction, 148EP, Chapter 4 
Niemann-Pick type C disease, 287HPt, 288HP, 
302, Chapter 8 


Non-Hodgkin's B-cell lymphoma, 655, Chapter 16 


Nonself, distinguishing from self, 686-688, 
Chapter 17 

Nonsteroidal anti-inflammatory drugs 
(NSAIDs), 631, Chapter 16 


Ovarian cancer, 642f, 645, 655, 658, Chapter 16 


Pap smear, 637, 637f, 660, Chapter 16 
Parkinson's disease (PD), 195-196HP, Chapter 5 
Periodontal disease, 243HP, Chapter 7 
Peroxisomal diseases, 196-197HP, Chapter 7 
Polycystic kidney diseases (PKD), 334HP, 
334HPf, Chapter 9 
Prader-Willi syndrome (PWS), 501, Chapter 12 
Precocious puberty, 594HPt, 595HP, Chapter 15 
Pregnancy, IgG-based immunity, 680-681, 
Chapter 17 
Prilosec, 154, Chapter 4 
Prions, 64HP, 249, Chapter 2, Chapter 7 
Prostate cancer 
cause of, 632, Chapter 16 
genetics and, 636, Chapter 16 
immunotherapy, 655, Chapter 16 
new cases and deaths in US in 2015, 629f, 
Chapter 16 
protein-inhibiting drugs for, 657, Chapter 16 


PSA test for, 72, Chapter 2 

RB mutations, 640, Chapter 16 

research efforts, 628, Chapter 16 

TP53 mutations in, 642f, 644, Chapter 16 
Prozac, 165, Chapter 4 


Radiation, as carcinogen, 372, 384, 534, 535, 
536HB, 537HP, 550-551, Chapter 10, 
Chapter 13, Chapter 14 

Retinoblastoma, 640-641, 641f, Chapter 16 

Retroviruses, 393, 607, 631, 732, Chapter 10, 
Chapter 15, Chapter 16, Chapter 18 

Rheumatoid arthritis, 669HP, Chapter 17 

Rheumatoid arthritis, 669HP, 670HP, 671 HP, 
740, Chapter 11, Chapter 18 

RNA interference (RNAi), 265, 265f, 430-432, 
430f, 736-737, 737f, Chapter 8, Chapter 11, 
Chapter 18 

clinical applications of, 432-433HP 


Sandhoff ’s disease, 287HPt, Chapter 8 
Scurvy, 226, Chapter 1 
Self, distinguishing from nonself, 686-688, 
Chapter 17 
Sex chromosomes, 382, 469, 577—578HP, 
Chapter 10, Chapter 12, Chapter 14 
Sexual arousal, 620, Chapter 15 
Sickle cell anemia, 53, 53f, 437, Chapter 2, 
Chapter 11 
Skin 
blistering diseases, 237, 245, 338, 671, 
Chapter 7, Chapter 9, Chapter 17 
cancer (see melanoma) 
grafts, 684, Chapter 17 
tight junctions, 244f, 245-247, 246f, 247f, 
Chapter 7 
Smell (olfaction), 603, Chapter 15 
Smoking, 148EPfn, 400HP, 631, Chapter 4, 
Chapter 10, Chapter 16 
Snake venom, 233, Chapter 7 
Speech and language disorders, 397, 
Chapter 10 
Sphingolipid storage diseases, 287HP, 287HPt, 
Chapter 8 
Spongiform encephalopathy, 63HP, Chapter 2 
Statin drugs, 302, Chapter 8 
Stem cells 
adult, 18HP, 18HPf, Chapter 1 
cancer, 658-659, Chapter 16 
defined, 18HP, Chapter 1 
embryonic, 18-19HP, 19HPf, Chapter 1 
hematopoietic, 665, 666f, 671HP, Chapter 17 
induced pluripotent, 19-21 HP, 20HPf, 
Chapter 1 
mesenchymal, 18HP, Chapter 1 
Stroke, 233, 241HP, Chapter 7 
Sulfatide lipidosis, 287HPt, Chapter 8 
Systemic lupus erythematosus (SLE), 669HP, 
669HPf, Chapter 17 


Taste (gustation), 603, Chapter 15 

Tay-Sachs disease, 287HPt, 288HP, Chapter 8 

Testosterone, 47, 47f, 632, Chapter 2, 
Chapter 15 

Thymus, 231f, 662, 662f, 665, 666f, 669HP, 
671HP, 686, 688, 688f, Chapter 7, 
Chapter 17 

Tolerance, immunologic, 668, 670HP, 678, 
Chapter 17 

Trans fats, 47, Chapter 2 

Transplant rejection, 684, 688, Chapter 17 


Treatment of, 670-671HP, Chapter 17 

Tuberculosis, 303, Chapter 3 

Tumor necrosis factor (TNF), 622, 623f, 676, 
Chapter 15 

Type 1 diabetes (T1D), 669HP, Chapter 17 

Ultraviolet (UV) radiation, DNA damage from, 
535, 536-537HP, Chapter 13 


Vaccination, 671, Chapter 17 
Vancomycin, 102, Chapter 3 
Viagra, 620, Chapter 15 
Viruses, 23-25, Chapter 1 
benefits of, 25, Chapter 1 


capsids of, 23, 24f, Chapter 1 
DNA replication in, 526, Chapter 13 
host-cell specificity of, change in, 24, Chapter 1 
host range of, 24, Chapter 1 
innate immune response to, 663-665, 665f, 
Chapter 17 
oncogenes and, 632-635EP, Chapter 16 
properties of, 23, Chapter 1 
protein on surface of, 23-24, 24f, 
Chapter 1 
resistance to, interferon and, 676, Chapter 17 
treatment with RNAi, 432-433HP, 
Chapter 11 


viral infections, types of, 24-25, 25f, Chapter 1 
Vision, 601, 603, Chapter 15 
Vitamin C deficiency, 226, Chapter 7 


Whooping cough, 399, Chapter 19 


X chromosome inactivation, 470, 
Chapter 12 

Xeroderma pigmentosum (XP), 535, 536HP, 
Chapter 13 


Zellweger syndrome (ZS), 196-197HP, 
Chapter 5 
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Introduction to the Study 
of Cell and Molecular Biology 


WE ARE CELLS 
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W. are made of cells. Cells make up our skin, our 
organs, and our muscles. The brain, the seat of our 
thoughts and desires, is made of cells. Our blood 
vessels teem with cells. Fertilization is no more or less 
than a joining of two separate cells to produce a single 
new cell, which then multiplies to produce the embryo. 
When we grow from a tiny embryo into a large adult, we 
do so by adding more and more cells. When we get sick, 
it is often because our cells have run amok. And when 
we grow old, it is because our cells gradually give up the 
ghost. After we die and are buried, soon the only 
remnants of our existence are bones, teeth, and hair, 
structures that were sculpted in life by the ceaseless 
activity of cells. Many medicines work by changing how 
cells behave, and in recent years cells themselves are 
being used as medicines to cure sick people. Because all 
living things are made of one or more cells, the origin of 
life corresponds to the origin of cells. Starting with this 
chapter, we will explore what cells are and how they work, 
themes that will be expanded throughout this book. 
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Diagram of nerve cells from the cat brain, hand-drawn by Santiago Ramon 
y Cajal. Ramén y Cajal was the first to recognize that the brain is made 
up of huge numbers of individual cells, rather than a continuous 
connected network as proposed by his competitor, Camillo Golgi. Ramon 
y Cajal and Golgi fought a protracted battle over this point, but 
eventually the meticulous detail of Ramon y Cajal’s work convinced the 
world that the brain is indeed a collective of individual cells. 

Source: Histology of the Nervous System of Man and Vertebrates by 
Cajal (1995) Fig. “Neurons in the cat brain.” By permission of Oxford 
University Press. 
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1.1 The Discovery of Cells 


Cells, and the structures they comprise, are too small to be directly 
seen, heard, or touched. In spite of this tremendous handicap, cells 
are the subject of hundreds of thousands of publications each year, 
with virtually every aspect of their minuscule structure coming under 
scrutiny. In many ways, the study of cell and molecular biology stands 
as a tribute to human curiosity for seeking to discover and to human 
creative intelligence for devising the complex instruments and elabo- 
rate techniques by which these discoveries can be made. This is not to 
imply that cell and molecular biologists have a monopoly on these 
noble traits. At one end of the scientific spectrum, astronomers are 
utilizing an orbiting telescope to capture images of primordial galax- 
ies that are so far from Earth they appear to us today as they existed 
more than 13 billion years ago, only a few hundred million years after 
the Big Bang. At the other end of the spectrum, nuclear physicists 
have recently forced protons to collide with one another at velocities 
approaching the speed of light, confirming the existence of a hypoth- 
esized particle—the Higgs boson—that is proposed to endow all 
other subatomic particles with mass. Clearly, our universe consists of 
worlds within worlds, all aspects of which make for fascinating study. 

As will be apparent throughout this book, cell and molecular 
biology is reductionist; that is, it is based on the view that knowledge 
of the parts of the whole can explain the character of the whole. 
When viewed in this way, our feeling for the wonder and mystery of 
life may be replaced by the need to explain everything in terms of the 
workings of the “machinery” of the living system. To the degree to 
which this occurs, it is hoped that this loss can be replaced by an 
equally strong appreciation for the beauty and complexity of the 
mechanisms underlying cellular activity. 


Microscopy 


Because of their small size, cells can only be observed with the aid of a 
microscope, an instrument that provides a magnified image of a tiny 
object. We do not know when humans first discovered the remarkable 
ability of curved-glass surfaces to bend light and form images. 
Spectacles were first made in Europe in the thirteenth century, and the 
first compound (double-lens) light microscopes were constructed by 
the end of the sixteenth century. By the mid-1600s, a handful of pio- 
neering scientists had used their handmade microscopes to uncover a 
world that would never have been revealed to the naked eye. The dis- 
covery of cells (FIGURE 1.1a) is generally credited to Robert Hooke, an 
English microscopist who, at age 27, was awarded the position of cura- 
tor of the Royal Society of London, England's foremost scientific acad- 
emy. One of the many questions Hooke attempted to answer was why 
stoppers made of cork (part of the bark of trees) were so well suited to 
holding air in a bottle. As he wrote in 1665: “I took a good clear piece 
of cork, and with a Pen-knife sharpend as keen as a Razor, I cut a piece 
of it off, and... . then examining it with a Microscope, me thought I 
could perceive it to appear a little porous . .. much like a Honeycomb,” 
Hooke called the pores cells because they reminded him of the cells 
inhabited by monks living in a monastery. In actual fact, Hooke had 
observed the empty cell walls of dead plant tissue, walls that had origi- 
nally been produced by the living cells they surrounded. 

Meanwhile, Antonie van Leeuwenhoek, a Dutchman who earned 
a living selling clothes and buttons, was spending his spare time 
grinding lenses and constructing simple microscopes of remarkable 
quality (Figure 1.1b). For 50 years, Leeuwenhoek sent letters to the Royal 
Society of London describing his microscopic observations—along 


(a) 


(b) 


FIGURE 1.1 The discovery of cells. (a) One of Robert Hooke’s more 
ornate compound (double-lens) microscopes. (Inset) Hooke’s drawing of a 
thin slice of cork, showing the honeycomb-like network of “cells.” 

(b) Single-lens microscope used by Antonie van Leeuwenhoek to observe 
bacteria and other microorganisms. The biconvex lens, which was capable 
of magnifying an object approximately 270 times and providing a resolution 
of approximately 1.35 um, was held between two metal plates. 

Source: (a) The Granger Collection, New York; inset Biophoto Associates/ 
Getty Images, Inc.; (6) © Bettmann/Corbis 


with a rambling discourse on his daily habits and the state of his 
health. Leeuwenhoek was the first to examine a drop of pond water 
under the microscope and, to his amazement, observe the teeming 
microscopic “animalcules” that darted back and forth before his 
eyes. He was also the first to describe various forms of bacteria, 
which he obtained from water in which pepper had been soaked and 
from scrapings from his teeth. His initial letters to the Royal Society 
describing this previously unseen world were met with such skepti- 
cism that the society dispatched its curator, Robert Hooke, to con- 
firm the observations. Hooke did just that, and Leeuwenhoek was 
soon a worldwide celebrity, receiving visits in Holland from Peter 
the Great of Russia and the queen of England. 


Cell Theory 


It wasn’t until the 1830s that the widespread importance of cells was 
realized. In 1838, Matthias Schleiden, a German lawyer turned bota- 
nist, concluded that, despite differences in the structure of various 


tissues, plants were made of cells and that the plant embryo arose 
from a single cell. In 1839, Theodor Schwann, a German zoologist 
and colleague of Schleiden’s, published a comprehensive report on 
the cellular basis of animal life. Schwann concluded that the cells of 
plants and animals are similar structures and proposed these two 
tenets of the cell theory: 


e All organisms are composed of one or more cells. 


e The cell is the structural unit of life. 


Schleiden and Schwann’s ideas on the origin of cells proved to be less 
insightful; both agreed that cells could arise from noncellular mate- 
rials. Given the prominence that these two scientists held in the sci- 
entific world, it took a number of years before observations by other 
biologists were accepted as demonstrating that cells did not arise in 
this manner any more than organisms arose by spontaneous genera- 
tion. By 1855, Rudolf Virchow, a German pathologist, had made a 
convincing case for the third tenet of the cell theory: 


e Cells can arise only by division from a preexisting cell. 


REVIEW | 
1. When Robert Hooke first described cells, what was he 
actually looking at? 


2. What are the three componnents of cell theory? 
eo. OOOO 


1.2 Basic Properties of Cells 


Just as plants and animals are alive, so too are cells. Life, in fact, is the 
most basic property of cells, and cells are the smallest units to exhibit 
this property. Unlike the parts of a cell, which simply deteriorate 
if isolated, whole cells can be removed from a plant or animal and 
cultured in a laboratory where they will grow and reproduce for 
extended periods of time. If mistreated, they may die. Death can also 
be considered one of the most basic properties of life, because only a 
living entity faces this prospect. Remarkably, cells within the body 
generally die “by their own hand”—the victims of an internal program 
that causes cells that are no longer needed or cells that pose a risk of 
becoming cancerous to eliminate themselves. 

The first culture of human cells was begun by George and 
Martha Gey of Johns Hopkins University in 1951. The cells were 
obtained from a malignant tumor and named HeLa cells after the 
donor, Henrietta Lacks. HeLa cells—descended by cell division from 
this first cell sample—are still being grown in laboratories around 
the world today (FIGURE 1.2). Because they are so much simpler to 
study than cells situated within the body, cells grown in vitro (i.e., in 
culture, outside the body) have become an essential tool of cell and 
molecular biologists. In fact, much of the information that will be 
discussed in this book has been obtained using cells grown in 
laboratory cultures. 

We will begin our exploration of cells by examining a few of 
their most fundamental properties. 


Cells Are Highly Complex and Organized 


Complexity is a property that is evident when encountered, but dif- 
ficult to describe. For the present, we can think of complexity in 
terms of order and consistency. The more complex a structure, the 


FIGURE 1.2 HeLa cells, such as the ones pictured here, were the first 
human cells to be kept in culture for long periods of time and are still in use 
today. Unlike normal cells, which have a finite lifetime in culture, these 
cancerous HeLa cells can be cultured indefinitely as long as conditions are 
favorable to support cell growth and division. 


Source: Torsten Wittmann/Photo Researchers, Inc. 


greater the number of parts that must be in their proper place, the 
less tolerance for errors in the nature and interactions of the parts, 
and the more regulation or control that must be exerted to maintain 
the system. Cellular activities can be remarkably precise. DNA dupli- 
cation, for example, occurs with an error rate of less than one mistake 
every ten million nucleotides incorporated—and most of these are 
quickly corrected by an elaborate repair mechanism that recognizes 
the defect. 

During the course of this book, we will have occasion to con- 
sider the complexity of life at several different levels. We will discuss 
the organization of atoms into small-sized molecules; the organiza- 
tion of these molecules into giant polymers; and the organization of 
different types of polymeric molecules into complexes, which in 
turn are organized into subcellular organelles and finally into cells. 
As will be apparent, there is a great deal of consistency at every level. 
Each type of cell has a consistent appearance when viewed under a 
high-powered electron microscope; that is, its organelles have a par- 
ticular shape and location, from one individual of a species to 
another. Similarly, each type of organelle has a consistent composi- 
tion of macromolecules, which are arranged in a predictable pattern. 
Consider the cells lining your intestine that are responsible for 
removing nutrients from your digestive tract. FIGURE 1.3 illustrates 
the many different levels of organization present in such a tissue. 

The epithelial cells that line the intestine are tightly connected 
to each other like bricks in a wall (Figure 1.3 inset 1). The apical ends 
of these cells, which face the intestinal channel, have long processes 
(microvilli) that facilitate absorption of nutrients (inset 2). The 
microvilli are able to project outward from the apical cell surface 
because they contain an internal skeleton made of filaments, which 
in turn are composed of protein (actin) monomers polymerized in a 
characteristic array (inset 3). At their basal ends, intestinal cells have 
large numbers of mitochondria (inset 4) that provide the energy 
required to fuel various membrane transport processes. Each mito- 
chondrion is composed of a defined pattern of internal membranes, 
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Villus of the small intestinal wall 
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FIGURE 1.3 Levels of cellular and molecular organization. The brightly colored photograph of a stained section shows the microscopic structure of a 
villus of the wall of the small intestine, as seen through the light microscope. Inset 1 shows an electron micrograph of the epithelial layer of cells that lines 
the inner intestinal wall. The apical surface of each cell, which faces the channel of the intestine, contains a large number of microvilli involved in nutrient 
absorption. The basal region of each cell contains large numbers of mitochondria, where energy is made available to the cell. Inset 2 shows the apical 
microvilli; each microvillus contains a bundle of actin filaments. Inset 3 shows the actin protein subunits that make up each filament. Inset 4 shows an 
individual mitochondrion similar to those found in the basal region of the epithelial cells. Inset 5 shows a portion of an inner membrane of a mitochondrion 
including the stalked particles that project from the membrane and correspond to the sites where ATP is synthesized. Insets 6 and 7 show molecular models 
of the ATP- synthesizing machinery, which is discussed at length in Chapter 5. 


Source: Light micrograph Cecil Fox/Photo Researchers; inset 1 courtesy of Shakti P. Kapur, Georgetown University Medical Center; inset 2 from Mark S. 
Mooseker and Lewis G. Tilney, J. Cell Biol. 67:729, 1975, reproduced with permission of the Rockefeller University Press; inset 3 courtesy of Kenneth C. 
Holmes; inset 4 Keith R. Porter/Photo Researchers; inset 5 courtesy of Humberto Fernandez-Moran; inset 6 courtesy of Roderick A. Capaldi; inset 7 courtesy 
of Wolfgang Junge, Holger Lill, and Siegfried Engelbrecht, University of Osnabriick, Germany. 


which in turn are composed of a consistent array of proteins, includ- 
ing an electrically powered ATP-synthesizing machine that projects 
from the inner membrane like a ball on a stick (insets 5-7). 

One of the truly fascinating aspects of cells is that they achieve 
organization at many different scales using physical processes that 
are essentially random. Even though living cells are highly complex 
and ordered, it has become increasingly evident in recent years that 
random (stochastic) events play a critical role in all cellular activities. 
Many of the molecules within living cells are in a constant state of 
random movement, propelled by thermal energy they acquire from 
their environment. Cells have evolved the capacity to utilize this 
movement in highly directed ways. We can consider one example of 
this phenomenon, keeping in mind that many other cases could be 
described. Proteins are complex molecules often consisting of hun- 
dreds of amino acid building blocks and attaining molecular masses 
over a hundred thousand Daltons. Despite their huge size, proteins 
consist of a polypeptide chain that has to fold into a precisely defined 
three-dimensional (native) structure. If it fails to fold properly, the 
protein will lack meaningful function. In 1969, Cyrus Levinthal of 
Columbia University identified certain features of this folding pro- 
cess that became known as Levinthal’s paradox. For one part of the 
paradox, Levinthal noted that, if protein folding depended solely on 
random molecular movements, it would require a period of time 
greater than the age of the universe for a protein to fold into its native 
structure. According to this scenario, the time it would take for a 
protein to fold properly might be compared to the period required 
for a monkey sitting at a piano to compose one of Beethoven's con- 
certos. The paradox inherent in protein folding becomes evident 
knowing that, despite their enormous complexity, proteins actually 
acquire their native structures within fractions of a second. How is 
the paradox resolved? Even though folding of a protein is driven by 
random thermal motion, the process occurs in stepwise fashion so 
that the protein folds along pathways in which less structured inter- 
mediates guide the formation of better formed subsequent interme- 
diates. In other words, the folding pathway allows proteins to rapidly 
“jump” from one step to the next until the native structure is reached. 
To carry over the solution of the protein folding paradox to the mon- 
key at the piano, it would be as if every time the monkey tapped an 
appropriate key, that note would be recorded, allowing the monkey 
to move toward the next note in the concerto. As long as the monkey 
was an active player, the composition of the concerto could be 
accomplished quite rapidly. It can be said that these types of events 
are “biased” They depend upon random activities, but they lead to 
directed outcomes because they select for intermediate stages that lie 
on the path leading to the desired outcome. 

Fortunately for cell and molecular biologists, evolution has 
moved rather slowly at the levels of biological organization with 
which they are concerned. Whereas a human and a cat, for example, 
have very different anatomical features, the cells that make up their 
tissues, and the organelles that make up their cells, are very similar. 
The actin filament portrayed in Figure 1.3, inset 3, and the ATP- 
synthesizing enzyme of inset 6 are virtually identical to similar 
structures found in such diverse organisms as humans, snails, yeast, 
and redwood trees. Information obtained by studying cells from one 
type of organism often has direct application to other forms of life. 
Many of the most basic processes, such as the synthesis of proteins, 
the conservation of chemical energy, or the construction of a 
membrane, are remarkably similar in all living organisms. 


Cells Possess a Genetic Program 
and the Means to Use It 


Organisms are built according to information encoded in a collec- 
tion of genes, which are constructed of DNA. The human genetic 
program contains enough information, if converted to words, to fill 
millions of pages of text. Remarkably, this vast amount of informa- 
tion is packaged into a set of chromosomes that occupies the space 
of a cell nucleus—hundreds of times smaller than the dot on this i. 

Genes are more than storage lockers for information: They con- 
stitute the recipes for constructing cellular structures, the directions 
for running cellular activities, and the program for making more of 
themselves. The molecular structure of genes allows for changes in 
genetic information (mutations) that lead to variation among indi- 
viduals, which forms the basis of biological evolution. Discovering 
the mechanisms by which cells use and transmit their genetic infor- 
mation has been one of the greatest achievements of science in recent 
decades. 


Cells Are Capable of Producing 
More of Themselves 


Just as individual organisms are generated by reproduction, so too are 
individual cells. Cells reproduce by division, a process in which the con- 
tents of a “mother” cell are distributed into two “daughter” cells. Prior to 
division, the genetic material is faithfully duplicated, and each daughter 
cell receives a complete and equal share of genetic information. In most 
cases, the two daughter cells have approximately equal volume. In some 
cases, however, as occurs when a human oocyte undergoes division, 
one of the cells can retain nearly all of the cytoplasm, even though it 
receives only half of the genetic material (FIGURE 1.4). 


Cells Acquire and Utilize Energy 


Every biological process requires the input of energy. Virtually all of 
the energy utilized by life on the Earth’s surface arrives in the form of 
electromagnetic radiation from the sun. The energy of light is trapped 
by light-absorbing pigments present in the membranes of photosyn- 
thetic cells (FIGURE 1.5). Light energy is converted by photosynthesis 


20 um 


FIGURE 1.4 Cell reproduction. This mammalian oocyte has recently 
undergone a highly unequal cell division in which most of the cytoplasm 
has been retained within the large oocyte, which has the potential to be 
fertilized and develop into an embryo. The other cell is a nonfunctional 
remnant that consists almost totally of nuclear material (indicated by the 
blue-staining chromosomes, arrow). 


Source: Courtesy of Jonathan van Blerkom. 
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FIGURE 1.5 Acquiring energy. A living cell of the filamentous alga 
Spirogyra. The ribbon-like chloroplast, which is seen to zigzag through the 
cell, is the site where energy from sunlight is captured and converted to 
chemical energy during photosynthesis. 

Source: M. |. Walker/Photo Researchers, Inc. 


into chemical energy that is stored in energy-rich carbohydrates, 
such as sucrose or starch. For most animal cells, energy arrives pre- 
packaged, often in the form of the sugar glucose. In humans, glucose 
is released by the liver into the blood where it circulates through the 
body delivering chemical energy to all the cells. Once in a cell, the 
glucose is disassembled in such a way that its energy content can be 
stored in a readily available form (usually as ATP) that is later put to 
use in running all of the cell's myriad energy-requiring activities. 
Cells expend an enormous amount of energy simply breaking down 
and rebuilding the macromolecules and organelles of which they are 
made. This continual “turnover,” as it is called, maintains the integ- 
rity of cell components in the face of inevitable wear and tear and 
enables the cell to respond rapidly to changing conditions. 


Cells Carry Out a Variety 
of Chemical Reactions 


Cells function like miniaturized chemical plants. Even the simplest 
bacterial cell is capable of hundreds of different chemical transfor- 
mations, none of which occurs at any significant rate in the inani- 
mate world. Virtually all chemical changes that take place in cells 
require enzymes—molecules that greatly increase the rate at which a 
chemical reaction occurs. The sum total of the chemical reactions in 
a cell represents that cell’s metabolism. 


Cells Engage in Mechanical Activities 


Cells are sites of bustling activity. Materials are transported from place 
to place, structures are assembled and then rapidly disassembled, and, 
in many cases, the entire cell moves itself from one site to another. 
These types of activities are based on dynamic, mechanical changes 
within cells, many of which are initiated by changes in the shape of 
“motor” proteins. Motor proteins are just one of many types of molec- 
ular “machines” employed by cells to carry out mechanical activities. 


Cells Are Able to Respond to Stimuli 


Some cells respond to stimuli in obvious ways; a single-celled protist, 
for example, moves away from an object in its path or moves toward 
a source of nutrients. Cells within a multicellular plant or animal 
respond to stimuli less obviously. Most cells are covered with recep- 
tors that interact with substances in the environment in highly spe- 
cific ways. Cells possess receptors to hormones, growth factors, and 


extracellular materials, as well as to substances on the surfaces of 
other cells. A cell's receptors provide pathways through which exter- 
nal stimuli can evoke specific responses in target cells. Cells may 
respond to specific stimuli by altering their metabolic activities, 
moving from one place to another, or even committing suicide. 


Cells Are Capable of Self-Regulation 


In recent years, a new term has been used to describe cells: robustness. 
Cells are robust, that is, hearty or durable, because they are protected 
from dangerous fluctuations in composition and behavior. Should 
such fluctuations occur, specific feedback circuits are activated that 
serve to return the cell to the appropriate state. In addition to requir- 
ing energy, maintaining a complex, ordered state requires constant 
regulation. The importance ofa cell’s regulatory mechanisms becomes 
most evident when they break down. For example, failure of a cell to 
correct a mistake when it duplicates its DNA may result in a debilitat- 
ing mutation, or a breakdown in a cell's growth-control safeguards 
can transform the cell into a cancer cell with the capability of destroy- 
ing the entire organism. We are gradually learning how a cell controls 
its activities, but much more is left to discover. 

Consider the following experiment conducted in 1891 by Hans 
Driesch, a German embryologist. Driesch found that he could com- 
pletely separate the first two or four cells of a sea urchin embryo and 
each of the isolated cells would proceed to develop into a normal 
embryo (FIGURE 1.6). How can a cell that is normally destined to 


Experimental result 


Normal development 


FIGURE 1.6 Self-regulation. The left panel depicts the normal develop- 
ment of a sea urchin in which a fertilized egg gives rise to a single embryo. 
The right panel depicts an experiment in which the cells of an early embryo 
are separated from one another after the first division, and each cell is 
allowed to develop in isolation. Rather than developing into half of an 
embryo, as it would if left undisturbed, each isolated cell recognizes the 
absence of its neighbor, regulating its development to form a complete 
(although smaller) embryo. 


form only part of an embryo regulate its own activities and form an 
entire embryo? How does the isolated cell recognize the absence of 
its neighbors, and how does this recognition redirect the entire 
course of the cell’s development? How can a part of an embryo have 
a sense of the whole? We are not able to answer these questions much 
better today than we were more than a hundred years ago when the 
experiment was performed. 

Throughout this book we will be discussing processes that 
require a series of ordered steps, much like the assembly-line con- 
struction of an automobile in which workers add, remove, or make 
specific adjustments as the car moves along. In the cell, the informa- 
tion for product design resides in the nucleic acids, and the con- 
struction workers are primarily proteins. It is the presence of these 
two types of macromolecules that, more than any other factor, sets 
the chemistry of the cell apart from that of the nonliving world. In the 
cell, the workers must act without the benefit of conscious direction. 
Each step of a process must occur spontaneously in such a way that 
the next step is automatically triggered. In many ways, cells operate 
in a manner analogous to the orange-squeezing contraption discov- 
ered by “The Professor” and shown in FIGURE 1.7. Each type of cel- 
lular activity requires a unique set of highly complex molecular tools 
and machines—the products of eons of natural selection and bio- 
logical evolution. A primary goal of biologists is to understand the 
molecular structure and role of each component involved in a par- 
ticular activity, the means by which these components interact, and 
the mechanisms by which these interactions are regulated. 


Cells Evolve 


How did cells arise? Of all the major questions posed by biologists, 
this question may be the least likely ever to be answered. It is pre- 
sumed that cells evolved from some type of precellular life form, 
which in turn evolved from nonliving organic materials that were 


- “Orange Juice Squeezing Machine 


present in the primordial seas. Whereas the origin of cells is shrouded 
in near-total mystery, the evolution of cells can be studied by exam- 
ining organisms that are alive today. If you were to observe the fea- 
tures of a bacterial cell living in the human intestinal tract (see 
FIGURE 1.18a) and a cell that is part of the lining of that tract 
(Figure 1.3), you would be struck by the differences between the two 
cells. Yet both of these cells, as well as all other cells that are present 
in living organisms, share many features, including a common 
genetic code, a plasma membrane, and ribosomes. According to one 
of the tenets of modern biology, all living organisms have evolved 
from a single, common ancestral cell that lived more than three bil- 
lion years ago. Because it gave rise to all the living organisms that we 
know of, this ancient cell is often referred to as the last universal com- 
mon ancestor (or LUCA). We will examine some of the events that 
occurred during the evolution of cells in the Experimental Pathway 
at the end of the chapter. Future chapters will explore biochemical 
aspects of the origin of life. Keep in mind that evolution is not simply 
an event of the past, but an ongoing process that continues to modify 
the properties of cells that will be present in organisms that have yet 
to appear. For example, evolution of drug resistance in bacteria is a 
major health concern and will be discussed in Section 3.8. 


REVIEW | 


1. List the fundamental properties shared by all cells. 
Describe the importance of each of these properties. 


2. Describe the features of cells that suggest that all 
living organisms are derived from a common ancestor. 


3. What is the source of energy that supports life on 
Earth? How is this energy passed from one organism 
to the next? 
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Professor Butts steps into an open elevator shaft 
and when he lands at the bottom he finds a sim- 
ple orange squeezing machine. Milkman takes 
empty milk bottle (A), pulling string (B) which 
causes sword (C) to sever cord (D) and allow 
guillotine blade (E) to drop and cut rope (F) which 
releases battering ram (G). Ram bumps against 
open door (H), causing it to close. Grass sickle (I) 
cuts a slice off end of orange (J)—at the same 
time spike (K) stabs “prune hawk” (L) he opens 


his mouth to yell in agony, thereby releasing 
prune and allowing diver’s boot (M) to drop and 
step on sleeping octopus (N). Octopus awakens 
in a rage and, seeing diver's face which is painted 
on orange, attacks it and crushes it with tenta- 
cles, thereby causing all the juice in the orange to 
run into glass (O). 

Later on you can use the log to build a log 
cabin where you can raise your son to be Presi- 
dent like Abraham Lincoln. 


FIGURE 1.7 Cellular activities are often analogous to this 
“Rube Goldberg machine” in which one event “automatically” 
triggers the next event in a reaction sequence. 


Source: Rube Goldberg is the ® and © of Rube Goldberg, Inc. 
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1.3 Ch aracteristics Th at cells include bacteria, whereas the structurally more complex eukaryotic 


cells include protists, fungi, plants, and animals.' 


D istinguish Pro ka ryotic We are not sure when prokaryotic cells first appeared on Earth. 


and Eukaryotic Cells Evidence of prokaryotic life has been obtained from rocks approxi- 
mately 2.7 billion years of age. Not only do these rocks contain what 
Once the electron microscope became widely available, biologists were appear to be fossilized microbes, they contain complex organic mol- 
able to examine the internal structure of a wide variety of cells. It ecules that are characteristic of particular types of prokaryotic organ- 
became apparent from these studies that there were two basic classes of isms, including cyanobacteria. It is unlikely that such molecules could 
cells—prokaryotic and eukaryotic—distinguished by their size and the have been synthesized abiotically, that is, without the involvement of 
types of internal structures, or organelles, they contain (FIGURE 1.8). _ living cells. Cyanobacteria almost certainly appeared by 2.4 billion 
The existence of two distinct classes of cells, without any known inter- 
mediates, r epresents one of the most fundamental evolutionar y divi- 'Those interested in examining a proposal to do away with the concept of prokaryotic 
sions in the biological world. The structurally simpler prokaryotic versus eukaryotic organisms can read a brief essay by N. R. Pace in Nature 441:289, 2006. 
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FIGURE 1.8 The structure of cells. Schematic diagrams of a “generalized” bacterial (a), plant (b), and animal (c) cell. Note: Organelles are not drawn 
to scale. 
Source: From D. J. Des Marais, Science 289:1704, 2001. Copyright © 2000. Reprinted with permission from AAAS. 
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FIGURE 1.8 (continued) 


years ago, because that is when the atmosphere became infused with 
molecular oxygen (O2), which is a by-product of the photosynthetic 
activity of these prokaryotes. The dawn of the age of eukaryotic cells 
is also shrouded in uncertainty. Complex multicellular animals 
appear rather suddenly in the fossil record approximately 600 million 
years ago, but there is considerable evidence that simpler eukaryotic 
organisms were present on Earth more than one billion years earlier. 
The estimated time of appearance on Earth of several major groups of 
organisms is depicted in FIGURE 1.9. Even a superficial examination 
of Figure 1.9 reveals how “quickly” life arose following the formation 
of Earth and cooling of its surface and how long it took for the subse- 
quent evolution of complex animals and plants. 

The following brief comparison between prokaryotic and eukary- 
otic cells reveals many basic differences between the two types, as well 
as many similarities (see Figure 1.8). The similarities and differences 
between the two types of cells are listed in Table 1.1. The shared prop- 
erties reflect the fact that eukaryotic cells almost certainly evolved 
from prokaryotic ancestors. Because of their common ancestry, both 
types of cells share an identical genetic language, a common set of 
metabolic pathways, and many common structural features. For exam- 
ple, both types of cells are bounded by plasma membranes of similar 
construction that serve as a selectively permeable barrier between the 
living and nonliving worlds. Both types of cells may be surrounded by 
a rigid, nonliving cell wall that protects the delicate life form within. 
Although the cell walls of prokaryotes and eukaryotes may have simi- 
lar functions, their chemical composition is very different. 

Internally, eukaryotic cells are much more complex—both 
structurally and functionally—than prokaryotic cells (Figure 1.8). 
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FIGURE 1.9 Earth's biogeologic clock. A portrait of the past five billion 
years of Earth’s history showing a proposed time of appearance of major 
groups of organisms. Complex animals (shelly invertebrates) and vascular 
plants are relatively recent arrivals. The time indicated for the origin of life is 
speculative. In addition, photosynthetic bacteria may have arisen much 
earlier, hence the question mark. The geologic eras are indicated in the 
center of the illustration. 

Source: From D. J. Des Marais, Science 289:1704, 2001. Copyright © 2001. 
Reprinted with permission from AAAS. 
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10 TABLE 1.1 A Comparison of Prokaryotic and Eukaryotic Cells 


Features held in common by the two types of cells: 


Plasma membrane of similar construction 
Genetic information encoded in DNA using identical genetic code 
Similar mechanisms for transcription and translation of genetic information, 
including similar ribosomes 
Shared metabolic pathways (e.g., glycolysis and TCA cycle) 
Similar apparatus for conservation of chemical energy as ATP (located in the 
plasma membrane of prokaryotes and the mitochondrial membrane of 
eukaryotes) 
Similar mechanism of photosynthesis (between cyanobacteria and green plants) 
Similar mechanism for synthesizing and inserting membrane proteins 
Proteasomes (protein digesting structures) of similar construction (between 
archaebacteria and eukaryotes) 

¢ Cytoskeletal filaments built of proteins similar to actin and tubulin 


Features of eukaryotic cells not found in prokaryotes: 


Division of cells into nucleus and cytoplasm, separated by a nuclear envelope 
containing complex pore structures 

Complex chromosomes composed of DNA and associated proteins that are 
capable of compacting into mitotic structures 

Complex membranous cytoplasmic organelles (includes endoplasmic 
reticulum, Golgi complex, lysosomes, endosomes, peroxisomes, and 
glyoxisomes) 

Specialized cytoplasmic organelles for aerobic respiration (mitochondria) and 
photosynthesis (chloroplasts) 

Complex cytoskeletal system (including actin filaments, intermediate 
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filaments, and microtubules) and associated motor proteins 

Complex flagella and cilia 

Ability to ingest particulate material by enclosure within plasma membrane 
vesicles (phagocytosis) 

Cellulose-containing cell walls (in plants) 

Cell division using a microtubule-containing mitotic spindle that separates 
chromosomes 

Presence of two copies of genes per cell (diploidy), one from each parent 
Presence of three different RNA synthesizing enzymes (RNA polymerases) 
Sexual reproduction requiring meiosis and fertilization 


The difference in structural complexity is evident in the electron 
micrographs of a bacterial and an animal cell shown in Figures 1.184 
and 1.10, respectively. Both contain a nuclear region that houses the 
cell’s genetic material, surrounded by cytoplasm. The genetic mate- 
rial of a prokaryotic cell is present in a nucleoid: a poorly demar- 
cated region of the cell that lacks a boundary membrane to separate 
it from the surrounding cytoplasm. In contrast, eukaryotic cells pos- 
sess a nucleus: a region bounded by a complex membranous struc- 
ture called the nuclear envelope. This difference in nuclear structure 
is the basis for the terms prokaryotic (pro = before, karyon = nucleus) 
and eukaryotic (eu = true, karyon = nucleus). Prokaryotic cells con- 
tain relatively small amounts of DNA; the DNA content of bacteria 
ranges from about 600,000 base pairs to nearly 8 million base pairs 
and encodes between about 500 and several thousand proteins.” 
Although a “simple” baker’s yeast cell has only slightly more DNA 
(12 million base pairs encoding about 6200 proteins) than the most 
complex prokaryotes, most eukaryotic cells contain considerably 
more genetic information. Both prokaryotic and eukaryotic cells 
have DNA-containing chromosomes. Eukaryotic cells possess a 
number of separate chromosomes, each containing a single linear 
molecule of DNA. In contrast, nearly all prokaryotes that have been 
studied contain a single, circular chromosome. More importantly, 
the chromosomal DNA of eukaryotes, unlike that of prokaryotes, is 


*Eight million base pairs is equivalent toa DNA molecule nearly 3 mm long. 


tightly associated with proteins to form a complex nucleoprotein 
material known as chromatin. 

The cytoplasm of the two types of cells is also very different. The 
cytoplasm of a eukaryotic cell is filled with a great diversity of struc- 
tures, as is readily apparent by examining an electron micrograph of 
nearly any plant or animal cell (FIGURE 1.10). Even yeast, the sim- 
plest eukaryote, is much more complex structurally than an average 
bacterium, even though these two organisms have a similar number 
of genes. Eukaryotic cells contain an array of membrane-bound 
organelles. Eukaryotic organelles include mitochondria, where 
chemical energy is made available to fuel cellular activities; an endo- 
plasmic reticulum, where many of a cell’s proteins and lipids are 
manufactured; Golgi complexes, where materials are sorted, modi- 
fied, and transported to specific cellular destinations; and a variety 
of simple membrane-bound vesicles of varying dimension. Plant 
cells contain additional membranous organelles, including chloro- 
plasts, which are the sites of photosynthesis, and often a single large 
vacuole that can occupy most of the volume of the cell. Taken as a 
group, the membranes of the eukaryotic cell serve to divide the cyto- 
plasm into compartments within which specialized activities can 
take place. In contrast, the cytoplasm of prokaryotic cells is essen- 
tially devoid of membranous structures. The complex photosyn- 
thetic membranes of the cyanobacteria are a major exception to this 
generalization (see Figure 1.15). 

The cytoplasmic membranes of eukaryotic cells form a system 
of interconnecting channels and vesicles that function in the trans- 
port of substances from one part of a cell to another, as well as 
between the inside of the cell and its environment. Because of their 
small size, directed intracytoplasmic communication is less impor- 
tant in prokaryotic cells, where the necessary movement of materials 
can be accomplished by simple diffusion. 

Eukaryotic cells also contain numerous structures lacking a sur- 
rounding membrane. Included in this group are the elongated tubules 
and filaments of the cytoskeleton, which participate in cell contractil- 
ity, movement, and support. It was thought for many years that 
prokaryotic cells lacked any trace of a cytoskeleton, but primitive 
cytoskeletal filaments have been found in bacteria (see Chapter 9). It is 
still fair to say that the prokaryotic cytoskeleton is much simpler, both 
structurally and functionally, than that of eukaryotes. Both eukaryotic 
and prokaryotic cells possess ribosomes, which are nonmembranous 
particles that function as “workbenches” on which the proteins of the 
cell are manufactured. Even though ribosomes of prokaryotic and 
eukaryotic cells have considerably different dimensions (those of 
prokaryotes are smaller and contain fewer components), these struc- 
tures participate in the assembly of proteins by a similar mechanism in 
both types of cells. FIGURE 1.11 is a colorized electron micrograph of a 
portion of the cytoplasm near the thin edge of a single-celled eukary- 
otic organism. This is a region of the cell where membrane-bound 
organelles tend to be absent. The micrograph shows individual fila- 
ments of the cytoskeleton (orange) and other large macromolecular 
complexes of the cytoplasm (turquoise). Most of these complexes are 
ribosomes. It is evident from this type of image that the cytoplasm of a 
eukaryotic cell is extremely crowded, leaving very little space for the 
soluble phase of the cytoplasm, which is called the cytosol. 

Other major differences between eukaryotic and prokaryotic 
cells can be noted. Eukaryotic cells divide by a complex process of 
mitosis in which duplicated chromosomes condense into compact 
structures that are segregated by an elaborate microtubule-containing 
apparatus (FIGURE 1.12). This apparatus, which is called a mitotic spindle, 
allows each daughter cell to receive an equivalent array of genetic 
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FIGURE 1.10 The structure of a eukaryotic cell. This epithelial cell lines the male reproductive tract in the rat. A number of different organelles are 
indicated and depicted in schematic diagrams around the border of the figure. 


Source: David M. Phillips/Photo Researchers, Inc. 
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FIGURE 1.11 The cytoplasm of a eukaryotic cell is a crowded 
compartment. This colorized electron micrographic image shows a small 
region near the edge of a single-celled eukaryotic organism that had been 
quickly frozen prior to microscopic examination. The three-dimensional 
appearance is made possible by capturing two-dimensional digital images 
of the specimen at different angles and merging the individual frames using 
a computer. Cytoskeletal filaments are shown in orange, macromolecular 
complexes (primarily ribosomes) are turquoise, and portions of cell 
membranes are blue. 

Source: From Ohad Medalia et al., Science 298:1211, 2002, Figure 3a. 

© 2002, reprinted with permission from AAAS. Photo provided courtesy of 
Wolfgang Baumeister. 
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FIGURE 1.12 Cell division in eukaryotes requires the assembly of an 
elaborate chromosome-separating apparatus called the mitotic spindle, 
which is constructed primarily of microtubules. The microtubules in this 
micrograph appear green because they are bound by an antibody that is 
linked to a green fluorescent dye. The chromosomes, which were about to be 
separated into two daughter cells when this cell was fixed, are stained blue. 


Source: Courtesy of Conly L. Rieder. 


material. In prokaryotes, there is no mitotic spindle to separate the 
genome copies after replication. It was once thought that the two copies 
are separated by attaching the DNA to the cell surface allowing the 
growth of the cell membrane to pull them apart. However, live cell 


imaging showed that the DNA separates faster than the cell grows, 
and the precise mechanism by which prokaryotes segregate their 
genomes remains an open question. Some current models are based 
on regulated compaction or folding of the DNA so that the two cop- 
ies would fold into two separate masses, thus separating them. 

For the most part, prokaryotes are nonsexual organisms. They 
contain only one copy of their single chromosome and have no pro- 
cesses comparable to meiosis, gamete formation, or true fertilization. 
Even though true sexual reproduction is lacking among prokaryotes, 
some are capable of conjugation, in which a piece of DNA is passed 
from one cell to another (FIGURE 1.13). However, the recipient 
almost never receives a whole chromosome from the donor, and the 
condition in which the recipient cell contains both its own and its 
partner’s DNA is fleeting. The cell soon reverts back to possession of 
a single chromosome. Although prokaryotes may not be as efficient 
as eukaryotes in exchanging DNA with other members of their own 
species, they are more adept than eukaryotes at picking up and 
incorporating foreign DNA from their environment, which has had 
considerable impact on microbial evolution (page 29). 

Eukaryotic cells possess a variety of complex locomotor mecha- 
nisms, whereas those of prokaryotes are relatively simple. The move- 
ment of a prokaryotic cell may be accomplished by a thin protein 
filament, called a flagellum, which protrudes from the cell and rotates 
(FIGURE 1.14a). The rotations of the flagellum, which can exceed 
1000 times per second, exert pressure against the surrounding fluid, 
propelling the cell through the medium. Certain eukaryotic cells, 
including many protists and sperm cells, also possess flagella, but the 
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FIGURE 1.13 Bacterial conjugation. Electron micrograph showing a 
conjugating pair of bacteria joined by a structure of the donor cell, termed 
the F pilus, through which DNA is thought to be passed. 


Source: Courtesy of Charles C. Brinton, Jr., and Judith Carnahan. 
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FIGURE 1.14 The difference between prokaryotic and eukaryotic 
flagella. (a) The bacterium Salmonella with its numerous flagella. Inset shows 
a high-magnification view of a portion of a single bacterial flagellum, which 
consists largely of a single protein called flagellin. (b) Each of these human 
sperm cells is powered by the undulatory movements of a single flagellum. 
The inset shows a cross section of the central core of a mammalian sperm 
flagellum. The flagella of eukaryotic cells are so similar that this cross section 
could just as well have been taken of a flagellum from a protist or green alga. 


Source: (a) From Bernard R. Gerber, Lewis M. Routledge, and Shiro Takashima, 
J. Mol. Biol. 71:322, © 1972, with permission from Elsevier; (b) Juergen 
Berger/Photo Researchers, Inc. Inset: Don W. Fawcett/Photo Researchers, Inc. 


eukaryotic versions are much more complex than the simple protein 
filaments of bacteria (Figure 1.14b), and they generate movement by 
a different mechanism. 


In the preceding paragraphs, many of the most important differ- 
ences between the prokaryotic and eukaryotic levels of cellular 
organization were mentioned. We will elaborate on many of these 
points in later chapters. Before you dismiss prokaryotes as inferior, 
keep in mind that these organisms have remained on Earth for more 
than three billion years, and at this very moment, trillions of them are 
clinging to the outer surface of your body and feasting on the nutri- 
ents within your digestive tract. We think of these organisms as indi- 
vidual, solitary creatures, but recent insights have shown that they 
live in complex, multispecies communities called biofilms. The layer 
of plaque that grows on our teeth is an example of a biofilm. Different 
cells in a biofilm may carry out different specialized activities, not 
unlike the cells in a plant or an animal. Consider also that, metaboli- 
cally, prokaryotes are very sophisticated, highly evolved organisms. 
For example, a bacterium, such as Escherichia coli, a common inhab- 
itant of both the human digestive tract and the laboratory culture 
dish, has the ability to live and prosper in a medium containing one 
or two low-molecular-weight organic compounds and a few inor- 
ganic ions. Other bacteria are able to live on a diet consisting solely of 
inorganic substances. One species of bacteria has been found in wells 
more than a thousand meters below the Earth's surface living on 
basalt rock and molecular hydrogen (H,) produced by inorganic 
reactions. In contrast, even the most metabolically talented cells in 
your body require a variety of organic compounds, including a num- 
ber of vitamins and other essential substances they cannot make on 
their own. In fact, many of these essential dietary ingredients are pro- 
duced by the bacteria that normally live in the large intestine. 


REVIEW 


1. Compare a prokaryotic and eukaryotic cell on the 
basis of structural, functional, and metabolic 
differences. 


1.4 Types of Prokaryotic Cells 


The distinction between prokaryotic and eukaryotic cells is based on 
structural complexity (as detailed in Table 1.1) and not on phyloge- 
netic relationship. Prokaryotes are divided into two major taxo- 
nomic groups, or domains: the Archaea (or archaebacteria) and the 
Bacteria (or eubacteria). Members of the Archaea are more closely 
related to eukaryotes than they are to the other group of prokaryotes 
(the Bacteria). The experiments that led to the discovery that life 
is represented by three distinct branches are discussed in the 
Experimental Pathways at the end of the chapter. 


Domain Archaea and Domain Bacteria 


The domain Archaea includes several groups of organisms whose 
evolutionary ties to one another are revealed by similarities in the 
nucleotide sequences of their nucleic acids. The best known Archaea 
are species that live in extremely inhospitable environments; they are 
often referred to as “extremophiles.” Included among the Archaea 
are the methanogens [prokaryotes capable of converting CO, and H, 
gases into methane (CH,) gas]; the halophiles (prokaryotes that live 
in extremely salty environments, such as the Dead Sea or certain 
deep sea brine pools that possess a salinity equivalent to 5M MgCl,); 
acidophiles (acid-loving prokaryotes that thrive at a pH as low as 0, 
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and thermophiles (prokaryotes that live at very high temperatures). 
Included in this last-named group are hyperthermophiles, which 
live in the hydrothermal vents of the ocean floor. The latest record 
holder among this group has been named “strain 121” because it is 
able to grow and divide in superheated water at a temperature of 
121°C, which just happens to be the temperature used to sterilize 
surgical instruments in an autoclave. Recent analyses of soil and 
ocean microbes indicate that many members of the Archaea are also 
at home in habitats of normal temperature, pH, and salinity. 

All other prokaryotes are classified in the domain Bacteria. This 
domain includes the smallest known cells, the mycoplasma (0.2 um 
diameter), which are the only known prokaryotes to lack a cell wall 
and to contain a genome with fewer than 500 genes. Bacteria are 
present in every conceivable habitat on Earth, from the permanent 
ice shelf of the Antarctic to the driest African deserts, to the internal 
confines of plants and animals. Bacteria have even been found living 
in rock layers situated several kilometers beneath the Earth's surface. 
Some of these bacterial communities are thought to have been cut 
off from life on the surface for more than one hundred million years. 
The most complex prokaryotes are the cyanobacteria. Cyanobacteria 
contain elaborate arrays of cytoplasmic membranes, which serve as 
sites of photosynthesis (FIGURE 1.15a). The membranes of cyano- 
bacteria are very similar to the photosynthetic membranes present 
within the chloroplasts of plant cells. As in eukaryotic plants, photo- 
synthesis in cyanobacteria is accomplished by splitting water mole- 
cules, which releases molecular oxygen. 

Many cyanobacteria are capable not only of photosynthesis, but 
also of nitrogen fixation, the conversion of nitrogen (N;) gas into 
reduced forms of nitrogen (such as ammonia, NH;) that can be used 
by cells in the synthesis of nitrogen-containing organic compounds, 
including amino acids and nucleotides. Those species capable of both 
photosynthesis and nitrogen fixation can survive on the barest of 
resources—light, N,, CO , and H,O. It is not surprising, therefore, 
that cyanobacteria are usually the first organisms to colonize the bare 
rocks rendered lifeless by a scorching volcanic eruption. Another unu- 
sual habitat occupied by cyanobacteria is illustrated in Figure 1.150. 


Prokaryotic Diversity 


For the most part, microbiologists are familiar only with those 
microorganisms they are able to grow in a culture medium. When a 
patient suffering from a respiratory or urinary tract infection sees 
his or her physician, one of the first steps often taken is to culture the 
pathogen. Once it has been cultured, the organism can be identified 
and the proper treatment prescribed. It has proven relatively easy to 
culture most disease-causing prokaryotes, but the same is not true 
for those living free in nature. The problem is compounded by the 
fact that prokaryotes are barely visible in a light microscope and 
their morphology is often not very distinctive. To date, roughly 6000 
species of prokaryotes have been identified by traditional tech- 
niques, which is less than one-tenth of 1 percent of the millions of 
prokaryotic species thought to exist on Earth! Our appreciation for 
the diversity of prokaryotic communities has increased dramatically 
in recent years with the use of molecular techniques that do not 
require the isolation of a particular organism. 

Suppose one wanted to learn about the diversity of prokaryotes 
that live in the upper layers of the Pacific Ocean off the coast of 
California. Rather than trying to culture such organisms, which 
would prove largely futile, a researcher could concentrate the cells 


(b) 


FIGURE 1.15 Cyanobacteria. (a) Electron micrograph of a cyanobacterium 
showing the cytoplasmic membranes that carry out photosynthesis. These 
concentric membranes are very similar to the thylakoid membranes present 
within the chloroplasts of plant cells, a reminder that chloroplasts evolved 
from a symbiotic cyanobacterium. (b) Cyanobacteria living inside the hairs of 
these polar bears are responsible for the unusual greenish color of their coats. 
Source: (a) Courtesy of Norma J. Lang. (b) Courtesy Zoological Society of 
San Diego. 


from a sample of ocean water, extract the DNA, and analyze certain 
DNA sequences present in the preparation. All organisms share cer- 
tain genes, such as those that code for the RNAs present in ribo- 
somes or the enzymes of certain metabolic pathways. Even though 
all organisms may share such genes, the sequences of the nucleotides 
that make up the genes vary considerably from one species to 
another. This is the basis of biological evolution. By using techniques 
that reveal the variety of DNA sequences of a particular gene in a 
particular habitat, one learns directly about the diversity of species 
that live in that habitat. Recent sequencing techniques have become 
so rapid and cost-efficient that virtually all of the genes present in 
the microbes of a given habitat can be sequenced, generating a col- 
lective genome, or metagenome. This approach can provide informa- 
tion about the types of proteins these organisms manufacture and 
thus about many of the metabolic activities in which they engage. 
These same molecular strategies are being used to explore the 
remarkable diversity among the trillions of “unseen passengers” that 
live on or within our own bodies, in habitats such as the intestinal tract, 
mouth, vagina, and skin. This collection of microbes, which is known 
as the human microbiome, is the subject of several international 


TABLE 1.2 Number and Biomass of Prokaryotes in the World 


No. of prokaryotic 
cells, x 1078 


Aquatic habitats 12 Dp 
Oceanic subsurface 355 303 
Soil 26 26 
Terrestrial subsurface 25-250 22-215 

Total 415-640 353-546 


Pg of Cin 
prokaryotes" 


Environment 


*1 petagram (Pg) = 10'°¢. 
Source: W. B. Whitman et al., Proc. Nat'l. Acad. Sci. U.S.A. 95:6581, 1998. 


research efforts aimed at identifying and characterizing these organ- 
isms in people of different age, diet, geography, and state of health. It 
has already been demonstrated, for example, that obese and lean 
humans have markedly different populations of bacteria in their 
digestive tracts. As obese individuals lose weight, their bacterial pro- 
file shifts toward that of the leaner individuals. One recent study of 
fecal samples taken from 124 people of varying weight revealed the 
presence within the collective population of more than 1000 differ- 
ent species of bacteria. Taken together, these microbes contained 
more than 3 million distinct genes—approximately 150 times as 
many as the number present in the human genome. Among the 
functions of proteins encoded by these microbial genomes are the 
synthesis of vitamins, the breakdown of complex plant sugars, and 
the prevention of growth of pathogenic organisms. 

By using sequence-based molecular techniques, biologists have 
found that most habitats on Earth are teeming with previously unrec- 
ognized prokaryotic life. One estimate of the sheer numbers of 
prokaryotes in the major habitats of the Earth is given in Table 1.2. It 
is noteworthy that more than 90 percent of these organisms are now 
thought to live in the subsurface sediments well beneath the oceans 
and upper soil layers. Nutrients can be so scarce in some of these deep 
sediments that microbes living there are thought to divide only once 
every several hundred years! Table 1.2 also provides an estimate of the 
amount of carbon that is sequestered in the world’s prokaryotic cells. 
To put this number into more familiar terms, it is roughly comparable 
to the total amount of carbon present in all of the world’s plant life. 


REVIEW | 


1. Which group of prokaryotes is best known for 
containing many extremophiles? 


oe 


1.5 Types of Eukaryotic Cells 


In many regards, the most complex eukaryotic cells are not found inside 
of plants or animals, but rather among the single-celled (unicellular) 
protists, such as those pictured in FIGURE 1.16. All of the machinery 
required for the complex activities in which this organism engages— 
sensing the environment, trapping food, expelling excess fluid, evading 
predators—is housed within the confines of a single cell. 


Cell Differentiation 


Complex unicellular organisms represent one evolutionary pathway. An 
alternate pathway has led to the evolution of multicellular organisms in 
which different activities are conducted by different types of specialized 


FIGURE 1.16 Vorticella, a complex ciliated protist. A number of these 
unicellular organisms are seen here; most have withdrawn their “heads” 
due to shortening of the blue-stained contractile ribbon in the stalk. Each 
cell has a single large nucleus, called a macronucleus (arrow), which 
contains many copies of the genes. 


Source: Carolina Biological Supply Co./Phototake. 


cells. Specialized cells are formed bya process called differentiation. A 
fertilized human egg, for example, will progress through a course of 
embryonic development that leads to the formation of approximately 
250 distinct types of differentiated cells. Some cells become part of a 
particular digestive gland, others part of a large skeletal muscle, others 
part of a bone, and so forth (FIGURE 1.17). The pathway of differentia- 
tion followed by each embryonic cell depends primarily on the signals it 
receives from the surrounding environment; these signals in turn 
depend on the position of that cell within the embryo. As discussed in 
the accompanying Human Perspective, researchers are learning how to 
control the process of differentiation in the culture dish and applying 
this knowledge to the treatment of complex human diseases. 

As a result of differentiation, different types of cells acquire a 
distinctive appearance and contain unique materials. Skeletal muscle 
cells contain a network of precisely aligned filaments composed of 
unique contractile proteins; cartilage cells become surrounded by a 
characteristic matrix containing polysaccharides and the protein col- 
lagen, which together provide mechanical support; red blood cells 
become disk-shaped sacks filled with a single protein, hemoglobin, 
which transports oxygen; and so forth. Despite their many differ- 
ences, the various cells of a multicellular plant or animal are com- 
posed of similar organelles. Mitochondria, for example, are found in 
essentially all types of cells. In one type, however, they may have a 
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FIGURE 1.17 Pathways of cell differentiation. A few of the types of differentiated cells present in a human fetus. 


Source: Micrographs Courtesy of Michael Ross, University of Florida. 


rounded shape, whereas in another they may be highly elongated 
and thread-like. In each case, the number, appearance, and location 
of the various organelles can be correlated with the activities of the 
particular cell type. An analogy might be made to a variety of orchestral 
pieces: all are composed of the same notes, but varying arrangement 
gives each its unique character and beauty. 


Model Organisms 


Living organisms are highly diverse, and the results obtained from a 
particular experimental analysis may depend on the particular 
organism being studied. As a result, cell and molecular biologists 
have focused considerable research activities on a small number of 
“representative” or model organisms. It is hoped that a compre- 
hensive body of knowledge built on these studies will provide a 
framework to understand those basic processes that are shared by 
most organisms, especially humans. This is not to suggest that many 
other organisms are not widely used in the study of cell and molecu- 
lar biology. Nevertheless, six model organisms—one prokaryote and 
five eukaryotes—have captured much of the attention: a bacterium, 


E. coli; a budding yeast, Saccharomyces cerevisiae; a flowering plant, 
Arabidopsis thaliana; a nematode, Caenorhabditis elegans; a fruit fly, 
Drosophila melanogaster; and a mouse, Mus musculus. Each of these 
organisms has specific advantages that make it particularly useful as 
a research subject for answering certain types of questions. Each of 
these organisms is pictured in FIGURE 1.18, and a few of their advan- 
tages as research systems are described in the accompanying legend. 
We will concentrate in this text on results obtained from studies on 
mammalian systems—mostly on the mouse and on cultured mam- 
malian cells—because these findings are most applicable to humans. 
But a large portion of what we know about mammalian cells was first 
discovered by experiments in other model organisms that are easier 
to work with. Thus, we will have many occasions to describe research 
carried out on the cells of other species. You may be surprised to 
discover how similar you are at the cell and molecular level to these 
much smaller and simpler organisms. 


REVIEW [| 


1. What is the importance of cell differentiation? 


(d) (f) 


FIGURE 1.18 Six model organisms. (a) Escherichia coli is a rod-shaped bacterium that lives in the digestive tract of humans and other mammals. Much of 
what we will discuss about the basic molecular biology of the cell, including the mechanisms of replication, transcription, and translation, was originally 
worked out on this one prokaryotic organism. The relatively simple organization of a prokaryotic cell is illustrated in this electron micrograph. 

(b) Saccharomyces cerevisiae, more commonly known as baker's yeast or brewer's yeast. It is the least complex of the eukaryotes commonly studied, yet it 
contains a surprising number of proteins that are homologous to proteins in human cells. Such proteins typically have a conserved function in the two 
organisms. The species has a small genome encoding about 6200 proteins; it can be grown in a haploid state (one copy of each gene per cell rather than 
two as in most eukaryotic cells); and it can be grown under either aerobic (O2-containing) or anaerobic (O2-lacking) conditions. It is ideal for the identification 
of genes through the use of mutants. (c) Arabidopsis thaliana, a weed (called the thale cress) that is related to mustard and cabbage, which has an unusually 
small genome (120 million base pairs) for a flowering plant, a rapid generation time, and large seed production, and it grows to a height of only a few 
inches. (a) Caenorhabditis elegans, a microscopic-sized nematode, consists of a defined number of cells (roughly 1000), each of which develops according 
to a precise pattern of cell divisions. The animal is easily cultured, can be kept alive in a frozen state, has a transparent body wall, a short generation time, 
and facility for genetic analysis. This micrograph shows the larval nervous system, which has been labeled with the green fluorescent protein (GFP). The 2002 
Nobel Prize was awarded to the researchers who pioneered its study. (e) Drosophila melanogaster, the fruit fly, is a small but complex eukaryote that is 
readily cultured in the lab, where it grows from an egg to an adult in a matter of days. Drosophila has been a favored animal for the study of genetics, the 
molecular biology of development, and the neurological basis of simple behavior. Certain larval cells have giant chromosomes, whose individual genes can 
be identified for studies of evolution and gene expression. In the mutant fly shown here, a leg has developed where an antenna would be located in a 
normal (wild type) fly. (f/ Mus musculus, the common house mouse, is easily kept and bred in the laboratory. Thousands of different genetic strains have 
been developed, many of which are stored simply as frozen embryos due to lack of space to house the adult animals. The “nude mouse” pictured here 
develops without a thymus gland and, therefore, is able to accept human tissue grafts that are not rejected. 

Source: (a) Biophoto Associates/Photo Researchers; (b) Biophoto Associates/Photo Researchers; (c) Courtesy of Erik Jorgensen, University of Utah. From 
Trends Genetics, Vol. 14, Cover#12, 1998, with permission from Elsevier; (d) Courtesy of Erik Jorgensen, University of Utah. From Trends Genetics, Vol. 14, 
Cover#12, 1998, with permission from Elseviere: David Scharf/Photo Researchers, Inc. f: Ted Spiegel/© Corbis Images. 


1.6 THE HUMAN PERSPECTIVE 


The Prospect of Cell Replacement Therapy 


Many human diseases result from the deaths of specific types of be traced to the death of cadiac muscle cells (cardiomyocytes) in 
cells. Type 1 diabetes, for example, results from the destruction of the heart. Imagine the possibilities if we could isolate cells from 
beta cells in the pancreas; Parkinson's disease occurs with the loss a patient, convert them into the cells that are needed by that 
of dopamine-producing neurons in the brain; and heart failure can patient, and then infuse them back into the patient to restore the 
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body's lost function. Recent studies have given researchers hope 
that one day this type of therapy will be common place. To better 
understand the concept of cell replacement therapy, we can 
consider a procedure used widely in current practice known as 
bone marrow transplantation in which cells are extracted from the 
pelvic bones of a donor and infused into the body of a recipient. 

Bone marrow transplantation is used most often to treat lym- 
phomas and leukemias, which are cancers that affect the nature 
and number of white blood cells. To carry out the procedure, the 
patient is exposed to a high level of radiation and/or toxic chemi- 
cals, which kills the cancer cells, but also kills all of the cells involved 
in the formation of red and white blood cells. This treatment has 
this effect because blood-forming cells are particularly sensitive 
to radiation and toxic chemicals. Once a person’s blood-forming 
cells have been destroyed, they are replaced by bone marrow 
cells transplanted from a healthy donor. Bone marrow can regen- 
erate the blood tissue of the transplant recipient because it con- 
tains a small percentage of cells that can proliferate and restock 
the patient's blood-forming bone marrow tissue.' These blood- 
forming cells in the bone marrow are termed hematopoietic 
stem cells (or HSCs), and they were discovered in the early 1960s 
by Ernest McCulloch and James Till at the University of Toronto. 
HSCs are responsible for replacing the millions of red and white 
blood cells that age and die every minute in our bodies (see 
Figure 17.6). Amazingly, a single HSC is capable of reconstituting 
the entire hematopoietic (blood-forming) system of an irradiated 
mouse. An increasing number of parents are saving the blood 
from the umbilical cord of their newborn baby as a type of “stem- 
cell insurance policy” in case that child should ever develop a dis- 
ease that might be treated by administration of HSCs. Now that 
we have described one type of cell replacement therapy, we can 
consider several other types that have a much wider therapeutic 
potential. We will divide these potential therapies into four types. 


Adult Stem Cells 


Hematopoietic stem cells in the bone marrow are an example of 
an adult stem cell. Stem cells are defined as undifferentiated cells 
that (1) are capable of self-renewal, that is, production of more 
cells like themselves, and (2) are multipotent, that is, are capable 
of differentiating into two or more mature cell types. HSCs of the 
bone marrow are only one type of adult stem cell. Most, if not all, of 
the organs in a human adult contain stem cells that are capable of 
replacing the particular cells of the tissue in which they are found. 
Even the adult brain, which is not known for its ability to regen- 
erate, contains stem cells that can generate new neurons and glial 
cells (the supportive cells of the brain). FIGURE 1a shows an iso- 
lated stem cell present in adult skeletal muscle; these “satellite 
cells,” as they are called, are thought to divide and differentiate 
as needed for the repair of injured muscle tissue. Figure 1b shows 
a culture of adipose (fat) cells that have differentiated in vitro from 
adult stem cells that are present within fat tissue. 

The adult human heart contains stem cells that are capable 
of differentiating into the cells that form both the muscle tissue of 
the heart (the cardiomyocytes of the myocardium) and the heart's 
blood vessels. It had been hoped that these cardiac stem cells 
might have the potential to regenerate healthy heart tissue in a 
patient who had experienced a serious heart attack. This hope 
has apparently been realized based on the appearance of two 
landmark reports in late 2011 on the results from clinical trials of 
patients that had suffered significant heart-tissue damage fol- 
lowing heart attacks. Stem cells were harvested from each of the 
patients during heart surgeries, expanded in number through in 
vitro culture, and then infused back into each patient’s heart. Over 
the next few months, a majority of treated patients experienced 
significant replacement (e.g., 50 percent) of the damaged heart 


(a) (b) 


FIGURE 1 An adult muscle stem cell. (a) A portion of a muscle fiber, 
with its many nuclei stained blue. A single stem cell (yellow) is seen to be 
lodged between the outer surface of the muscle fiber and an extracellular 
layer (or basement membrane), which is stained red. The undifferentiated 
stem cell exhibits this yellow color because it expresses a protein that is not 
present in the differentiated muscle fiber. (b) Adult stem cells undergoing 
differentiation into adipose (fat) cells in culture. Stem cells capable of this 
process are present in adult fat tissue and also bone marrow. 

Source: (a) From Charlotte A. Collins; et al., Cell 122:291, 2005; by 
permission of Elsevier; (b) Courtesy of Thermo Fisher Scientific, fom 
Nature 451:855, 2008. 


muscle by healthy tissue derived from the infused stem cells. This 
regeneration of heart tissue was accompanied by a clear improve- 
ment in quality of life compared to patients in the placebo group 
that did not receive stem cells. Adult stem cells are an ideal 
system for cell replacement therapies because they represent an 
autologous treatment; that is, the cells are taken from the same 
patient in which they are used. Consequently, these stem cells do 
not face the prospect of immune rejection. These dramatic results 
with cardiac stem cells rekindled interest in adult stem cells, which 
had waned after a number of failed attempts to direct stem cells 
isolated from bone marrow to regenerate diseased tissues. The 
great majority of adult stem cell therapies under development 
use a type of adult stem cell known as a mesenchymal stem cell 
(MSC). These can be obtained from bone marrow but they are 
different from the HSCs discussed above in that they do not pro- 
duce blood cells but rather a variety of other cell types found in 
various tissues and organs. MSCs can also be obtained from fat 
tissue obtained during liposuction procedures. Currently there 
are well over 100 controlled clinical trials underway for treat- 
ing a wide range of diseases with MSC-derived cells, including 
heart disease, diabetes, and immune diseases such as Lupus 
and Crohn's disease. An MSC-based therapy called “Prochymal” 
became the first FDA approved stem cell therapy. It is used to 
treat Crohn's disease as well as to treat immune reactions that can 
occur in patients who receive bone marrow transplants. 


Embryonic Stem Cells 


Much of the excitement that has been generated in the field over 
the past decade or two has come from studies on embryonic 
stem (ES) cells, which are a type of stem cell isolated from very 
young mammalian embryos (FIGURE 2a). These are the cells in 
the early embryo that give rise to all of the various structures of the 
mammalian fetus. Unlike adult stem cells, ES cells are pluripotent; 
that is, they are capable of differentiating into every type of cell 
in the body. In most cases, human ES cells have been isolated 
from embryos provided by in vitro fertilization clinics. Worldwide, 
dozens of genetically distinct human ES cell lines, each derived 
from a single embryo, are available for experimental investigation. 
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FIGURE 2 Embryonic stem cells; their isolation and potential use. 

(a) Micrograph of a mammalian blastocyst, an early stage during 
embryonic development, showing the inner cell mass, which is composed 
of pluripotent ES cells. Once isolated, such cells are readily grown in 
culture. (b) A potential procedure for obtaining differentiated cells for use 
in cell replacement therapy. A small piece of tissue is taken from the 
patient, and one of the somatic cells is fused with a donor oocyte whose 
own nucleus had been previously removed. The resulting oocyte (egg), 
with the patient's cell nucleus, is allowed to develop into an early embryo, 
and the ES cells are harvested and grown in culture. A population of ES 
cells are induced to differentiate into the required cells, which are 
subsequently transplanted into the patient to restore organ function. (At 
the present time, it has not been possible to obtain blastocyst stage 
embryos, that is, ones with ES cells, from any primate species by the 
procedure shown here, although it has been accomplished using an 
oocyte from which the nucleus is not first removed. The ES cells that 

are generated in such experiments are triploid; that is, they have three 
copies of each chromosome—one from the oocyte and two from the 
donor nucleus—rather than two, as would normally be the case. 
Regardless, these triploid ES cells are pluripotent and capable of 
transplantation.) 


Source: © Phanie/SuperStock 


The long-range goal of clinical researchers is to learn how 
to coax ES cells to differentiate in culture into each of the many 
cell types that might be used for cell replacement therapy. 
Considerable progress has been made in this pursuit, and 
numerous studies have shown that transplants of differentiated, 
ES-derived cells can improve the condition of animals with dis- 
eased or damaged organs. The first trial in humans was begun 
in 2009 on patients who had experienced debilitating spinal cord 
injuries. The trial to treat spinal cord injuries utilized cells, called 
oligodendrocytes, that produce the myelin sheaths that become 
wrapped around nerve cells (see Figure 4.5). The oligodendro- 
cytes transplanted into these patients were differentiated from 
human ES cells that were cultured in a medium containing insulin, 
thyroid hormone, and a combination of certain growth factors. 
This particular culture protocol had been found to direct the 
differentiation of ES cells into oligodendrocytes rather than any 
other cell type. Unfortunately, no significant improvement was 
reported in the treated patients, and the company conducting the 
trial decided to cease further involvement in the effort. 

Embryonic stem cell therapy is currently under intense study 
as a treatment for retinal degeneration diseases such as macular 
degeneration. At the time of this writing there are eight government- 
approved clinical trials using ES cells induced to differentiate into 
retinal pigmented epithelial cells, a key cell type within the retina, in 
an attempt to cure different forms of retinal degeneration. 

The primary risk with the therapeutic use of ES cells is the 
unnoticed presence of undifferentiated ES cells among the dif- 
ferentiated cell population. Undifferentiated ES cells are capable 
of forming a type of benign tumor, called a teratoma, which may 
contain a bizarre mass of various differentiated tissues, including 
hair and teeth. The formation of a teratoma within the central ner- 
vous system could have severe consequences. In addition, the 
culture of ES cells at the present time involves the use of non- 
human biological materials, which also poses potential risks. 

The ES cells used in these early trials were derived from cell 
lines that had been isolated from human embryos unrelated to 
the patients who are being treated. Such cells face the prospect 
of immunologic rejection by the transplant recipient. It may be 
possible, however, to “customize” ES cells so that they possess 
the same genetic makeup of the individual who is being treated. 
This may be accomplished one day by a roundabout procedure 
called somatic cell nuclear transfer (SCNT), shown in Figure 2b, 
that begins with an unfertilized egg—a cell that is obtained from 
the ovaries of an unrelated woman donor. In this approach, the 
nucleus of the unfertilized egg would be replaced by the nucleus 
of a cell from the patient to be treated, which would cause the 
egg to have the same chromosome composition as that of the 
patient. The egg would then be allowed to develop to an early 
embryonic stage, and the ES cells would be removed, cultured, 
and induced to differentiate into the type of cells needed by the 
patient. Because this procedure involves the formation of a human 
embryo that is used only as a source of ES cells, there are major 
ethical questions that must be settled before it could be routinely 
practiced. In addition, the process of SCNT is so expensive and 
technically demanding that it is highly improbable that it could 
ever be practiced as part of any routine medical treatment. It is 
more likely that, if ES cell-based therapy is ever practiced, it would 
depend on the use of a bank of hundreds or thousands of differ- 
ent ES cells. Such a bank could contain cells that are close enough 
as a tissue match to be suitable for use in the majority of patients. 


Induced Pluripotent Stem Cells 


It had long been thought that the process of cell differentiation 
in mammals was irreversible; once a cell had become a fibroblast, 
or white blood cell, or cartilage cell, it could never again revert 
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to any other cell type. This concept was shattered in 2006 when 
Shinya Yamanaka and co-workers of Kyoto University announced 
a stunning discovery; his lab had succeeded in reprogramming a 
fully differentiated mouse cell—in this case a type of connective 
tissue fibroblast—into a pluripotent stem cell. They accomplished 
the feat by introducing into the mouse fibroblast the genes that 
encoded four key proteins that are characteristic of ES cells. These 
genes (Oct4, Sox2, KIf4, and Myc, known collectively as OSKM) 
are thought to play a key role in maintaining the cells in an undif- 
ferentiated state and allowing them to continue to self-renew. 
The genes were introduced into cultured fibroblasts using gene- 
carrying viruses, and those rare cells that became reprogrammed 
were selected from the others in the culture by specialized tech- 
niques. They called this new type of cells induced pluripotent cells 
(iPS cells) and demonstrated that they were indeed pluripotent by 
injecting them into a mouse blastocyst and finding that they par- 
ticipated in the differentiation of all the cells of the body, including 
eggs and sperm. Within the next year or so the same reprogram- 
ming feat had been accomplished in several labs with human cells. 
What this means is that researchers now have available to them an 
unlimited supply of pluripotent cells that can be directed to differ- 
entiate into various types of body cells using similar experimental 
protocols to those already developed for ES cells. 

Indeed, iPS cells have already been used to correct certain 
disease conditions in experimental animals, including sickle cell 
anemia in mice as depicted in FIGURE 3. Based on the promising 
results of animal experiments, attempts to use iPS cells in patients 
are beginning. The first clinical trial of an iPS cell based therapy 
was begun in 2014. Similar to ongoing embryonic stem cell trials 
mentioned above, this trial is testing the use of iPS cell derived 
retinal pigmented epithelial cells to treat macular degeneration. 
Results from this trial were not available at the time of this writing. 

The utility of iPS cells may extend far beyond cell replacement 
therapy. iPS cells have also been prepared from adult cells taken 
from patients with a multitude of genetic disorders. Researchers are 
then able to follow the differentiation of these iPS cells in culture 
into the specialized cell types that are affected by the particular 
disease. It is hoped that such studies will reveal the mechanisms of 
disease formation as it unfolds in a culture dish just as it would nor- 
mally occur in an unobservable way deep within the body. These 
“diseased iPS cells” have been referred to as “patients in a Petri 
dish.” The clinical relevance of these cells can be illustrated by 
an example. iPS cells derived from patients with a heart disorder 
called long QT syndrome differentiate into cardiac muscle cells 


aa Mouse with 
sickle cell anemia 


Transplant 

fe ne ™ 

Peete ce © ™:,, « a € 

into blood © c 
stem cells ‘ “A 

J re 

m Oct4, Sox2 


Recovered mouse Kif4, c-Myc 
viruses 


Genetically 
corrected iPS cells 


DNA with mutant 
gene (yellow) 


iD 
D 

normal gene 

(red) 


Collect skin cells 
| 
3 


Reprogram into ES- 
like iPS cells 


that exhibit irregular contractions (“beats”) in culture. This disease- 
specific phenotype seen in culture can be corrected by several med- 
icines normally prescribed to treat this disorder. Moreover, when 
these cardiomyocytes that had differentiated from the diseased iPS 
cells were exposed to the drug cisapride, the irregularity of their 
contractions increased. Cisapride is a drug that was used to treat 
heartburn before it was pulled from the market in the United States 
after it was shown to cause heart arrhythmias in certain patients. 
Results of this type suggest that differentiated cells derived from 
diseased iPS cells will serve as valuable targets for screening poten- 
tial drugs for their effectiveness in halting disease progression. 

Unlike ES cells, the generation of iPS cells does not require 
the use of an embryo. This feature removes all of the ethical reser- 
vations that accompany work with ES cells and also makes it much 
easier to generate these cells in the lab. However, as research on 
iPS cells has increased, the therapeutic potential for these cells 
has become less clear. For the first several years of study, it was 
thought that iPS cells and ES cells were essentially indistinguish- 
able. Recent studies, however, have shown that iPS cells lack the 
“high quality” characteristic of ES cells and that not all iPS cells are 
the same. For example, iPS cells exhibit certain genomic abnor- 
malities that are not present in ES cells, including the presence of 
mutations and extra copies of random segments of the genome. 
In addition, the DNA-containing chromatin of iPS cells retains 
certain traces of the original cells from which they were derived, 
which means that they are not completely reprogrammed into 
ES-like, pluripotent cells. This residual memory of their origin 
makes it is easier to direct iPS cells toward differentiation back 
into the cells from which they were derived than into other types 
of cells. It may be that these apparent deficiencies in iPS cells 
will not be a serious impediment in the use of these cells to treat 
diseases that affect adult tissues, but it has raised important ques- 
tions. There are other issues with iPS cells as well. It will be impor- 
tant to develop efficient cell reprogramming techniques that do 
not use genome-integrating viruses because such cells carry the 
potential of developing into cancers. Progress has been made in 
this regard, but the efficiency of iPS cell formation typically drops 
when other procedures are used to introduce genes. 

Like ES cells, undifferentiated iPS cells also give rise to tera- 
tomas, so it is essential that only fully differentiated cells are trans- 
planted into human subjects. Also like ES cells, the iPS cells in 
current use have the same tissue antigens as the donors who orig- 
inally provided them, so they would stimulate an immune attack if 
they were to be transplanted into other human recipients. Unlike 


FIGURE 3 Steps taken to generate induced pluripotent 
stem (iPS) cells for use in correcting the inherited disease 
sickle cell anemia in mice. Skin cells are collected from the 
diseased animal, reprogrammed in culture by introducing 
the four required genes that are ferried into the cells by 
viruses, and allowed to develop into undifferentiated 
pluripotent iPS cells. The iPS cells are then treated so as to 
replace the defective (globin) gene with a normal copy, and 
the corrected iPS cells are caused to differentiate into normal 
blood stem cells in culture. These blood stem cells are then 
injected back into the diseased mouse, where they prolifer- 
ate and differentiate into normal blood cells, thereby curing 
the disorder. 


Source: Reprinted from an illustration by Rudolf Jaenisch, 
Cell 132:5, 2008, with permission from Elsevier. 


the formation of ES cells, however, it will be much easier to gen- 
erate personalized, tissue-compatible iPS cells, because they can 
be derived from a simple skin biopsy from each patient. Still, it does 
take considerable time, expense, and technical expertise to gen- 
erate a population of iPS cells from a specific donor. Consequently, 
if iPS cells are ever developed for widespread therapeutic use, they 
would likely come from a large cell bank that could provide cells that 
are close tissue matches to most potential recipients. It may also be 
possible to remove all of the genes from iPS cells that normally pre- 
vent them from being transplanted into random recipients. 


Direct Cell Reprogramming 


In 2008 the field of cellular reprogramming took another unex- 
pected turn with the announcement that one type of differentiated 
cell had been converted directly into another type of differentiated 
cell, a case of “transdifferentiation.” In this report, the acinar cells of 
the pancreas, which produce enzymes responsible for digestion of 
food in the intestine, were transformed into pancreatic beta cells, 
which synthesize and secrete the hormone insulin. The reprogram- 
ming process occurred directly, in a matter of a few days, without 
the cells passing through an intermediate stem cell state—and it 
occurred while the cells remained in their normal residence within 
the pancreas of a live mouse. This feat was accomplished by injec- 
tion into the animals of viruses that carried three genes known to 
be important in differentiation of beta cells in the embryo. In this 


case, the recipients of the injection were diabetic mice, and the 
transdifferentiation of a significant number of acinar cells into beta 
cells allowed the animals to regulate their blood sugar levels with 
much lower doses of insulin. It is also noteworthy that the adenovi- 
ruses used to deliver the genes in this experiment do not become 
a permanent part of the recipient cell, which removes some of the 
concerns about the use of viruses as gene carriers in humans. Since 
this initial report, a number of laboratories have developed in vitro 
techniques to directly convert one type of differentiated cell (typi- 
cally a fibroblast) into another type of cell, such as a neuron, cardio- 
myocyte, or blood-cell precursor, in culture, without passing through 
a pluripotent intermediate. In all of these cases, transdifferentiation 
occurs when the original cells are forced to express certain genes 
that play a role in the normal embryonic differentiation of the other 
cell type. It is too early to know whether this type of direct repro- 
gramming strategy has therapeutic potential, but it certainly raises 
the prospect that diseased cells that need to be replaced might be 
formed directly from other types of cells within the same organ. 


Reference 


1. Bone marrow transplantation can be contrasted to a simple blood transfu- 
sion where the recipient receives differentiated blood cells (especially red 
blood cells and platelets) present in the circulation. 


1.7. The Sizes of Cells 
and Their Components 


FIGURE 1.19 shows the relative size of a number of structures of inter- 
est in cell biology. Two units of linear measure are most commonly 
used to describe structures within a cell: the micrometer (um) and 
the nanometer (nm). One pm is equal to 10° meters, and one nm is 
equal to 10°” meters. The angstrom (A), which is equal to one-tenth 
of a nm, is commonly employed by molecular biologists for atomic 
dimensions. One angstrom is roughly equivalent to the diameter of a 
hydrogen atom. Large biological molecules (i.e, macromolecules) are 
described in either angstroms or nanometers. Myoglobin, a typical 
globular protein, is approximately 4.5 nm x 3.5 nm xX 2.5 nm; highly 
elongated proteins (such as collagen or myosin) are over 100 nm in 
length; and DNA is approximately 2.0 nm in width. Complexes of 
macromolecules, such as ribosomes, microtubules, and microfila- 
ments, are between 5 and 25 nm in diameter. Despite their tiny 
dimensions, these macromolecular complexes constitute remarkably 
sophisticated “nanomachines” capable of performing a diverse array 
of mechanical, chemical, and electrical activities. 

Cells and their organelles are more easily defined in microme- 
ters. Nuclei, for example, are approximately 5-10 um in diameter, 
and mitochondria are approximately 2 um in length. Prokaryotic 
cells typically range in length from about 1 to 5 um, eukaryotic cells 
from about 10 to 30 um. There are a number of reasons most cells 
are so small. Consider the following. 


e Most eukaryotic cells possess a single nucleus that contains only two 
copies of most genes. Because genes serve as templates for the 
production of information-carrying messenger RNAs, a cell can only 
produce a limited number of these messenger RNAs in a given 
amount of time. The greater a cell’s cytoplasmic volume, the longer it 
will take to synthesize the number of messages required by that cell. 


e Asacell increases in size, the surface area/volume ratio decreases.’ 
The ability of a cell to exchange substances with its environment is 
proportional to its surface area. Ifa cell were to grow beyond a 
certain size, its surface would not be sufficient to take up the 
substances (e.g., oxygen, nutrients) needed to support its meta- 
bolic activities. Cells that are specialized for absorption of solutes, 
such as those of the intestinal epithelium, typically possess 
microvilli, which greatly increase the surface area available for 
exchange (see Figure 1.3). The interior of a large plant cell is 
typically filled by a large, fluid-filled vacuole rather than metaboli- 
cally active cytoplasm (see Figure 8.36b). 


e A cell depends to a large degree on the random movement of 
molecules (diffusion). Oxygen, for example, must diffuse from 
the cell’s surface through the cytoplasm to the interior of its 
mitochondria. The time required for diffusion is proportional to 
the square of the distance to be traversed. For example, O, 
requires only 100 microseconds to diffuse a distance of 1 um, 
but requires 10° times as long to diffuse a distance of 1 mm. Asa 
cell becomes larger and the distance from the surface to the 
interior becomes greater, the time required for diffusion to move 
substances in and out of a metabolically active cell becomes 
prohibitively long. 


But despite these constraints, some eukaryotic cells can be extremely 
large. The free-living single celled organism Stentor coeruleus, which 
lives in freshwater ponds, grows to be more than a millimeter long, 
and the giant single-celled green alga Acetabularia is more than 
10 cm long. The gargantuan single-celled green alga Caulerpa can 
grow to a length of several meters and contains millions of nuclei in 
a common cytoplasm. Examples of large cell size are not restricted to 


*You can verify this statement by calculating the surface area and volume of a cube 
whose sides are 1 cm in length versus a cube whose sides are 10 cm in length. The surface 
area/volume ratio of the smaller cube is considerably greater than that of the larger cube. 
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FIGURE 1.19 Relative sizes of cells and cell components. These 
structures differ in size by more than seven orders of magnitude. 


such strange organisms, however. Indeed, we have some examples in 
our own bodies. Neurons send out extremely long processes; motor 
neurons in the human spinal cord, for example, send out axons that 
can be a meter long. 

The fact that cells are collections of nanomachines has inspired 
a research field known as synthetic biology, whose ultimate goal is to 
create some minimal type of living cell in the laboratory out of the 
same types of component parts found in real cell. Synthetic biology 
uses the molecules, molecular complexes, and organelles of a cell as 
buiding blocks, as suggested by the cartoon in FIGURE 1.20. One 
motivation of these researchers is simply to accomplish the feat and, 
in the process, demonstrate that life at the cellular level emerges 
spontaneously when the proper constituents are brought together 
from chemically synthesized materials. At this point in time, biolo- 
gists have only begun the first steps in this direction. Such work 
holds the potential to illuminate the possible origins of life and to 
launch an entirely new approach to biotechnology. However, creat- 
ing life may raise interesting moral and even religious questions. A 
more modest goal of synthetic biology is to develop novel life forms, 
using existing organisms as a starting point, that have a unique value 
in medicine and industry, or in cleaning up the environment. 


FIGURE 1.20 The synthetic biologist’s toolkit of the future? Such a 
toolkit would presumably contain nucleic acids, proteins, lipids, and many 
other types of biomolecules. 


Source: Courtesy of Jakob C. Schweizer. 


If, as most biologists would argue, the properties and activities 
of a cell spring from the genetic blueprint of that cell, then it should 
be possible to create a new type of cell by introducing a new genetic 
blueprint into the cytoplasm of an existing cell. This feat was accom- 
plished by J. Craig Venter and colleagues in 2007, when they replaced 
the genome of one bacterium with a genome isolated from a closely 
related species, effectively transforming one species into the other. 
By 2010, after overcoming a number of stubborn technical road- 
blocks, the team was able to accomplish a similar feat using a copy 
of a bacterial genome that had been assembled (inside of a yeast cell) 
from fragments of DNA that had been chemically synthesized in the 
laboratory. The synthetic copy of the donor genome, which totaled 
approximately 1.1 million base pairs of DNA, contained a number 
of modifications introduced by the researchers. The modified copy 
of the genome (from M. mycoides) was transplanted into a cell of a 
closely related bacterial species (M. capricolum), where it replaced 
the host’s original genome. Following genome transplantation, the 
recipient cell rapidly took on the characteristics of the species from 
which the donor DNA has been derived. In effect, these researchers 
have produced cells containing a “genetic skeleton” to which they 
can add combinations of new genes taken from other organisms. 

Researchers around the world are attempting to genetically 
engineer organisms to possess metabolic pathways capable of pro- 
ducing pharmaceuticals, hydrocarbon-based fuel molecules, and 
other useful chemicals from cheap, simple precursors. Several 
companies are growing genetically engineered cyanobacteria capable 
of producing diesel fuel from sunlight, water, and CO). Researchers 
at another company have genetically engineered the common lab 
bacterium E. coli to ferment the complex polysaccharides present in 
seaweed into the biofuel ethanol. This feat required the introduction 
into E. coli of a combination of genes derived from three other bacte- 
rial species. Work has also begun on “rewriting” the yeast genome, 
signifying that eukaryotic cells have also become part of the effort to 
design genetically engineered biological manufacturing plants. 

In principle, the work described in the Human Perspective, in 
which one type of cell is directed into the formation of an entirely 
different type of cell, is also a form of synthetic biology. As a result of 


these many efforts, biologists are no longer restricted to studying 
cells that are available in Nature, but can also turn their attention to 
cells that can become available through experimental manipulation. 


REVIEW | 


1. Why are cells almost always microscopic? 


2. If a mitochondrion were 2 um in length, how many 
angstroms would it be? How many nanometers? How 
many millimeters? 
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1.8 Viruses and Viroids 


By the end of the nineteenth century, the work of Louis Pasteur and 
others had convinced the scientific world that infectious diseases of 
plants and animals were due to bacteria. But studies of tobacco 
mosaic disease in tobacco plants and hoof-and-mouth disease in cat- 
tle pointed to the existence of another type of infectious agent. It was 
found, for example, that sap from a diseased tobacco plant could 
transmit mosaic disease to a healthy plant, even when the sap showed 
no evidence of bacteria in the light microscope. To gain further 
insight into the size and nature of the infectious agent, Dmitri 
Ivanovsky, a Russian biologist, forced the sap from a diseased plant 
through filters whose pores were so small that they retarded the pas- 
sage of the smallest known bacterium. The filtrate was still infective, 
causing Ivanovsky to conclude in 1892 that certain diseases were 
caused by pathogens that were even smaller, and presumably sim- 
pler, than the smallest known bacteria. These pathogens became 
known as viruses. 

In 1935, Wendell Stanley of the Rockefeller Institute reported 
that the virus responsible for tobacco mosaic disease could be crystal- 
lized and that the crystals were infective. Substances that form crys- 
tals have a highly ordered, well-defined structure and are vastly less 
complex than the simplest cells. Stanley mistakenly concluded that 
tobacco mosaic virus (TMV) was a protein. In fact, TMV is a rod- 
shaped particle consisting of a single molecule of RNA surrounded by 
a helical shell composed of protein subunits (FIGURE 1.21). 

Viruses are responsible for dozens of human diseases, including 
AIDS, polio, influenza, ebola, measles, and a few types of cancer. 
Viruses occur in a wide variety of very different shapes, sizes, and 
constructions, but all of them share certain common properties. All 
viruses are obligatory intracellular parasites; that is, they cannot 


Protein coat 
(capsid) 


Nucleic acid 


(b) 


reproduce unless present within a host cell. Depending on the 23 


specific virus, the host may be a plant, animal, or bacterial cell. 
Outside ofa living cell, the virus exists as a particle, or virion, which 
is little more than a macromolecular package. The virion contains a 
small amount of genetic material that, depending on the virus, can 
be single-stranded or double-stranded, RNA or DNA. Remarkably, 
some viruses have as few as three or four different genes, but others 
may have as many as several hundred. The genetic material of the 
virion is surrounded by a protein capsule, or capsid. Virions are 
macromolecular aggregates, inanimate particles that by themselves 
are unable to reproduce, metabolize, or carry on any of the other 
activities associated with life. For this reason, viruses are not consid- 
ered to be organisms and are not described as being alive. 

Viral capsids are generally made up of a specific number of sub- 
units. There are numerous advantages to construction by subunit, one 
of the most apparent being an economy of genetic information. If a 
viral coat is made of many copies of a single protein, as is that of TMV, 
or a few proteins, as are the coats of many other viruses, the virus 
needs only one or a few genes to code for its protein container. Many 
viruses have a capsid whose subunits are organized into a polyhedron, 
that is, a structure having planar faces. A particularly common poly- 
hedral shape of viruses is the 20-sided icosahedron. For example, ade- 
novirus, which causes respiratory infections in mammals, has an 
icosahedral capsid (FIGURE 1.22a). In many animal viruses, including 
the human immunodeficiency virus (HIV) responsible for AIDS, the 
protein capsid is surrounded by a lipid-containing outer envelope that 
is derived from the modified plasma membrane of the host cell as the 
virus buds from the host-cell surface (Figure 1.22b). Bacterial viruses, 
or bacteriophages, are among the most complex viruses (Figure 1.22c). 
They are also the most abundant biological entities on Earth. The 
T bacteriophages (which were used in key experiments that revealed 
the structure and properties of the genetic material) consist of a poly- 
hedral head containing DNA, a cylindrical stalk through which the 
DNA is injected into the bacterial cell, and tail fibers, which together 
cause the particle to resemble a landing module for the moon. 

Each virus has on its surface a protein that is able to bind to a 
particular surface component of its host cell. For example, the pro- 
tein that projects from the surface of the HIV particle (labeled gp120 
in Figure 1.22b, which stands for glycoprotein of molecular mass 
120,000 daltons*) interacts with a specific protein (called CD4) on 
the surface of certain white blood cells, facilitating entry of the virus 


One dalton is equivalent to one unit of atomic mass, the mass of a single hydrogen 
(1H) atom. 


FIGURE 1.21 Tobacco mosaic virus 
(TMV). (a) Model of a portion of a TMV 
particle. The protein subunits, which are 
identical along the entire rod-shaped 
particle, enclose a single helical RNA 
molecule (red). (b) Electron micrograph of 
TMV particles after phenol has removed the 
protein subunits from the middle part of the 
upper particle and the ends of the lower 
particle. Intact rods are approximately 300 
nm long and 18 nm in diameter. 

Source: (a) Courtesy of Gerald Stubbs, 
Keuchi Namba, and Donald Caspar; 

(b) Courtesy M.K. Corbett. 
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FIGURE 1.22 Virus diversity. The structures of (a) an adenovirus, (b) a human immunodeficiency virus (HIV), and (c) a T-even bacteriophage. Note: These 


viruses are not drawn to the same scale. 


into its host cell. The interaction between viral and host proteins 
determines the specificity of the virus, that is, the types of host cells 
that the virus can enter and infect. Some viruses have a wide host 
range, being able to infect cells from a variety of different organs or 
host species. The virus that causes rabies, for example, is able to 
infect many different types of mammalian hosts, including dogs, 
bats, and humans. Most viruses, however, have a relatively narrow 
host range. This is true, for example, of human cold and influenza 
viruses, which are generally able to infect only the respiratory epi- 
thelial cells of human hosts. 

A change in host-cell specificity can have striking conse- 
quences. This point is dramatically illustrated by the 1918 influenza 
pandemic, which killed more than 30 million people worldwide. 
The virus was especially lethal in young adults, who do not nor- 
mally fall victim to influenza. In fact, the 675,000 deaths from this 
virus in the United States temporarily lowered average life expec- 
tancy by several years. In one of the most acclaimed—and 
controversial—feats of the past few years, researchers have been 
able to determine the genomic sequence of the virus responsible for 
this pandemic and to reconstitute the virus in its full virulent state. 
This was accomplished by isolating the viral genes (which are part 
of a genome consisting of eight separate RNA molecules encoding 
11 different proteins) from the preserved tissues of victims who had 
died from the infection 90 years earlier. The best preserved samples 
were obtained from a Native American woman who had been bur- 
ied in the Alaskan permafrost. The sequence of the “1918 virus” 
suggested that the pathogen had jumped from birds to humans. 
Although the virus had accumulated a considerable number of 


mutations, which adapted it to a mammalian host, it had never 
exchanged genetic material with that of a human influenza virus as 
had been thought likely. 

Analysis of the sequence of the 1918 virus has provided some 
clues to explain why it was so deadly and how it spread so efficiently 
from one human to another. Using the genomic sequence, research- 
ers reconstituted the 1918 virus into infectious particles, which were 
found to be exceptionally virulent in laboratory tests. Whereas labo- 
ratory mice normally survive infection by modern human influenza 
viruses, the reconstituted 1918 strain killed 100 percent of infected 
mice and produced enormous numbers of viral particles in the ani- 
mals’ lungs. Because of the potential risk to public health, publica- 
tion of the full sequence of the 1918 virus and its reconstitution went 
forward only after approval by governmental safety panels and the 
demonstration that existing influenza vaccines and drugs protect 
mice from the reconstituted virus. 

There are two basic types of viral infection. (1) In most cases, 
the virus arrests the normal synthetic activities of the host and redi- 
rects the cell to use its available materials to manufacture viral 
nucleic acids and proteins, which assemble into new virions. Viruses, 
in other words, do not grow like cells; they are assembled from com- 
ponents directly into the mature-sized virions. Ultimately, the 
infected cell ruptures (lyses) and releases a new generation of viral 
particles capable of infecting neighboring cells. An example of this 
type of lytic infection is shown in FIGURE 1.23a. (2) In other cases, 
the infecting virus does not lead to the death of the host cell, but 
instead inserts (integrates) its DNA into the DNA of the host cell’s 
chromosomes. The integrated viral DNA is called a provirus. An 
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FIGURE 1.23 A virus infection. (a) Micrograph showing a late stage in the infection of a bacterial cell by a bacteriophage. Virus particles are being 
assembled within the cell, and empty phage coats are still present on the cell surface. (b) Micrograph showing HIV particles budding from an infected human 


lymphocyte. 


Source: (a) Courtesy of Jonathan King and Erika Hartwig; (b) Courtesy of Hans Gelderblom. 


integrated provirus can have different effects depending on the type 
of virus and host cell. For example, 


e Bacterial cells containing a provirus behave normally until 
exposed to a stimulus, such as ultraviolet radiation, that activates 
the dormant viral DNA, leading to the lysis of the cell and release 
of viral progeny. 

e@ Some animal cells containing a provirus produce new viral 
progeny that bud at the cell surface without lysing the infected 
cell. Human immunodeficiency virus (HIV) acts in this way; an 
infected cell may remain alive for a period, acting as a factory for 
the production of new virions (Figure 1.23b). 


e@ Some animal cells containing a provirus lose control over their 
own growth and division and become malignant. This phenom- 
enon is readily studied in the laboratory by infecting cultured 
cells with the appropriate tumor virus. 


Viruses are not without their virtues. Because the activities of 
viral genes mimic those of host genes, investigators have used viruses 
for decades as a research tool to study the mechanism of DNA repli- 
cation and gene expression in their much more complex hosts. In 
addition, viruses are now being used as a means to introduce foreign 
genes into human cells, a technique that will likely serve as the basis 
for the treatment of human diseases by gene therapy. Last, insect- 
and bacteria-killing viruses may play an increasing role in the war 
against insect pests and bacterial pathogens. Bacteriophages have 
been used for decades to treat bacterial infections in eastern Europe 
and Russia, while physicians in the West have relied on antibiotics. 
Given the rise in antibiotic-resistant bacteria, bacteriophages may be 
making a comeback on the heels of promising studies on infected 
mice. Several biotechnology companies are now producing bacterio- 
phages intended to combat bacterial infections and to protect certain 
foods from bacterial contamination. 


It came as a surprise in 1971 to discover that viruses are not the 
simplest types of infectious agents. In that year, T. O. Diener of the 
US. Department of Agriculture reported that potato spindle-tuber 
disease, which causes potatoes to become gnarled and cracked, is 
caused by an infectious agent consisting of a small circular RNA 
molecule that totally lacks a protein coat. Diener named the patho- 
gen a viroid. The RNAs of viroids range in size from about 240 to 
600 nucleotides, one-tenth the size of the smaller viruses. No 
evidence has been found that the naked viroid RNA encodes any 
proteins. Rather, any biochemical activities in which viroids engage 
take place using host-cell proteins. For example, duplication of the 
viroid RNA within an infected cell utilizes the host's RNA polymer- 
ase II, an enzyme that normally transcribes the host’s DNA into 
messenger RNAs. Viroids are thought to cause disease by interfering 
with the cell’s normal path of gene expression. The effect on crops 
can be serious: A viroid disease called cadang-cadang has devastated 
the coconut palm groves of the Philippines, and another viroid has 
wreaked havoc on the chrysanthemum industry in the United 
States. The discovery of a different type of infectious agent even 
simpler than a viroid is described in the Human Perspective in 
Chapter 2. 


REVIEW | 


1. What properties distinguish a virus from a bacterium? 
2. What types of infections are viruses able to cause? 


3. Compare and contrast: nucleoid and nucleus; the 
flagellum of a bacterium and a sperm; an 
archaebacterium and a cyanobacterium; nitrogen 
fixation and photosynthesis; bacteriophages and 


tobacco mosaic virus; a provirus and a virion. 
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1.9 EXPERIMENTAL PATHWAYS 


The Origin of Eukaryotic Cells 


We have seen in this chapter that cells can be divided into two 
groups: prokaryotic cells and eukaryotic cells. Almost from the time 
this division of cellular life was proposed, biologists have been 
fascinated by the question: What is the origin of the eukaryotic 
cell? It is generally agreed that prokaryotic cells (1) arose before 
eukaryotic cells and (2) gave rise to eukaryotic cells. The first point 
can be verified directly from the fossil record, which shows that 
prokaryotic cells were present in rocks approximately 2.7 billion 
years old (page 8), which is roughly one billion years before any 
evidence is seen of eukaryotes. The second point follows from the 
fact that the two types of cells have to be related to one another 
because they share many complex traits (e.g., very similar genetic 
codes, enzymes, metabolic pathways, and plasma membranes) 
that could not have evolved independently in different organisms. 

Until about 1970, it was generally believed that eukaryotic 
cells evolved from prokaryotic cells by a process of gradual evo- 
lution in which the organelles of the eukaryotic cell became pro- 
gressively more complex. Acceptance of this concept changed 
dramatically about that time largely through the work of Lynn 
Margulis, then at Boston University. Margulis resurrected an 
idea that had been proposed earlier, and dismissed, that certain 
organelles of a eukaryotic cell—most notably the mitochondria 
and chloroplasts—had evolved from smaller prokaryotic cells that 
had taken up residence in the cytoplasm of a larger host cell.' This 
hypothesis is referred to as the endosymbiont theory because it 
describes how a single “composite” cell of greater complexity 
could evolve from two or more separate, simpler cells living in a 
symbiotic relationship with one another. 


Prokaryote 


First eukaryotic common ancestor 


(FECA) 


Our earliest prokaryotic ancestors were presumed to have 
been anaerobic heterotrophic cells: anaerobic meaning they 
derived their energy from food matter without employing molec- 
ular oxygen (Oz) and heterotrophic meaning they were unable 
to synthesize organic compounds from inorganic precursors 
(such as CO, and water), but instead had to obtain preformed 
organic compounds from their environment. These prokary- 
otic ancestors are then thought to have acquired the ability to 
form internal membrane compartments, allowing formation 
of a nucleus by containing the DNA within an internal mem- 
brane. This development of internal membranes produced the 
first organism that would be considered eukaryote-like in terms 
of having a nucleus or other internal compartments (FIGURE 1). 
Because this is the first organism that subsequently gave rise to 
all eukaryotes, it is known as the first eukaryotic common ances- 
ter (FECA). Although the presence of internal membranes was 
once though to be an exclusively eukaryotic trait, it is now known 
that some bacteria can in fact form extensive complex internal 
membrane systems. The most dramatic example known to date is 
the bacterium Gemmata obscuriglobus, which forms a variety of 
complex internal membranes (FIGURE 2). However, careful three- 
dimensional reconstructions of G. obscuriglobus structure show 
that these membranes do not form closed compartments like 
eukaryotic organelles.” It thus appears that the key step in pro- 
ducing the FECA was not formation of internal membranes per 
se, but the further development of these membranes into closed 
internal compartments, particularly a compartment surrounding 
the DNA to produce a nucleus. 


Last eukaryotic common ancestor 
(LECA) 


FIGURE 1 A model depicting stages in the evolution of eukaryotes. Starting from a prokaryotic ancestor, internal compartments began to 
develop, leading to an organism with internal membrane compartments such as a nucleus. Such an organism is known as the First Eukaryotic 
Common Ancestor (FECA). The molecular machinery for making internal membranes then allowed the FECA to engulf and maintain endosymbiotic 
organisms, allowing acquisition of mitochondria. Additional evolutionary innovations gave rise to cellular features common to all eukaryotic lineages, 
including cilia, intron splicing, and meiosis. The organism that had all these traits, and therefore gave rise to all existing eukaryotic lineages, is known 
as the Last Eukaryotic Common Ancestor (LECA). After the LECA arose, further evolutionary steps, such as endosymbiosis of photosynthetic bacteria 


to produce chloroplasts, gave rise to different classes of eukaryotic cells. 
Source: From F.D. Mast et al., Trends Cell Biol. 24:435-442, 2014. 


FIGURE 2 Prokaryotes with complex internal membrane systems. 
Electron micrograph of Gemmata obscuriglobus, a bacterium with a 
complex set of internal membranes. Although these membranes do not 
form closed organelles as they would in eukaryotes, they show that a 
potential for membrane organization exists even in prokaryotes. 

Source: From R. Santarella-Mellwig et al., Plos Biol. 11:E1001565, 2013. 


According to the endosymbiont theory, the next step in the 
evolution of modern eukaryotes was when a descendent of the 
FECA cell ingested a small, aerobic prokaryote which somehow 
resisted digestion within the cytoplasm, taking up residence as 
a permanent endosymbiont. As the host cell reproduced, so did 
the endosymbiont, so that a colony of these composite cells was 
soon produced. Over many generations, endosymbionts lost 
many of the traits that were no longer required for survival, and 
the once-independent oxygen-respiring microbes evolved into 
precursors of modern-day mitochondria. A cell whose ances- 
tors had formed through the sequence of symbiotic events just 
described could have given rise to a line of cells that evolved 
other basic characteristics of eukaryotic cells, including additional 
internal organelles (endoplasmic reticulum, Golgi complex, lyso- 
somes), a complex cytoskeleton including cilia, intron splicing, 
and both mitotic and meiotic cell division. These characteristics, 
which are shared among all existing eukaryotic lineages, are pro- 
posed to have arisen by a gradual process of evolution, rather 
than in a single step as might occur through acquisition of an 
endosymbiont. All eukaryotes alive today descended from the 
cell that acquired these traits, and it is therefore known as the last 
eukaryotic common ancester (LECA). Current research on evolu- 
tionary cell biology is focused on reconstructing the molecular, 
structural, and functional features of the FECA and LECA by com- 
paring features of existing eukaryotic and prokaryotic lineages. 
The oldest fossils thought to be the remains of eukaryotes date 
back about 1.8 billion years. 

Margulis proposed that the acquisition of another endosym- 
biont, specifically a cyanobacterium, converted an early hetero- 
trophic eukaryote into an ancestor of photosynthetic eukaryotes: 
the green algae and plants.? The acquisition of chloroplasts 
(roughly one billion years ago) must have been one of the last 
steps in the sequence of endosymbioses because these organ- 
elles are only present in plants and algae. In contrast, all known 


groups of eukaryotes either (1) possess mitochondria or (2) show 
definitive evidence they have evolved from organisms that pos- 
sessed these organelles.° The concept that mitochondria and 
chloroplasts arose via evolution from symbiotic organisms is now 
supported by an overwhelming body of evidence, some of which 
will be described in numerous chapters of this text. 

The division of all living organisms into two categories, 
prokaryotes and eukaryotes, reflects a basic dichotomy in the 
structures of cells, but it is not necessarily an accurate phylogenetic 
distinction, that is, one that reflects the evolutionary relationships 
among living organisms. How does one determine evolutionary 
relationships among organisms that have been separated in time 
for billions of years, such as prokaryotes and eukaryotes? Modern 
taxonomic schemes that attempt to classify organisms are based 
on comparisons of the DNA sequences of living organisms.* 
Differences between organisms in the sequence of nucleotides 
that make up a nucleic acid are the result of mutations in DNA that 
have been transmitted to offspring. Mutations can accumulate in a 
given gene at a relatively constant rate over long periods of time. 
Consequently, comparisons of nucleotide sequences can be used 
to determine how closely organisms are related to one another. 
For example, two organisms that are closely related, that is, have 
diverged only recently from a common ancestor, should have fewer 
sequence differences in a particular gene than two organisms that 
are distantly related, that is, do not have a recent common ancestor. 
Using this type of sequence information as an “evolutionary clock,” 
researchers can construct phylogenetic trees showing proposed 
pathways by which different groups of living organisms may have 
diverged from one another during the course of evolution. 

Beginning in the mid-1970s, Carl Woese and his colleagues 
at the University of Illinois began a series of studies that compared 
the nucleotide sequence in different organisms of the RNA mole- 
cule that resides in the small subunit of the ribosome. This RNA— 
which is called the 16S rRNA in prokaryotes or the 18S rRNA in 
eukaryotes—was chosen because it is present in large quantities 
in all cells, it is easy to purify, and it tends to change only slowly 
over long periods of evolutionary time, which means that it could 
be used to study relationships of very distantly related organisms. 
In one of their first studies, Woese and his colleagues analyzed 
the rRNA present in the ribosomes of chloroplasts from the pho- 
tosynthetic protist Euglena.° They found that the sequence of this 
chloroplast rRNA molecule was much more similar to that of the 
16S rRNA found in ribosomes of cyanobacteria than it was to its 
18S counterpart in the ribosomes from eukaryotic cytoplasm. This 
finding provided strong evidence for the symbiotic origin of chlo- 
roplasts from cyanobacteria. 

In 1977, Woese and George Fox published a landmark paper 
in the study of molecular evolution. They compared the nucle- 
otide sequences of small-subunit rRNAs that had been puri- 
fied from 13 different prokaryotic and eukaryotic species. They 
found that the sequences clustered into three distinct groups, 
such that the rRNAs within each group are much more similar 
to one another than they are to rRNAs of the other two groups. The 
first of the groups contained only eukaryotes; the second group 
contained the “typical” bacteria (gram-positive, gram-negative, 
and cyanobacteria); and the third group contained several species 
of methanogenic (methane-producing) “bacteria.” Woese and 


“There are a number of anaerobic unicellular eukaryotes (e.g., the intestinal 
parasite Giardia) that lack mitochondria. For years, these organisms formed 
the basis for a proposal that mitochondrial endosymbiosis was a late event 
that took place after the evolution of these mitochondria-lacking groups. 
However, recent analysis of the nuclear DNA of these organisms indicates 
the presence of genes that were likely transferred to the nucleus from 
mitochondria, suggesting that the ancestors of these organisms lost their 
mitochondria during the course of evolution. 


continued 
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FIGURE 3 Domains of life. (a) A phylogenetic tree based on rRNA sequence comparisons showing the three domains of life. The Archaea are 
divided into two subgroups as indicated. (b) Phylogenetic relation between existing eukaryotic lineages. Although initial analysis of rRNA such as in 


Panel A suggested a series of early and late branching events to produce different lineages of eukaryoties, more careful analysis of genes and 


genomes now suggests that six major lineages all diverged from the LECA to produce distinct classes of eukaryotes known as “supergroups.” 


Animals and fungi, while looking different, are highly similar at the molecular level and together form a single group, the Opisthokonta. Plants 


and green algae, again while looking different, are closely related by molecular phylgeny and form a group called Archaeplastida. A wide range of 


other eukaryotes including many different species formerly lumped together as “protists” are now clearly divided into four distinct groups with 


unfamiliar names Excavata (which includes the parasites Giardia and Naegleria), Amoebozoa, SAR (which includes ciliates like Paramecium as well as 


diatoms and brown algae such as giant kelp), and CCTH (composed entirely of obscure and unfamiliar single-celled organisms whose biology is 


poorly understood). 


Source: (a) From C. R. Woese et al., Proc. Nat’l. Acad. Sci. U.S.A. 87:4578, 1990; (bb) From F.D. Mast et al., Tends Cell Biol. 24:435-442, 2014. 


Fox concluded, to their surprise, that the methanogenic organisms 
“appear to be no more related to typical bacteria than they are 
to eukaryotic cytoplasms.” These results suggested that the mem- 
bers of these three groups represent three distinct evolutionary 
lines that branched apart from one another at a very early stage in 
the evolution of cellular organisms. Consequently, they assigned 
these organisms to three different kingdoms, which they named 
the Urkaryotes, Eubacteria, and Archaebacteria, a terminology that 
divided the prokaryotes into two fundamentally distinct groups. 
Subsequent research provided support for the concept that 
prokaryotes could be divided into two distantly related lineages, 
and it expanded the ranks of the archaebacteria to include at 
least two other groups, the thermophiles, which live in hot springs 
and ocean vents, and the halophiles, which live in very salty lakes and 
seas. In 1989, two published reports rooted the tree of life and 
suggested that the archaebacteria were actually more closely 
related to eukaryotes than they were to eubacteria.”® Both 
groups of researchers compared the amino acid sequences of 
several proteins that were present in a wide variety of different 
prokaryotes, eukaryotes, mitochondria, and chloroplasts. A phy- 
logenetic tree constructed from sequences of ribosomal RNAs, 
which comes to the same conclusion, is shown in FIGURE 3a.” In 
this latter paper, Woese and colleagues proposed a revised taxo- 
nomic scheme, which has been widely accepted. In this scheme, 
the archaebacteria, eubacteria, and eukaryotes are assigned 
to separate domains, which are named Archaea, Bacteria, and 
Eucarya, respectively. Similar DNA sequence analysis studies 


have shown that eukaryotes then split into six distinct lineages 
(Figure 3b), of which animals including humans fall into a group 
known as “opisthokonts.” According to the model in Figure 3a, 
the first major split in the tree of life produced two separate 
lineages, one leading to the Bacteria and the other leading to 
both the Archaea and the Eucarya. If this view is correct, it was 
an archaebacterium, not a eubacterium, that took in a symbiont 
and gave rise to the lineage that led to the first eukaryotic cells. 
Although the host prokaryote was presumably an archaebacte- 
rium, the symbionts that evolved into mitochondria and chlo- 
roplasts were almost certainly eubacteria, as indicated by their 
close relationship with modern members of this group. 

Until 1995, phylogenetic trees of the type shown in Figure 3a 
were based primarily on the analysis of the gene encoding the 
16S-18S rRNA. By then, phylogenetic comparisons of a number 
of other genes were suggesting that the scheme depicted in 
Figure 3a might be oversimplified. Questions about the origin of 
prokaryotic and eukaryotic cells came into sharp focus between 
1995 and 1997 with the publication of the entire sequences of a 
number of prokaryotic genomes, both archaebacterial and eubac- 
terial, and the genome of a eukaryote, the yeast Saccharomyces 
cerevisiae. Researchers could now compare the sequences of hun- 
dreds of genes simultaneously, and this analysis raised a number 
of puzzling questions and blurred the lines of distinction between 
the three domains. '° For example, the genomes of several archae- 
bacteria showed the presence of a significant number of eubacte- 
rial genes. For the most part, those genes in archaebacteria whose 


>Many biologists dislike the terms archaebacteria and eubacteria. Although these terms have gradually faded from the literature, being replaced simply by archaea 
and bacteria, many researchers in this field continue to use the former terms in published articles. Given that this is an introductory chapter in an introductory text, 
we have continued to refer to these organisms as archaebacteria and eubacteria to avoid possible confusion over the meaning of the term bacterial. 


products are involved with informational processes (chromosome 
structure, transcription, translation, and replication) were very 
different from their counterparts in eubacterial cells and, in fact, 
resembled the corresponding genes in eukaryotic cells. This 
observation fit nicely with the scheme in Figure 3a. In contrast, 
many of the genes in archaebacteria that encode the enzymes of 
metabolism exhibited an unmistakable eubacterial character.'''* 
The genomes of eubacterial species also showed evidence of a 
mixed origin, often containing a significant number of genes that 
bore an archaebacterial character.'% 

Most investigators who study the origin of ancient organ- 
isms have held on to the basic outline of the phylogenetic tree as 
demarcated in Figure 3a and argue that the presence of eubac- 
teria-like genes in archaebacteria, and vice versa, is the result of 
the transfer of genes from one species to another, a phenomenon 
referred to as horizontal gene transfer (HGT), sometimes also 
called lateral gene transfer.'* According to the original premise 
that led to the phylogenetic tree of Figure 3a, genes are inherited 
from one’s parents, not from one’s neighbors. This is the premise 
that allows an investigator to conclude that two species are closely 
related when they both possess a gene (e.g., the rRNA gene) of 
similar nucleotide sequence. If, however, cells can pick up genes 
from other species in their environment, then two species that are 
actually unrelated may possess genes of very similar sequence. 
An early measure of the importance of horizontal gene transfer 
in the evolution of prokaryotes came from a study that com- 
pared the genomes of two related eubacteria, Escherichia and 
Salmonella. It was found that 755 genes or nearly 20 percent of 
the E. coli genome is derived from “foreign” genes transferred 
into the E. coli genome over the past 100 million years, which 
is the time when the two eubacteria diverged. These 755 genes 
were acquired as the result of at least 234 separate lateral trans- 
fers from many different sources.'> (The effect of horizontal gene 
transfer on antibiotic resistance in pathogenic bacteria is dis- 
cussed in the Human Perspective of Chapter 3.) 

If genomes are a mosaic composed of genes from diverse 
sources, how does one choose which genes to use in determining 
phylogenetic relationships? According to one viewpoint, genes 
that are involved in informational activities (transcription, trans- 
lation, replication) make the best subjects for determining phy- 
logenetic relationships, because such genes are less likely to be 
transferred laterally than genes involved in metabolic reactions." 
These authors argue that the products of informational genes 
(e.g., rRNAs) are parts of large complexes whose components 
must interact with many other molecules. It is unlikely that a 
foreign gene product could become integrated into the existing 
machinery. When “informational genes” are used as the subjects 
of comparison, archaebacteria and eubacteria tend to sepa- 
rate into distinctly different groups, whereas archaebacteria and 
eukaryotes tend to group together as evolutionary relatives, just 
as they do in Figure 3 See reference 17 for further discussion. 

Analysis of eukaryotic genomes has produced similar evi- 
dence of a mixed heritage. Studies of the yeast genome show 
unmistakable presence of genes derived from both archaebac- 
teria and eubacteria. The “informational genes” tend to have 
an archaeal character and the “metabolic genes” a eubacterial 
character.'® There are several possible explanations for the mixed 
character of the eukaryotic genome. Eukaryotic cells may have 
evolved from archaebacterial ancestors and then picked up genes 
from eubacteria with which they shared environments. In addition, 
some of the genes in the nucleus of a eukaryotic cell are clearly 
derived from eubacterial genes that have been transferred from 
the genome of the symbionts that evolved into mitochondria and 


chloroplasts.'? A number of researchers have taken a more radical 
position and proposed that the eukaryote genome was originally 
derived from the fusion of an archaebacterial and a eubacterial 
cell followed by the integration of their two genomes.*9"7? Given 
these various routes of gene acquisition, it is evident that no 
simple phylogenetic tree, such as that depicted in Figure 3a, can 
represent the evolutionary history of the entire genome of an 
organism. ®evewed in 21-23 Instead, each gene or group of genes of 
a particular genome may have its own unique evolutionary tree, 
which can be a disconcerting thought to scientists seeking to 
determine the origin of our earliest eukaryotic ancestors. 
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Consider some question about cell structure or function that you 
would be interested in answering. Would the data required to 
answer the question be easier to collect by working on an entire 
plant or animal or on a population of cultured cells? What might 
be the advantages and disadvantages of working on a whole 
organism versus a cell culture? 


. Figure 1.3 shows an intestinal epithelial cell with large numbers of 


microvilli. What is the advantage to the organism of having these 
microvilli? What do you expect would happen to an individual 
that lacked such microvilli as the result of an inherited mutation? 


. The first human cells to be successfully cultured were derived 


from a malignant tumor. Do you think this simply reflects the 
availability of cancer cells, or might such cells be better subjects 
for cell culture? Why? 


. The drawings of plant and animal cells in Figure 1.8b,c include 


certain structures that are present in plant cells but absent in 
animal cells. How do you think each of these structures affects 
the life of the plant? 


. It was noted that cells possess receptors on their surface that 


allow them to respond to specific stimuli. Many cells in the 
human body possess receptors that allow them to bind specific 
hormones that circulate in the blood. Why do you think these 
hormone receptors are important? What would be the effect on 
the physiological activities of the body if cells lacked these 
receptors, or if all cells had the same receptors? 


. Ifyou were to argue that viruses are living organisms, what features 


of viral structure and function might you use in your argument? 


. If we presume that activities within cells do occur in a manner 


analogous to that shown in the Rube Goldberg cartoon of 
Figure 1.7, how would this differ from a human activity, such as 
building a car on an assembly line or shooting a free throw in a 
basketball game? 


. Unlike bacterial cells, the nucleus of a eukaryotic cell is bounded 


by a double-layered membrane studded by complex pores. How 
do you think this might affect traffic between the DNA and cyto- 
plasm of a eukaryotic cell compared to that of a prokaryotic cell? 


10. 


W. 


12: 


13. 


Examine the photograph of the ciliated protist in Figure 1.16 and 
consider some of the activities in which this cell engages that a 
muscle or nerve cell in your body does not. 


Which type of cell would you expect to achieve the largest 
volume: a highly flattened cell or a spherical cell? Why? 
(See Quantitative Tutorial Video). 


Suppose you were a scientist living in the 1890s and were study- 
ing a disease of tobacco crops that stunted the growth of the 
plants and mottled their leaves. You find that the sap from a 
diseased plant, when added to a healthy plant, is capable of 
transmitting the disease to that plant. You examine the sap in 
the best light microscopes of the period and see no evidence of 
bacteria. You force the sap through filters whose pores are so 
small that they retard the passage of the smallest known bacteria, 
yet the fluid that passes through the filters is still able to transmit 
the disease. Like Dimitri lvanovsky, who conducted these experi- 
ments more than a hundred years ago, you would probably 
conclude that the infectious agent was an unknown type of 
unusually small bacterium. What kinds of experiments might you 
perform today to test this hypothesis? 


Most evolutionary biologists believe that all mitochondria have 
evolved from a single ancestral mitochondrion and all chloroplasts 
have evolved from a single ancestral chloroplast. In other words, 
the symbiotic event that gave rise to each of these organelles 
occurred only once. If this is the case, where on the phylogenetic 
tree of Figure 3a of the Experimental Pathways section, page 28, 
would you place the acquisition of each of these organelles? 


Publication of the complete sequence of the 1918 flu virus and 
reconstitution of active viral particles was met with great contro- 
versy. Those who favored publication of the work argued that 
this type of information can help to better understand the viru- 
lence of influenza viruses and help develop better therapeutics 
against them. Those opposed to its publication argued that the 
virus could be reconstituted by bioterrorists or that another pan- 
demic could be created by the accidental release of the virus by 
a careless investigator. What is your opinion on the merits of 
conducting this type of work? 


The Chemical Basis of Life 


THE CHEMICAL ORIGIN OF LIFE 


Poona 4.6 billion years ago, our solar system was 
formed from an enormous rotating cloud of gas and 
dust. For its first billion years, Earth was a tumultuous 
place, with violent volcanic eruptions and near constant 
collisions with asteroids. Yet it is during this very time 
period that scientists believe that life, in the form of 
primitive cells, first appeared on the planet. Ancient 
microbes are thought to have formed stratified rock 
formations, called stromatolites, suggesting that 

life may have proliferated as early as 3.5 billion 


years ago. 


Researchers have hypothesized that the earliest cells 
(called protocells) were very simple, made up of just 
nucleic acids (such as DNA or RNA) surrounded by a 
membrane, and that these cells may have formed in 
warm pools of water, or in the ocean near deep-sea 
vents. A central dilemma, however, has been in under- 
standing how nucleic acid and membrane molecules 
formed spontaneously on the young planet—how, 
essentially, biology was born from chemistry. 


In 1952, Harold Urey and his graduate student, Stanley 
Miller, designed an experiment to test whether the 


continued 


Building life from scratch: computer rendering of a proto-cell, an artificial lipid 


vesicle containing self-replicating nucleic acids. The ability of such protocells to 


blocks underscores the critical role of chemistry in the origins of life. 


Source: Image created by Janet Iwasa. Used by permission. 
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self-assemble and replicate when supplied with the right mix of chemical building 
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32, conditions on the early Earth favored the spontaneous 
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synthesis of biological molecules. They simulated Earth's early 
atmosphere by circulating water, methane, ammonia, and 
hydrogen in a sealed glass apparatus, and introducing energy 
in the form of heat and electricity (which mimicked the effect 
of lightning). Over the course of two weeks, the glass became 
coated with organic compounds that included a variety amino 
acids and sugars, supporting the idea that the conditions on 
the young planet may have been ideal for creating the organic 
compounds that were eventually incorporated into early cells. 


In recent years, researchers have found prebiotically feasible 
pathways to create a number of additional molecules, 


2.1 Covalent Bonds 


We will begin this chapter with a brief examination of the atomic 
basis of living matter. The cellular level of organization is only a small 
step from the atomic level, as will become evident when we examine 
the importance of the movement of a few atoms of a molecule during 
such activities as muscle contraction or the transport of substances 
across cell membranes. The properties of cells and their organelles 
derive directly from the activities of the molecules of which they are 
composed. Consider a process such as cell division, which can be 
followed in considerable detail under a simple light microscope. To 
understand the activities that occur when a cell divides, one needs to 
know, for example, about the interactions between DNA and protein 
molecules that cause the chromosomes to condense into rod-shaped 
packages that can be separated into different cells; the molecular 
construction of protein-containing microtubules that allows them to 
disassemble at one moment in the cell and reassemble the next 
moment in an entirely different cellular location; and the properties 
of lipid molecules that make the outer cell membrane deformable so 
that it can be pulled into the middle of a cell, thereby pinching the 
cell in two. It is impossible even to begin to understand cellular func- 
tion without a reasonable knowledge of the structure and properties 
of the major types of biological molecules. This is the goal of the 
present chapter: to provide an understanding of the chemistry that 
allows biological events to occur, giving rise to life. We will begin by 
considering the types of bonds that atoms can form with one another. 

The atoms that make up a molecule are joined together by 
covalent bonds in which pairs of electrons are shared between 
pairs of atoms. The formation of a covalent bond between two atoms 
is governed by the fundamental principle that an atom is most stable 
when its outermost electron shell is filled. Consequently, the number 
of bonds an atom can form depends on the number of electrons 
needed to fill its outer shell. 

The electronic structure of a number of atoms is shown in 
FIGURE 2.1. The outer (and only) shell of a hydrogen or helium atom 
is filled when it contains two electrons; the outer shells of the other 
atoms in Figure 2.1 are filled when they contain eight electrons. Thus, 
an oxygen atom, with six outer-shell electrons, can fill its outer shell 
by combining with two hydrogen atoms, forming a molecule of water. 
The oxygen atom is linked to each hydrogen atom by a single cova- 
lent bond (denoted as H:O or H—O). The formation of a covalent 


including ribonucleotides and fatty acids, the building blocks 
of RNA and membranes. Some laboratories are now 
attempting to find conditions that would allow for the 
formation of protocells. 


There are many mysteries that still remain. Molecules created 
by the Miller-Urey experiment tend to be an even mixture of 
left- and right-handed isomers (which are mirror images of 
each other), but all life on Earth use only left-handed amino 
acids and right-handed sugars. How and why did this 
selection come about? 


bond is accompanied by the release of energy, which must be reab- 
sorbed at some later time if the bond is to be broken. The energy 
required to cleave C—H, C—C, or C—O covalent bonds is quite 
large—typically between 80 and 100 kilocalories per mole (kcal/ 
mol)! of molecules. By comparison, the thermal energy of a molecule 
is only 0.6 kcal/mol. The thermal vibrations acting on a molecule are 
thus far too weak to break a covalent bond, making these bonds sta- 
ble under most conditions. In this chapter, when we speak of bonds 
being strong, we mean that the energy required to break the bond is 
much greater than the thermal energy of the molecule. Conversely, 
when we talk about bonds being weak, we mean that the energy 
required to break the bond is of the same magnitude or smaller than 
the thermal energy. 

In many cases, two atoms can become joined by bonds in which 
more than one pair of electrons are shared. If two electron pairs are 
shared, as occurs in molecular oxygen (O,), the covalent bond is 
a double bond, and if three pairs of electrons are shared (as in molec- 
ular nitrogen, N;), it is a triple bond. Quadruple bonds are not known 
to occur. The type of bond between atoms has important conse- 
quences in determining the shapes of molecules. For example, atoms 
joined by a single bond are able to rotate relative to one another, 
whereas the atoms of double (and triple) bonds lack this ability. 
As illustrated in Figure 6.6, double bonds can function as energy- 
capturing centers, driving such vital processes as respiration and 
photosynthesis. 

When atoms of the same element bond to one another, as in H, 
the electron pairs of the outer shell are equally shared between the 
two bonded atoms. When two unlike atoms are covalently bonded, 
however, the positively charged nucleus of one atom exerts a greater 
attractive force on the outer electrons than the other. Consequently, 
the shared electrons tend to be located more closely to the atom with 
the greater attractive force, that is, the more electronegative atom. 
Among the atoms most commonly present in biological molecules, 
nitrogen and oxygen are strongly electronegative. 


‘One calorie is the amount of thermal energy required to raise the temperature of one 
gram of water one degree Celsius. One kilocalorie (kcal) equals 1000 calories (or one 
large Calorie). In addition to calories, energy can also be expressed in Joules, which is 
a term that was used historically to measure energy in the form of work. One 
kilocalorie is equivalent to 4186 Joules. Conversely, 1 Joule = 0.239 calories. A mole is 
equal to Avogadro’s number (6 x 10°) of molecules. A mole of a substance is its 
molecular weight expressed in grams. 
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FIGURE 2.1 A representation of the arrangement of electrons in a number of common atoms. Electrons are present around an atom’s nucleus in 
“clouds” or orbitals that are roughly defined by their boundaries, which may have a spherical or dumbbell shape. Each orbital contains a maximum of two 
electrons, which is why the electrons (dark dots in the drawing) are grouped in pairs. The innermost shell contains a single orbital (thus two electrons), the 
second shell contains four orbitals (thus eight electrons), and the third shell also contains four orbitals. The number of outer-shell electrons is a primary 
determinant of the chemical properties of an element. Atoms with a similar number of outer-shell electrons have similar properties. Lithium (Li) and sodium 
(Na), for example, have one outer-shell electron, and both are highly reactive metals. Carbon (C) and silicon (Si) atoms can each bond with four different 
atoms. Because of its size, however, a carbon atom can bond to other carbon atoms, forming long-chained organic molecules, whereas silicon is unable to 
form comparable molecules. Neon (Ne) and argon (Ar) have filled outer shells, making these atoms highly nonreactive; they are referred to as inert gases. 


Polar and Nonpolar Molecules 


Let’s examine a molecule of water. Water’s single oxygen atom 
attracts electrons much more forcefully than do either of its hydro- 
gen atoms. Asa result, the O—H bonds of a water molecule are said 
to be polarized, such that one of the atoms has a partial negative 
charge and the other a partial positive charge. This is generally 
denoted in the following manner: 


Molecules, such as water, that have an asymmetric distribution 
of charge (or dipole) are referred to as polar molecules. Polar mol- 
ecules of biological importance contain one or more electronegative 
atoms, usually O, N, and/or S. Molecules that lack electronegative 
atoms and strongly polarized bonds, such as molecules that consist 
entirely of carbon and hydrogen atoms, are said to be nonpolar. 
The presence of strongly polarized bonds is of utmost importance 
in determining the reactivity of molecules. Large nonpolar mole- 
cules, such as waxes and fats, are relatively inert. Some of the more 
interesting biological molecules, including proteins and phospho- 
lipids, contain both polar and nonpolar regions, which behave very 
differently. 


lonization 


Some atoms are so strongly electronegative that they can capture 
electrons from other atoms during a chemical reaction. For example, 
when the elements sodium (a silver-colored metal) and chlorine 
(a toxic gas) are mixed, the single electron in the outer shell of each 
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34 sodium atom migrates to the electron-deficient chlorine atom. As a 
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result, these two atoms are transformed into charged ions. 


2.Na +:CrCE — 2 Na:Cl — 2 Nat +2 :Cl 


Because the chloride ion has an extra electron (relative to the num- 
ber of protons in its nucleus), it has a negative charge (Cl) and is 
termed an anion. The sodium atom, which has lost an electron, has 
an extra positive charge (Na") and is termed a cation. When pre- 
sent in crystals, these two ions form sodium chloride, or table salt. 

The Na‘ and CI ions depicted above are relatively stable because 
they possess filled outer shells. A different arrangement of electrons 
within an atom can produce a highly reactive species, called a free 
radical. The structure of free radicals and their importance in biology 
are considered in the accompanying Human Perspective. 


REVIEW | 


1. Oxygen atoms have eight protons in their nucleus. 
How many electrons do they have? How many orbitals 
are in the inner electron shell? How many electrons are 
in the outer shell? How many more electrons can the 
outer shell hold before it is filled? 


2. Compare and contrast: a sodium atom and a sodium 
ion; a single bond and a double bond; an atom of 
weak and strong electronegativity; the electron 
distribution around an oxygen atom bound to another 
oxygen atom and an oxygen atom bound to two 
hydrogen atoms. 
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2.2 THE HUMAN PERSPECTIVE 


Do Free Radicals Cause Aging? 


During the course of this textbook, we will discuss several differ- 
ent biological factors that are thought to contribute to the pro- 
cess of aging. Here we will consider one factor that has long been 
imagined to drive aging: the gradual accumulation of damage to 
our body's tissues. This idea is appealing because it introduces a 
natural element of time—as long as damage occurs at some low 
constant rate, the longer you live, the more damage you accumu- 
late. But while this idea is appealing, actually testing it requires 
knowing the source of the damage, and then asking whether 
changing the rate of damage actually does alter aging. What 
is getting damaged, and what is doing the damage? The most 
destructive damage probably occurs to DNA. Alterations in DNA 
lead to the production of faulty genetic messages that promote 
gradual cellular deterioration. How does cellular damage occur, 
and why should it occur more rapidly in a shorter-lived animal, 
such as a chimpanzee, than a human? The answer may reside at 
the atomic level. 

Atoms are stabilized when their shells are filled with elec- 
trons. Electron shells consist of orbitals, each of which can hold 
a maximum of two electrons. Atoms or molecules that have 
orbitals containing a single unpaired electron tend to be highly 
unstable—they are called free radicals. Free radicals may be 
formed when a covalent bond is broken such that each portion 
keeps one-half of the shared electrons, or they may be formed 
when an atom or molecule accepts a single electron transferred 
during an oxidation-reduction reaction. For example, water can 
be converted into free radicals when exposed to radiation from 
the sun: 


H,O >HO ++H-« 
hydroxyl radical 
("«" indicates afree radical ) 


Free radicals are extremely reactive and capable of chemically 
altering many types of molecules, including proteins, nucleic 
acids, and lipids. This is illustrated by the fact that certain cells of 
the immune system generate free radicals within their cytoplasm 
as a means to kill bacteria that these immune cells have ingested. 


The formation of hydroxyl radicals is probably a major reason that 
sunlight is so damaging to skin. 

In 1956, Denham Harman of the University of Nebraska pro- 
posed that aging results from tissue damage caused by free rad- 
icals. Because the subject of free radicals was not one with which 
biologists and physicians were familiar, the proposal failed to 
generate significant interest. Then, in 1969, Joe McCord and Irwin 
Fridovich of Duke University discovered an enzyme, superoxide 
dismutase (SOD), whose sole function was the destruction of 
the superoxide radical (O2-"), a type of free radical formed when 
molecular oxygen picks up an extra electron. SOD catalyzes the 
following reaction: 


O,*° +O," +2H" > H,0,+0, 


hydrogen 
peroxide 


Hydrogen peroxide is also a potentially reactive oxidizing agent, 
which is why it is often used as a disinfectant and bleaching 
agent. If it is not rapidly destroyed, HzO, can break down to form 
hydroxyl radicals that attack the cell’s macromolecules. Hydrogen 
peroxide is normally destroyed in the cell by the enzymes cata- 
lase or glutathione peroxidase. 

Subsequent research has revealed that superoxide radi- 
cals are formed within cells during normal oxidative metabolism 
and that a superoxide dismutase is present in the cells of diverse 
organisms, from bacteria to humans. In fact, animals possess 
three different versions (isoforms) of SOD: a cytosolic, mitochon- 
drial, and extracellular isoform. It is estimated that as much as 
1-2 percent of the oxygen taken into human mitochondria can 
be converted to hydrogen peroxide rather than to water, the 
normal end product of respiration. The importance of SOD is 
most clearly revealed in studies of mutant bacteria and yeast that 
lack the enzyme; these cells are unable to grow in the presence of 
oxygen. Similarly, mice that are lacking the mitochondrial version 
of the enzyme (SOD2) are not able to survive more than a week 
or so after birth. Conversely, mice that have been genetically 
engineered so that their mitochondria contain elevated levels 
of the H,O>-destroying enzyme catalase live 20 percent longer, 


on average, than untreated controls. This finding, reported in 
2005, marked the first demonstration that enhanced antioxidant 
defenses can increase the life soan of a mammal. Although the 
destructive potential of free radicals, such as superoxide and 
hydroxyl radicals, is unquestioned, the importance of these agents 
as a factor in aging remains controversial. In some cases, perturba- 
tions that increase oxygen radicals were found to increase lifespan 
rather than decrease it. Moreover, the whole concept that aging 
involves accumulation of random damage has been challenged 
by the discovery of specific genes in model organisms like yeast 
and nematode worms that, when mutated, allow the organism to 
live much longer. The existence of such genes has led to a com- 
peting theory known as “programmed aging,” which posits that 
organisms have evolved mechanisms to induce their own decline 


after they have passed reproductive age. The degree to which 
free radical damage determines aging is thus very much an open 
question. 

A related area of research concerns the study of substances 
called antioxidants that are able to destroy free radicals in the 
test tube. The sale of these substances provides a major source 
of revenue for the vitamin/supplements industry. Common 
antioxidants found in the body include glutathione, vitamins 
E and C, and beta-carotene (the orange pigment in carrots 
and other vegetables). Although these substances may prove 
beneficial in the diet because of their ability to destroy free 
radicals, studies with rats and mice have failed to provide con- 
vincing evidence that they retard the aging process or increase 
maximum life span. 


2.3 Noncovalent Bonds 


Covalent bonds are strong bonds between the atoms that make up a 
molecule. Interactions between molecules (or between different 
parts of a large biological molecule) are governed by a variety of 
weaker linkages called noncovalent bonds. Noncovalent bonds do 
not depend on shared electrons but rather on attractive forces 
between atoms having an opposite charge. Individual noncovalent 
bonds are weak (about 1 to 5 kcal/mol) and are thus readily broken 
and reformed. As will be evident throughout this book, this feature 
allows noncovalent bonds to mediate the dynamic interactions 
among molecules in the cell. 

Even though individual noncovalent bonds are weak, when 
large numbers of them act in concert, as between the two strands of 
a DNA molecule or between different parts of a large protein, their 
attractive forces are additive. Taken as a whole, they provide the 
structure with considerable stability. We will examine several types 
of noncovalent bonds that are important in cells. 


lonic Bonds: Attractions between 
Charged Atoms 


A crystal of table salt is held together by an electrostatic attraction 
between positively charged Na* and negatively charged Cl” ions. 
This type of attraction between fully charged components is called 
an ionic bond (or a salt bridge). Ionic bonds within a salt crystal 
may be quite strong. However, if a crystal of salt is dissolved in water, 
each of the individual ions becomes surrounded by water molecules, 
which inhibit oppositely charged ions from approaching one another 
closely enough to form ionic bonds (FIGURE 2.2). Because cells are 
composed primarily of water, bonds between free ions are of little 
importance. In contrast, weak ionic bonds between oppositely 
charged groups of large biological molecules are of considerable 
importance. For example, when negatively charged phosphate atoms 
in a DNA molecule are closely associated with positively charged 
groups on the surface of a protein (FIGURE 2.3), ionic bonds between 
them help hold the complex together. The strength of ionic bonds in 
a cell is generally weak (about 3 kcal/mol) due to the presence of 
water, but deep within the core of a protein, where water is often 
excluded, such bonds can be much stronger. 


NaCl- 
crystal 


FIGURE 2.2 The dissolution of a salt crystal. When placed in water, the 
Na’ and Cl ions of a salt crystal become surrounded by water molecules, 
breaking the ionic bonds between the two ions. As the salt dissolves, the 
negatively charged oxygen atoms of the water molecules associate with the 
positively charged sodium ions, and the positively charged hydrogen atoms 
of the water molecules associate with the negatively charged chloride ions. 


Hydrogen Bonds 


When a hydrogen atom is covalently bonded to an electronegative 
atom, particularly an oxygen or a nitrogen atom, the single pair of 
shared electrons is greatly displaced toward the nucleus of the elec- 
tronegative atom, leaving the hydrogen atom with a partial positive 
charge. This is illustrated on page 33 for the water molecule. As a 
result of this shift in charge, the bare, positively charged nucleus of 
the hydrogen atom can approach near enough to an unshared pair of 
outer electrons of a second electronegative atom to form an attrac- 
tive interaction (FIGURE 2.4). This weak attractive interaction is 
called a hydrogen bond. Hydrogen bonds between or within mol- 
ecules in solution have an energy of roughly 1 kcal/mol, which is 
similar in magnitude to the thermal energy. As a result, hydrogen 
bonds in biological molecules are easily broken. 

Hydrogen bonds occur between most polar molecules and are 
particularly important in determining the structure and properties of 
water (discussed later). Hydrogen bonds also form between polar 
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FIGURE 2.3 Noncovalent ionic bonds play an important role in holding 
the protein molecule on the right (yellow atoms) to the DNA molecule on 
the left. lonic bonds form between positively charged nitrogen atoms in the 
protein and negatively charged oxygen atoms in the DNA. The DNA 
molecule itself consists of two separate strands held together by noncova- 
lent hydrogen bonds. Although a single noncovalent bond is relatively weak 
and easily broken, large numbers of these bonds between two molecules, 
as between two strands of DNA, make the overall complex quite stable. 


Source: Courtesy of Stephen Harrison, Harvard Biochemistry Department. 


groups present in large biological molecules, as occurs between the 
two strands of a DNA molecule (see Figure 2.3). Because their strength 
is additive, the large number of hydrogen bonds between the strands 
makes the DNA duplex a stable structure. However, because individual 
hydrogen bonds are weak, the two strands can be partially separated to 
allow enzymes access to individual strands of the DNA molecule. 


Hydrophobic Interactions 
and van der Waals Forces 


Because of their ability to interact with water, polar molecules, such as 
sugars and amino acids (described shortly), are said to be hydro- 
philic, or “water loving.’ Nonpolar molecules, such as steroid or fat 
molecules, are essentially insoluble in water because they lack the 
charged regions that would attract them to the poles of water mole- 
cules. When nonpolar compounds are mixed with water, the nonpolar, 
hydrophobic (“water fearing”) molecules are forced into aggregates, 
which minimizes their exposure to the polar surroundings 


Hydrogen bond 


} et a | 3t 


FIGURE 2.4 Hydrogen bonds form between a bonded electronegative 
atom, such as nitrogen or oxygen, which bears a partial negative charge, 
and a bonded hydrogen atom, which bears a partial positive charge. 
Hydrogen bonds (about 0.18 nm) are typically about twice as long as the 
much stronger covalent bonds. 


(FIGURE 2.5). This association of nonpolar molecules is called a 
hydrophobic interaction. This is why droplets of fat molecules 
rapidly reappear on the surface of beef or chicken soup even after the 
liquid is stirred with a spoon. 

Hydrophobic interactions of the type just described are not clas- 
sified as true bonds because they do not result from an attraction 
between hydrophobic molecules but rather from an energetic drive 
to exclude water away from the hydrophobic surfaces.” In addition 
to this type of interaction, hydrophobic groups can form weak bonds 
with one another based on electrostatic attractions. Polar molecules 
associate because they contain permanent asymmetric charge 
distributions within their structure. Closer examination of the cova- 
lent bonds that make up a nonpolar molecule (such as H, or CH,) 
reveals that electron distributions are not always symmetric. The 
distribution of electrons around an atom at any given instant is a 
statistical matter and, therefore, varies from one instant to the next. 
Consequently, at any given time, the electron density may happen to 
be greater on one side of an atom, even though the atom shares the 
electrons equally with some other atom. These transient asym- 
metries in electron distribution result in momentary separations of 
charge (dipoles) within the molecule. If two molecules with transi- 
tory dipoles are very close to one another and oriented in the appro- 
priate manner, they experience a weak attractive force, called a 
van der Waals force, that bonds them together. Moreover, the 
formation of a temporary separation of charge in one molecule 
can induce a similar separation in an adjacent molecule. In this way, 
additional attractive forces can be generated between nonpolar 
molecules. A single van der Waals force is very weak (0.1 to 0.3 kcal/ 
mol) and very sensitive to the distance that separates the two atoms 
(FIGURE 2.6a). As we will see in later chapters, however, biological 
molecules that interact with one another—for example, an antibody 
anda protein on the surface of a virus—often possess complementary 


This statement reflects an accepted hypothesis that hydrophobic interactions are 
driven by increased entropy (disorder). When a hydrophobic group projects into an 
aqueous solvent, the water molecules become ordered in a cage around the hydropho- 
bic group. These solvent molecules become disordered when the hydrophobic group 
withdraws from the surrounding solvent. A discussion of this and other views can be 
found in Nature 437:640, 2005 and Curr. Opin. Struct. Biol. 16:152, 2006. 


FIGURE 2.5 In a hydrophobic interaction, the nonpolar (hydrophobic) 
molecules are forced into aggregates, which minimizes their exposure to 
the surrounding water molecules. 


shapes. As a result, many atoms of both interactants may have the 
opportunity to approach each other very closely (Figure 2.6b), making 
van der Waals forces important in biological interactions. 


The Life-Supporting Properties of Water 


Life on Earth is totally dependent on water, and water may be essen- 
tial to the existence of life anywhere in the universe. Even though it 
contains only three atoms, a molecule of water has a unique structure 
that gives the molecule extraordinary properties.? Most importantly, 


1. Water is a highly asymmetric molecule with the O atom at one 
end and the two H atoms at the opposite end. 


2. Each of the two covalent bonds in the molecule is highly polarized. 


3. All three atoms in a water molecule are adept at forming hydro- 
gen bonds. 


The life-supporting attributes of water stem from these properties. 
Each molecule of water can form hydrogen bonds with as many 
as four other water molecules, producing a highly interconnected 
network of molecules (FIGURE 2.7). Each hydrogen bond is formed 
when the partially positive-charged hydrogen of one water molecule 
becomes aligned next to a partially negative-charged oxygen atom of 
another water molecule. Because of their extensive hydrogen bonding, 
water molecules have an unusually strong tendency to adhere to one 
another. This feature is most evident in the thermal properties of 


One way to appreciate the structure of water is by comparing it to H,S. Like oxygen, 
sulfur has six outer-shell electrons and forms single bonds with two hydrogen atoms. 
But because sulfur is a larger atom, it is less electronegative than oxygen, and its ability 
to form hydrogen bonds is greatly reduced. At room temperature, H)S is a gas, not a 
liquid. In fact, the temperature has to drop to —86°C before H,S freezes into a solid. 


Separation between centers of atoms (A) 


2 4 6 8 
= 
Ss 
n 
Ss 
lai 
Do 
a 
> 
20 
oO 
= 
wie 
xe) 
= 
oO 
a 
Bb 
a Optimal van 
der Waals 
interaction 
(a) 


van der Waals 
forces 


(b) 


FIGURE 2.6 Van der Waals forces. (a) As two atoms approach each 
other, they experience a weak attractive force that increases up to a specific 
distance, typically about 4 A. If the atoms approach more closely, their 
electron clouds repel one another, causing the atoms to be forced apart. 
(b) Although individual van der Waals forces are very weak and transient, 
large numbers of such attractive forces can be formed if two macromol- 
ecules have a complementary surface, as is indicated schematically in this 
figure (see Figure 2.40 for an example). 


FIGURE 2.7 Hydrogen bond formation between neighboring water 
molecules. Each H atom of the molecule has about four-tenths of a full 
positive charge, and the single O atom has about eight-tenths of a full 
negative charge. 
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FIGURE 2.8 The importance of water in protein structure. The water 
molecules (each with a single red oxygen atom and two smaller gray 
hydrogen atoms) are shown in their ordered locations between the two 
subunits of a clam hemoglobin molecule. 

Source: From Martin Chaplin, Nature Revs. Mol. Cell Biol. 7:864, 2006, 
© 2006, by Macmillan Publishers Limited. 


water. For example, when water is heated, most of the thermal energy 
is consumed in disrupting hydrogen bonds rather than contributing 
to molecular motion (which is measured as an increased tempera- 
ture). Similarly, evaporation from the liquid to the gaseous state 
requires that water molecules break the hydrogen bonds holding 
them to their neighbors, which is why it takes so much energy to 
convert water to steam. Mammals take advantage of this property 
when they sweat because the heat required to evaporate the water is 
absorbed from the body, which thus becomes cooler. 

The small volume of aqueous fluid present within a cell contains 
a remarkably complex mixture of dissolved substances, or solutes. In 
fact, water is able to dissolve more types of substances than any other 
solvent. But water is more than just a solvent; it determines the struc- 
ture of biological molecules and the types of interactions in which 
they can engage. Water is the fluid matrix around which the insolu- 
ble fabric of the cell is constructed. It is also the medium through 
which materials move from one compartment of the cell to another; 
it is a reactant or product in many cellular reactions; and it protects 
the cell in many ways—from excessive heat, cold, or damaging 
radiation. 

Water is such an important factor in a cell because it is able to form 
weak interactions with so many different types of chemical groups. 
Recall from page 35 how water molecules, with their strongly polarized 
O—H bonds, form a shell around ions, separating the ions from one 
another. Similarly, water molecules form hydrogen bonds with organic 
molecules that contain polar groups, such as amino acids and sugars, 
which ensure their solubility within the cell. Water also plays a key role 
in maintaining the structure and function of macromolecules and the 
complexes that they form (such as membranes). FIGURE 2.8 shows the 
ordered arrangement of water molecules between two subunits of a 
protein molecule. The water molecules are hydrogen bonded to each 
other and to specific amino acids of the protein. 


REVIEW | 


1. Describe some of the properties that distinguish 
covalent and noncovalent bonds. 


2. Why do polar molecules, such as table sugar, dissolve 
so readily in water? Why do fat droplets form on the 
surface of an aqueous solution? Why does sweating 
help cool the body? 


eo 


2.4 Acids, Bases, and Buffers 


Protons are not only found within atomic nuclei, they are also released 
into the medium whenever a hydrogen atom loses a shared electron. 
Consider acetic acid—the distinctive ingredient of vinegar—which 
can undergo the following reaction, described as a dissociation. 


HO: HO: 
HOC 3 HG:C 4 Ht 
H :O: H :O:- 
H 
Acetic Acetate Proton 
acid ion (hydrogen ion) 


A molecule that is capable of releasing (donating) a hydrogen ion is 
termed an acid. The proton released by the acetic acid molecule in the 
previous reaction does not remain in a free state; instead, it combines 
with another molecule. Possible reactions involving a proton include 


e Combination with a water molecule to form a hydronium 
ion (H;0*). 


H* +H,O>H,0O* 


e Combination with a hydroxyl ion (OH ) to form a molecule 
of water. 


H*+OH —+H,O 


e Combination with an amino group (—NH,) in a protein to 
form a charged amine. 


H* +-NH, > -NH} 


Any molecule that is capable of accepting a proton is defined as a 
base. Acids and bases exist in pairs, or couples. When the acid loses a 
proton (as when acetic acid gives up a hydrogen ion), it becomes a base 
(in this case, acetate ion), which is termed the conjugate base of the 
acid. Similarly, when a base (such as an —NH) group) accepts a pro- 
ton, it forms an acid (in this case —NH}3), which is termed the conju- 
gate acid of that base. Thus, the acid always contains one more positive 
charge than its conjugate base. Water is an example of an amphoteric 
molecule, that is, one that can serve both as an acid and a base: 


H,0* =H*+H,O=OH +H* 
Acid Base 
molecule 


Amphoteric 
molecule 


We will discuss another important group of amphoteric molecules, the 
amino acids, on page 49. 

Acids vary markedly in the ease with which they give up a proton. 
The more readily the proton is lost, that is, the less strong the attrac- 
tion of a conjugate base for its proton, the stronger the acid. Hydrogen 


TABLE 2.1 Strengths of Acids and Bases 


Very weak 
Weak 


Very weak 


chloride is a very strong acid, one that will readily transfer its proton 
to water molecules. The conjugate base of a strong acid, such as HCl, 
is a weak base (Table 2.1). Acetic acid, in contrast, is a relatively weak 
acid because for the most part it remains undissociated when dis- 
solved in water. In a sense, one can consider the degree of dissocia- 
tion of an acid in terms of the competition for protons among the 
components of a solution. Water is a better competitor, that is, a 
stronger base, than chloride ion, so HCl completely dissociates. In 
contrast, acetate ion is a stronger base than water, so it remains 
largely as undissociated acetic acid. 

The acidity of a solution is measured by the concentration of 
hydrogen ions* and is expressed in terms of pH. 


pH =—log[H*] 


where [H"] is the molar concentration of protons. For example, a solu- 
tion having a pH of 5 contains a hydrogen ion concentration of 10M. 
Because the pH scale is logarithmic, an increase of one pH unit cor- 
responds to a tenfold decrease in H* concentration (or a tenfold 
increase in OH concentration). Stomach juice (pH 1.8), for example, 
has nearly one million times the H* concentration of blood (pH 7.4). 

A water molecule can dissociate into a hydroxyl ion and a pro- 
ton, H,O >H* +OH. In pure water, the concentration of both H* 
and OH" is approximately 10~’M. Pure water thus has a pH of 7.0 and 
since we view water as being the standard solvent into which other 
molecules are dissolved, it is common to refer to pH 7 as “neutral”. 

Most biological processes are acutely sensitive to pH because 
changes in hydrogen ion concentration affect the ionic state of bio- 
logical molecules. For example, as the hydrogen ion concentration 
increases, the —NH, group of the amino acid arginine becomes pro- 
tonated to form —NH3%, which can disrupt the activity of the entire 
protein. Even slight changes in pH can impede biological reactions. 
Organisms, and the cells they comprise, are protected from pH fluc- 
tuations by buffers—compounds that react with free hydrogen or 
hydroxyl ions, thereby resisting changes in pH. Buffer solutions usu- 
ally contain a weak acid together with its conjugate base. Blood, for 
example, is buffered by carbonic acid and bicarbonate ions, which 
normally hold blood pH at about 7.4. 


HCO, + H” = H,Co, 
Bicarbonate Hydrogen Carbonic 
ion ion acid 
If the hydrogen ion concentration rises (as occurs during exercise), the 
bicarbonate ions combine with the excess protons, removing them 
from solution. Conversely, excess OH” ions (which are generated dur- 
ing hyperventilation) are neutralized by protons derived from carbonic 
acid. The pH of the fluid within the cell is regulated in a similar man- 
ner by a phosphate buffer system consisting of H,PO; and HPO7. 


“Tn aqueous solutions, protons do not exist in the free state, but rather as H,O* or H,O,"*. 
For the sake of simplicity, we will refer to them simply as protons or hydrogen ions. 


REVIEW | 


1. If you were to add hydrochloric acid to water, what 
effect would this have on the hydrogen ion 
concentration? on the pH? on the ionic charge of 
any proteins in solution? 


2. What is the relationship between a base and its 
conjugate acid? 


e—______! 


2.5 The Nature of Biological 
Molecules 


The bulk of an organism is water. If the water is evaporated away, most 
of the remaining dry weight consists of molecules containing atoms of 
carbon. When first discovered, it was thought that carbon-containing 
molecules were present only in living organisms and thus were referred 
to as organic molecules to distinguish them from inorganic molecules 
found in the inanimate world. As chemists learned to synthesize more 
and more of these carbon-containing molecules in the lab, the mys- 
tique associated with organic compounds disappeared. The com- 
pounds produced by living organisms are called biochemicals. 

The chemistry of life centers around the chemistry of the car- 
bon atom. The essential quality of carbon that has allowed it to play 
this role is the incredible number of molecules it can form. Having 
four outer-shell electrons, a carbon atom can bond with up to four 
other atoms. Most importantly, each carbon atom is able to bond 
with other carbon atoms so as to construct molecules with back- 
bones containing long chains of carbon atoms. Carbon-containing 
backbones may be linear, branched, or cyclic. 


f 
C 
/ Cc. | 
C—C—C—C—C—C © i eee 
C C—C—C 
Nc? | 
Linear Cyclic Branched 


Cholesterol, whose structure is depicted in FIGURE 2.9, illustrates 
various arrangements of carbon atoms. 

Both the size and electronic structure of carbon make it uniquely 
suited for generating large numbers of molecules, several hundred 
thousand of which are known. In contrast, silicon, which is just below 
carbon in the periodic table and also has four outer-shell electrons 
(see Figure 2.1), is too large for its positively charged nucleus to attract 
the outer-shell electrons of neighboring atoms with sufficient force to 
hold such large molecules together. We can best understand the nature 
of biological molecules by starting with the simplest group of organic 
molecules, the hydrocarbons, which contain only carbon and hydro- 
gen atoms. The molecule ethane (C,H) is a simple hydrocarbon 


Ethane 


consisting of two atoms of carbon in which each carbon is bonded 
to the other carbon as well as to three atoms of hydrogen. As more 
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Cholesterol 


FIGURE 2.9 Cholesterol, whose structure illustrates how carbon atoms 
(represented by the black balls) are able to form covalent bonds with as 
many as four other carbon atoms. As a result, carbon atoms can be linked 
together to form the backbones of a virtually unlimited variety of organic 
molecules. The carbon backbone of a cholesterol molecule includes four 
rings, which is characteristic of steroids (e.g., estrogen, testosterone, 
cortisol). The cholesterol molecule shown here is drawn as a ball-and-stick 
model, which is another way that molecular structure is depicted. 


carbons are added, the skeletons of organic molecules increase in 
length and their structure becomes more complex. 


Functional Groups 


Hydrocarbons do not occur in significant amounts within most liv- 
ing cells (though they constitute the bulk of the fossil fuels formed 
from the remains of ancient plants and animals). Many of the organic 
molecules that are important in biology contain chains of carbon 
atoms, like hydrocarbons, but certain hydrogen atoms are replaced 
by various functional groups. Functional groups are particular 
groupings of atoms that often behave as a unit and give organic mol- 
ecules their physical properties, chemical reactivity, and solubility in 
aqueous solution. Some of the more common functional groups are 
listed in Table 2.2. Two of the most common linkages between func- 
tional groups are ester bonds, which form between carboxylic 
acids and alcohols, and amide bonds, which form between car- 
boxylic acids and amines. 


| if | 
—C—OH+HO c —C=O c 
Acid Alcohol Ester 
i 1 Toy 
—C—OH + HN S >—C—N 5 
Acid Amine Amide 


TABLE 2.2 Functional Groups 
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Methyl Hydroxyl Carboxyl 


Amino 


Most of the groups in Table 2.2 contain one or more electron- 
egative atoms (N, P, O, and/or S) and make organic molecules more 
polar, more water soluble, and more reactive. Several of these func- 
tional groups can ionize and become positively or negatively charged. 
The effect on molecules by the substitution of various functional 
groups is readily demonstrated. The hydrocarbon ethane (CH3;CH3) 
depicted on page 39 is a toxic, flammable gas. Replace one of the 
hydrogens with a hydroxyl group (—OH) and the molecule 
(CH3;CH,OH) becomes palatable—it is ethyl alcohol (or ethanol). 
Substitute a carboxyl group (—COOH) and the molecule becomes 
acetic acid (CH;COOH), the strong-tasting ingredient in vinegar. 
Substitute a sulfhydryl group (—SH), and you have formed 
CH,;CH,SH, a strong, foul-smelling agent, ethyl mercaptan, used by 
biochemists in studying enzyme reactions. 


A Classification of Biological 
Molecules by Function 


The organic molecules commonly found within living cells can be 
divided into several categories based on their role in metabolism. 


1. Macromolecules. The molecules that form the structure and 
carry out the activities of cells are huge, highly organized mole- 
cules called macromolecules, which contain anywhere from 
dozens to millions of carbon atoms. Because of their size and the 
intricate shapes that macromolecules can assume, some of these 
molecular giants can perform complex tasks with great precision 
and efficiency. The presence of macromolecules, more than any 
other characteristic, endows organisms with the properties of life 
and sets them apart chemically from the inanimate world. 

Macromolecules can be divided into four major categories: 
proteins, nucleic acids, polysaccharides, and certain lipids. The 
first three types are polymers composed of a large number of low- 
molecular-weight building blocks, or monomers. These macro- 
molecules are constructed from monomers by a process of 
polymerization that resembles coupling railroad cars onto a train 
(FIGURE 2.10). The basic structure and function of each type of 
macromolecule are similar in all organisms. It is not until you 
look at the specific sequences of monomers that make up indi- 
vidual macromolecules that the diversity among organisms 
becomes apparent. The localization of these molecules in a num- 
ber of cellular structures is shown in an overview in FIGURE 2.11. 


2. The building blocks of macromolecules. Most of the macromol- 
ecules within a cell have a short lifetime compared with the cell 
itself; with the exception of the cells DNA, they are continually 
broken down and replaced by new macromolecules. Consequently, 
most cells contain a supply (or pool) of low-molecular-weight 
precursors that are ready to be incorporated into macromolecules. 
These include sugars, which are the precursors of polysaccharides; 
amino acids, which are the precursors of proteins; nucleotides, 
which are the precursors of nucleic acids; and fatty acids, which 
are incorporated into lipids. 
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FIGURE 2.10 Monomers and polymers; polymerization and hydrolysis. (a) Polysaccharides, proteins, and nucleic acids consist of monomers (subunits) linked 
together by covalent bonds. Free monomers do not simply react with each other to become macromolecules. Rather, each monomer is first activated by 

attachment to a carrier molecule that helps the monomer to chemically react with the end of the growing macromolecule. (b) A macromolecule is disassembled 
by hydrolysis of the bonds that join the monomers together. Hydrolysis is the splitting of a bond by water. All of these reactions are catalyzed by specific enzymes. 
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FIGURE 2.11 An overview of the types of biological molecules that make 
up various cellular structures. 


3. Metabolic intermediates (metabolites). The molecules in a cell 
have complex chemical structures and must be synthesized in a 
step-by-step sequence beginning with specific starting materials. 
In the cell, each series of chemical reactions is termed a meta- 
bolic pathway. The cell starts with compound A and converts 
it to compound B, then to compound C, and so on, until some 
functional end product (such as an amino acid building block of 
a protein) is produced. The compounds formed along the path- 
ways leading to the end products might have no function per se 
and are called metabolic intermediates. 


4. Molecules of miscellaneous function. This is obviously a broad 
category of molecules but not as large as you might expect; the 
vast bulk of the dry weight of a cell is made up of macromolecules 
and their direct precursors. The molecules of miscellaneous 
function include such substances as vitamins, which function 
primarily as adjuncts to proteins; certain steroid or amino acid 
hormones; molecules involved in energy storage, such as ATP; 
regulatory molecules such as cyclic AMP; and metabolic waste 
products such as urea. 


REVIEW | 


1. What properties of a carbon atom are critical to life? 


2. Draw the structures of four different functional groups. 
How would each of these groups alter the solubility of 


a molecule in water? 
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Carbohydrates (or glycans, as they are often called) include simple 
sugars (or monosaccharides) and all larger molecules constructed of 
sugar building blocks. Carbohydrates function primarily as stores of 
chemical energy and as durable building materials for biological con- 
struction. Most sugars have the general formula (CH,O),,. The sugars 
of importance in cellular metabolism have values of n that range from 
3 to 7. Sugars containing three carbons are known as trioses, those 
with four carbons as tetroses, those with five carbons as pentoses, those 
with six carbons as hexoses, and those with seven carbons as heptoses. 


The Structure of Simple Sugars 


Each sugar molecule consists of a backbone of carbon atoms linked 
together in a linear array by single bonds. Each of the carbon atoms 
of the backbone is linked to a single hydroxyl group, except for one 
that bears a carbonyl (C=O) group. If the carbonyl group is located 
at an internal position (to form a ketone group), the sugar is a ketose, 
such as fructose, which is shown in FIGURE 2.12a. If the carbony] is 
located at one end of the sugar, it forms an aldehyde group and the 
molecule is known as an aldose, as exemplified by glucose, which is 
shown in Figure 2.12b-f. Because of their large numbers of hydroxyl 
groups, sugars tend to be highly water soluble. 

Although the straight-chained formulas shown in Figure 2.12a,b 
are useful for comparing the structures of various sugars, sugars having 
five or more carbons spontaneously self-react (Figure 2.12c) to produce 
a closed, or ring-containing, molecule. Only a tiny fraction of sugar 
molecules in solution are found in the open-chain linear form; the rest 
are in the ring form. The linear form is biochemically important 
because the aldehyde group at the end of the chain is reactive and can 
react with proteins, notably hemoglobin. Patients with diabetes have 
higher levels of sugar in their blood, and this sugar in its open chain 
form reacts with hemoglobin to produce a modified hemoglobin called 
Hemoglobin Alc, which is often used in blood tests to track the pro- 
gress of diabetes. Similar reactions of linear-form sugar with proteins 
involved in cholesterol metabolism are one of the reasons that diabetes 


causes heart disease. The open chain form of sugars is thus quite 
important in medicine, but the vast majority of sugars are found in the 
ring form, and it is in this form that they are used as building blocks to 
build other types of carbohydrates. The ring forms of sugars are usually 
depicted as flat (planar) structures (Figure 2.12d) lying perpendicular 
to the plane of the paper with the thickened line situated closest to the 
reader. The H and OH groups lie parallel to the plane of the paper, 
projecting either above or below the ring of the sugar. In actual fact, 
the sugar ring is not a planar structure, but usually exists in a three- 
dimensional conformation resembling a chair (Figure 2.12e,f). 


Stereoisomerism 


As noted earlier, a carbon atom can bond with four other atoms. The 
arrangement of the groups around a carbon atom can be depicted as 
in FIGURE 2.13a with the carbon placed in the center of a tetrahedron 
and the bonded groups projecting into its four corners. Figure 2.13b 
depicts a molecule of glyceraldehyde, which is the only aldotriose. 
The second carbon atom of glyceraldehyde is linked to four different 
groups (—H, —-OH, —CHO, and —CH,OH). If the four groups 
bonded to a carbon atom are all different, as in glyceraldehyde, then 
two possible configurations exist that cannot be superimposed on one 
another. These two molecules (termed stereoisomers or enantiomers) 
have essentially the same chemical reactivities, but their structures are 
mirror images (not unlike a pair of right and left human hands). By 
convention, the molecule is called p-glyceraldehyde if the hydroxyl 
group of carbon 2 projects to the right, and L-glyceraldehyde if it pro- 
jects to the left (Figure 2.13c). Because it acts as a site of stereoisomer- 
ism, carbon 2 is referred to as an asymmetric carbon atom. 

As the backbone of sugar molecules increases in length, so too 
does the number of asymmetric carbon atoms and, consequently, the 
number of stereoisomers. Aldotetroses have two asymmetric carbons 
and thus can exist in four different configurations (FIGURE 2.14). Similarly, 
there are eight different aldopentoses and 16 different aldohexoses. The 
designation of each of these sugars as D or L is based by convention on 
the arrangement of groups attached to the asymmetric carbon atom 
farthest from the aldehyde (the carbon associated with the aldehyde is 
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FIGURE 2.12 The structures of sugars. (a) Straight-chain formula of fructose, a ketohexose [keto, indicating the carbonyl (yellow), is located internally, 
and hexose because it consists of six carbons]. (6) Straight-chain formula of glucose, an aldohexose (aldo because the carbonyl is located at the end of the 
molecule). (c) Self-reaction in which glucose is converted from an open chain to a closed ring (a pyranose ring). (d) Glucose is commonly depicted in the 
form of a flat (planar) ring lying perpendicular to the page with the thickened line situated closest to the reader and the H and OH groups projecting either 
above or below the ring. The basis for the designation «-D-glucose is discussed in the following section. (e) The chair conformation of glucose, which depicts 
its three-dimensional structure more accurately than the flattened ring of part d. (f) A ball-and-stick model of the chair conformation of glucose. 
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FIGURE 2.13 Stereoisomerism of glyceraldehyde. (a) The four groups 
bonded to a carbon atom (labeled a, b, c, and d) occupy the four corners of 
a tetrahedron with the carbon atom at its center. (b) Glyceraldehyde is the 
only three-carbon aldose; its second carbon atom is bonded to four 
different groups (—H, —-OH, —-CHO, and —CH;OH). As a result, 
glyceraldehyde can exist in two possible configurations that are not 
superimposable, but instead are mirror images of each other as indicated. 
These two stereoisomers (or enantiomers) can be distinguished by the 
configuration of the four groups around the asymmetric (or chiral) carbon 
atom. Solutions of these two isomers rotate plane-polarized light in 
opposite directions and, thus, are said to be optically active. (c) Straight- 
chain formulas of glyceraldehyde. By convention, the D-isomer is shown 
with the OH group on the right. 
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FIGURE 2.14 Aldotetroses. Because they have two asymmetric carbon 
atoms, aldotetroses can exist in four configurations. 


designated C1). If the hydroxyl group of this carbon projects to the 
right, the aldose is a D-sugar; if it projects to the left, it is an L-sugar. The 
enzymes present in living cells can distinguish between the p and L 
forms of a sugar. Typically, only one of the stereoisomers (such as D- 
glucose and 1-fucose) is used by cells. 

The self-reaction in which a straight-chain glucose molecule 
is converted into a six-membered (pyranose) ring was shown in 


o-D-Glucopyranose 


p-D-Glucopyranose 


FIGURE 2.15 Formation of an a- and B-pyranose. When a molecule 

of glucose undergoes self-reaction to form a pyranose ring (i.e., a six- 
membered ring), two stereoisomers are generated. The two isomers are in 
equilibrium with each other through the open-chain form of the molecule. 
By convention, the molecule is an a-pyranose when the OH group of the 
first carbon projects below the plane of the ring, and a B-pyranose when 
the hydroxyl group projects upward. 


Figure 2.12c. Unlike its precursor in the open chain, the C1 of the ring 
bears four different groups and thus becomes a new center of asym- 
metry within the sugar molecule. Because of this extra asymmetric 
carbon atom, each type of pyranose exists as « and { stereoisomers 
(FIGURE 2.15). By convention, the molecule is an a-pyranose when 
the OH group of the first carbon projects below the plane of the ring, 
and a B-pyranose when the hydroxyl projects upward. The difference 
between the two forms has important biological consequences, result- 
ing, for example, in the compact shape of glycogen and starch mole- 
cules and the extended conformation of cellulose (discussed later). 


Linking Sugars Together 


Sugars can be joined to one another by covalent glycosidic bonds 
to form larger molecules. Glycosidic bonds form by reaction between 
carbon atom Cl of one sugar and the hydroxyl group of another 
sugar, generating a —C—-O—C-— linkage between the two sugars. As 
discussed below (and indicated in FIGURES 2.16 and 2.17), sugars can 
be joined by quite a variety of different glycosidic bonds. Molecules 
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FIGURE 2.16 Disaccharides. Sucrose and lactose are two of the most 
common disaccharides. Sucrose is composed of glucose and fructose 
joined by an a(1 > 2) linkage, whereas lactose is composed of glucose and 
galactose joined by a B(1 > 4) linkage. 
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FIGURE 2.17 Three polysaccharides with identical sugar monomers but dramatically different properties. Glycogen (a), starch (b), and cellulose (c) are 
each composed entirely of glucose subunits, yet their chemical and physical properties are very different due to the distinct ways that the monomers are 
linked together (three different types of linkages are indicated by the circled numbers). Glycogen molecules are the most highly branched, starch molecules 
assume a helical arrangement, and cellulose molecules are unbranched and highly extended. Whereas glycogen and starch are energy stores, cellulose 
molecules are bundled together into tough fibers that are suited for their structural role. Colorized electron micrographs show glycogen granules in a liver 
cell, starch grains (amyloplasts) in a plant seed, and cellulose fibers in a plant cell wall; each is indicated by an arrow. 

Source: (a) Don W. Fawcett/Photo Researchers, Inc.; (b) Jeremy Burgess/Photo Researchers, Inc.; (c) Biophoto Associates/Photo Researchers, Inc. 


composed of only two sugar units are disaccharides (Figure 2.16). 
Disaccharides serve primarily as readily available energy stores. 
Sucrose, or table sugar, is a major component of plant sap, which car- 
ries chemical energy from one part of the plant to another. Lactose, 
present in the milk of most mammals, supplies newborn mammals 
with fuel for early growth and development. Lactose in the diet is 
hydrolyzed by the enzyme lactase, which is present in the plasma 
membranes of the cells that line the intestine. Many people lose this 
enzyme after childhood and find that eating dairy products causes 
digestive discomfort. 

Sugars may also be linked together to form small chains called 
oligosaccharides (oligo = few). Most often such chains are found 
covalently attached to lipids and proteins, converting them into 
glycolipids and glycoproteins, respectively. Oligosaccharides are 


particularly important on the glycolipids and glycoproteins of the plasma 
membrane, where they project from the cell surface (see Figure 4.4a). 
Because oligosaccharides may be composed of many different combina- 
tions of sugar units, these carbohydrates can play an informational role; 
that is, they can serve to distinguish one type of cell from another and 
help mediate specific interactions of a cell with its surroundings. 


Polysaccharides 


By the middle of the nineteenth century, it was known that the blood 
of people suffering from diabetes had a sweet taste due to an elevated 
level of glucose, the key sugar in energy metabolism. Claude Bernard, 
a prominent French physiologist of the period, was looking for the 
cause of diabetes by investigating the source of blood sugar. It was 


assumed at the time that any sugar present in a human or an animal 
had to have been previously consumed in the diet. Working with 
dogs, Bernard found that, even if the animals were placed on a diet 
totally lacking carbohydrates, their blood still contained a normal 
amount of glucose. Clearly, glucose could be formed in the body 
from other types of compounds. 

After further investigation, Bernard found that glucose enters 
the blood from the liver. Liver tissue, he found, contains an insoluble 
polymer of glucose he named glycogen. Bernard concluded that 
various food materials (such as proteins) were carried to the liver 
where they were chemically converted to glucose and stored as gly- 
cogen. Then, as the body needed sugar for fuel, the glycogen in the 
liver was transformed to glucose, which was released into the blood- 
stream to satisfy glucose-depleted tissues. In Bernard’s hypothesis, 
the balance between glycogen formation and glycogen breakdown in 
the liver was the prime determinant in maintaining the relatively 
constant (homeostatic) level of glucose in the blood. 

Bernard's hypothesis proved to be correct. The molecule he 
named glycogen is a type of polysaccharide—a polymer of sugar 
units joined by glycosidic bonds. 


GLYCOGEN AND STARCH: NUTRITIONAL POLY- 
SACCHARIDES Glycogen is a branched polymer containing 
only one type of monomer: glucose (Figure 2.17a). Most of the sugar 
units of a glycogen molecule are joined to one another by a(1 > 4) 
glycosidic bonds (type 2 bond in Figure 2.17a). Branch points con- 
tain a sugar joined to three neighboring units rather than to two, as 
in the unbranched segments of the polymer. The extra neighbor, 
which forms the branch, is linked by an a(1 > 6) glycosidic bond 
(type 1 bond in Figure 2.17a). 

Glycogen serves as a storehouse of surplus chemical energy in 
most animals. Human skeletal muscles, for example, typically con- 
tain enough glycogen to fuel about 30 minutes of moderate activity. 
Depending on various factors, glycogen typically ranges in molecu- 
lar weight from about one to four million daltons. When stored in 
cells, glycogen is highly concentrated in what appears as dark- 
staining, irregular granules in electron micrographs (Figure 2.17, 
right). 

Most plants bank their surplus chemical energy in the form of 
starch, which like glycogen is also a polymer of glucose. Potatoes 
and cereals, for example, consist primarily of starch. Starch is actu- 
ally a mixture of two different polymers, amylose and amylopectin. 
Amylose is an unbranched, helical molecule whose sugars are joined 
by a(1 > 4) linkages (Figure 2.17b), whereas amylopectin is branched. 
Amylopectin differs from glycogen in being much less branched and 
having an irregular branching pattern. Starch is stored as densely 
packed granules, or starch grains, which are enclosed in membrane- 
bound organelles (plastids) within the plant cell (Figure 2.17, right). 
Although animals don’t synthesize starch, they possess an enzyme 
(amylase) that readily hydrolyzes it. 


CELLULOSE, CHITIN, AND GLYCOSAMINOGLY- 
CANS: STRUCTURAL POLYSACCHARIDES Whereas 
some polysaccharides constitute easily digested energy stores, others 
form tough, durable structural materials. Cotton and linen, for example, 
consist largely of cellulose, which is the major component of plant 
cell walls. Cotton textiles owe their durability to the long, unbranched 
cellulose molecules, which are ordered into side-by-side aggregates to 
form molecular cables (Figure 2.17c, right panel) that are ideally con- 
structed to resist pulling (tensile) forces. Like glycogen and starch, 


cellulose consists solely of glucose monomers; its properties differ 
dramatically from these other polysaccharides because the glucose 
units are joined by B(1 — 4) linkages (bond 3 in Figure 2.17c) rather 
than a(1 > 4) linkages. Ironically, multicellular animals (with rare 
exception) lack the enzyme needed to degrade cellulose, which hap- 
pens to be the most abundant organic material on Earth and rich in 
chemical energy. Animals that “make a living” by digesting cellulose, 
such as termites and sheep, do so by harboring bacteria and protozoa 
that synthesize the necessary enzyme, cellulase. Cellulose is an impor- 
tant component of dietary fiber, a broad term that includes all the poly- 
saccharides we eat that cannot be digested by human enzymes. 

Not all biological polysaccharides consist of glucose monomers. 
Chitin is an unbranched polymer of the sugar N-acetylglucosamine, 
which is similar in structure to glucose but has an acetyl amino group 
instead ofa hydroxyl group bonded to the second carbon atom of the ring. 
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Chitin occurs widely as a structural material among invertebrates, 
particularly in the outer covering of insects, spiders, and crusta- 
ceans. Chitin is a tough, resilient, yet flexible material not unlike cer- 
tain plastics. Insects owe much of their success to this highly adaptive 
polysaccharide (FIGURE 2.18). 

Another group of polysaccharides that has a more complex 
structure is the glycosaminoglycans (or GAGs). Unlike other 
polysaccharides, they have the structure —A—B—A—B-—, where A 
and B represent two different sugars. The best-studied GAG is 


FIGURE 2.18 Chitin is the primary component of the outer skeleton 
of this grasshopper. 


Source: Anthony Bannister/Gallo Images/© Corbis. 


££ 
ol 


sajzeipAyoMes e 9°Z 


46 heparin, which is secreted by cells in the lungs and other tissues in 


a}!7 FO siseg [eD1WAEYD SY] e J YAldVHOD 


response to tissue injury. Heparin inhibits blood coagulation, 
thereby preventing the formation of clots that can block the flow of 
blood to the heart or lungs. Heparin accomplishes this feat by acti- 
vating an inhibitor (antithrombin) of a key enzyme (thrombin) that 
is required for blood coagulation. Heparin, which is normally 
extracted from pig tissue, has been used for decades to prevent blood 
clots in patients following major surgery. Unlike heparin, most 
GAGs are found in the spaces that surround cells, and their structure 
and function are discussed in Section 7.3. The most complex poly- 
saccharides are found in plant cell walls (Section 7.14). 


REVIEW | 


1. Name three polysaccharides composed of polymers 
of glucose. How do these macromolecules differ from 
one another? 


ee 


2.7. Lipids 


Lipids are a diverse group of nonpolar biological molecules whose 
common properties are their ability to dissolve in organic solvents, 
such as chloroform or benzene, and their inability to dissolve in 
water—a property that explains many of their varied biological func- 
tions. Lipids of importance in cellular function include fats, steroids, 
and phospholipids. 


Fats 


Fats consist of a glycerol molecule linked by ester bonds to three 
fatty acids; the composite molecule is termed a triacylglycerol 
(FIGURE 2.19A), also known as a triglyceride. We will begin by con- 
sidering the structure of fatty acids. Fatty acids are long, unbranched 
hydrocarbon chains with a single carboxyl group at one end (Figure 
2.19b). Because the two ends of a fatty acid molecule have a very dif- 
ferent structure, they also have different properties. The hydrocar- 
bon chain is hydrophobic, whereas the carboxyl group (—COOH), 
which bears a negative charge at physiological pH, is hydrophilic. 
Molecules having both hydrophobic and hydrophilic regions are said 
to be amphipathic; such molecules have unusual and biologically 
important properties. The properties of fatty acids can be appreciated 
by considering the use of a familiar product: soap, which consists of 
fatty acids. In past centuries, soaps were made by heating animal fat 
in strong alkali (NaOH or KOH) to break the bonds between the fatty 
acids and the glycerol. Today, most soaps are made synthetically. 
Soaps owe their grease-dissolving capability to the fact that the 
hydrophobic end of each fatty acid can embed itself in the grease, 
whereas the hydrophilic end can interact with the surrounding water. 
As a result, greasy materials are converted into complexes (micelles) 
that can be dispersed by water (FIGURE 2.20). 

Fatty acids differ from one another in the length of their hydro- 
carbon chain and the presence or absence of double bonds. Fatty 
acids present in cells typically vary in length from 14 to 20 carbons. 
Fatty acids that lack double bonds, such as stearic acid (Figure 2.19b), 
are described as saturated; those possessing double bonds are 
unsaturated. Naturally occurring fatty acids have double bonds in 
the cis configuration. Double bonds (of the cis configuration) 
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FIGURE 2.19 Fats and fatty acids. (a) The basic structure of a triacylglyc- 
erol (also called a triglyceride or a neutral fat). The glycerol moiety, 
indicated in orange, is linked by three ester bonds to the carboxyl groups 
of three fatty acids whose tails are indicated in green. (b) Stearic acid, an 
18-carbon saturated fatty acid that is common in animal fats. (c) Space- 
filling model of tristearate, a triacylglycerol containing three identical stearic 
acid chains. (d) Space-filling model of linseed oil, a triacylglycerol derived 
from flax seeds that contains three unsaturated fatty acids (linoleic, oleic, 
and linolenic acids). The sites of unsaturation, which produce kinks in the 
molecule, are indicated by the yellow-orange bars. 
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produce kinks in a fatty acid chain. Consequently, the more double 
bonds that fatty acid chains possess, the less effectively these long 


FIGURE 2.20 Soaps consist of fatty acids. In this schematic drawing of a 
soap micelle, the nonpolar tails of the fatty acids are directed inward, where 
they interact with the greasy matter to be dissolved. The negatively 
charged heads are located at the surface of the micelle, where they interact 
with the surrounding water. Membrane proteins, which also tend to be 
insoluble in water, can also be solubilized in this way by extraction of 
membranes with detergents. 


chains can be packed together. This lowers the temperature at which 
a fatty acid-containing lipid melts. Tristearate, whose fatty acids lack 
double bonds (Figure 2.19c), is a common component of animal fats 
and remains in a solid state well above room temperature. In contrast, 
the profusion of double bonds in vegetable fats accounts for their liq- 
uid state—both in the plant cell and on the grocery shelf—and for 
their being labeled as “polyunsaturated.” Fats that are liquid at room 
temperature are described as oils. Figure 2.19d shows the structure of 
linseed oil, a highly volatile lipid extracted from flax seeds, that 
remains a liquid at a much lower temperature than does tristearate. 
Solid shortenings, such as margarine, are formed from unsaturated 
vegetable oils by chemically reducing the double bonds with hydro- 
gen atoms (a process termed hydrogenation). The hydrogenation pro- 
cess also converts some of the cis double bonds into trans double 
bonds, which are straight rather than kinked. This process generates 
partially hydrogenated or trans-fats. Eating trans fat increases the 
risk of heart disease, and artificial trans fat is now banned in several 
countries, with others considering similar measures. 

A molecule of fat can contain three identical fatty acids (as in 
Figure 2.19c), or it can be a mixed fat, containing more than one fatty 
acid species (as in Figure 2.19d). Most natural fats, such as olive oil or 
butterfat, are mixtures of molecules having different fatty acid species. 

Fats are very rich in chemical energy; a gram of fat contains over 
twice the energy content of a gram of carbohydrate (for reasons dis- 
cussed in Section 3.9). Carbohydrates function primarily as a short- 
term, rapidly available energy source, whereas fat reserves store 
energy on a long-term basis. It is estimated that a person of average 
size contains about 0.5 kilograms (kg) of carbohydrate, primarily in 
the form of glycogen. This amount of carbohydrate provides approx- 
imately 2000 kcal of total energy. During the course of a strenuous 
day’s exercise, a person can virtually deplete his or her body’s entire 
store of carbohydrate. In contrast, the average person contains 


approximately 16 kg of fat (equivalent to 144,000 kcal of energy), 
and as we all know, it can take a very long time to deplete our store 
of this material. 

Because they lack polar groups, fats are extremely insoluble in 
water and are stored in cells in the form of dry lipid droplets. Because 
lipid droplets do not contain water as do glycogen granules, they 
represent an extremely concentrated storage fuel. In many animals, 
fats are stored in special cells (adipocytes) whose cytoplasm is filled 
with one or a few large lipid droplets. Adipocytes exhibit a remark- 
able ability to change their volume to accommodate varying quanti- 
ties of fat. 


Steroids 


Steroids are built around a characteristic four-ringed hydrocarbon 
skeleton. One of the most important steroids is cholesterol, a compo- 
nent of animal cell membranes and a precursor for the synthesis of a 
number of steroid hormones, such as testosterone, progesterone, 
and estrogen (FIGURE 2.21). Cholesterol is largely absent from plant 
cells, which is why vegetable oils are considered “cholesterol-free,” 
but plant cells may contain large quantities of related compounds. 


Phospholipids 


The chemical structure of a common phospholipid is shown in 
FIGURE 2.22. The molecule resembles a fat (triacylglycerol), but has 
only two fatty acid chains rather than three; it is a diacylglycerol. The 
third hydroxyl of the glycerol backbone is covalently bonded to a 
phosphate group, which in turn is covalently bonded to a small polar 
group, such as choline, as shown in Figure 2.22. Thus, unlike fat mol- 
ecules, phospholipids contain two ends that have very different 


CH3 


HO 
Cholesterol 


Testosterone OH 


HO 
Estrogen 


FIGURE 2.21 The structure of steroids. All steroids share the basic four- 
ring skeleton. The seemingly minor differences in chemical structure 
between cholesterol, testosterone, and estrogen generate profound 
biological differences. 
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FIGURE 2.22 The phospholipid phosphatidylcholine. The molecule 
consists of a glycerol backbone whose hydroxy! groups are covalently 
bonded to two fatty acids and a phosphate group. The negatively charged 
phosphate is also bonded to a small, positively charged choline group. The 
end of the molecule that contains the phosphorylcholine is hydrophilic, 
whereas the opposite end, consisting of the fatty acid tail, is hydrophobic. 
The structure and function of phosphatidylcholine and other phospholipids 
are discussed at length in Section 4.2. 


properties: the end containing the phosphate group has a distinctly 
hydrophilic character; the other end composed of the two fatty acid 
tails has a distinctly hydrophobic character. Because phospholipids 
function primarily in cell membranes, and because the properties of 
cell membranes depend on their phospholipid components, they 
will be discussed further in Sections 4.2 and 15.6 in connection with 
cell membranes. 


REVIEW | 


1. Describe the properties of three different types of lipid 


molecules. What are their respective biological roles? 
‘ae 


2.8 Building Blocks of Proteins 


Proteins are the macromolecules that carry out virtually all of a cell’s 
activities; they are the molecular tools and machines that make 
things happen. As enzymes, proteins vastly accelerate the rate of 


(c) 


(d) 


FIGURE 2.23 Four examples of the thousands of biological structures composed predominantly of protein. These include (a) feathers, which are 
adaptations in birds for thermal insulation, flight, and sex recognition; and (b) spider webs, made of a protein-based silk that is among the strongest 
materials known. (c) human hair, composed of the protein keratin, and (d) fingernails, which are also composed of keratin. 


Source: (a) Darrell Gulin/Getty Images, Inc.; (6) © gabriela schaufelberger / iStockphoto; (c) © Science Picture.Co; (d) Tamara83 / Shutterstock. 


metabolic reactions; as structural cables, proteins provide mechani- 
cal support both within cells and outside their perimeters 
(FIGURE 2.23a); as hormones, growth factors, and gene activators, 
proteins perform a wide variety of regulatory functions; as mem- 
brane receptors and transporters, proteins determine what a cell 
reacts to and what types of substances enter or leave the cell; as con- 
tractile filaments and molecular motors, proteins constitute the 
machinery for biological movements. Among their many other 
functions, proteins act as antibodies, serve as toxins, form blood 
clots, absorb or refract light (Figure 2.23b), and transport substances 
from one part of the body to another. 

How can one type of molecule have so many varied functions? 
The explanation resides in the virtually unlimited molecular struc- 
tures that proteins, as a group, can assume. Each protein, however, 
has a unique and defined structure that enables it to carry out a par- 
ticular function. Most importantly, proteins have shapes and sur- 
faces that allow them to interact selectively with other molecules. 
Proteins, in other words, exhibit a high degree of specificity. It is 
possible, for example, for a particular DNA-cutting enzyme to rec- 
ognize a segment of DNA containing one specific sequence of eight 
nucleotides, while ignoring all the other 65,535 possible sequences 
composed of this number of nucleotides. 


The Structures of Amino Acids 


Proteins are polymers made of amino acid monomers. Each protein 
has a unique sequence of amino acids that gives the molecule its 
unique properties. Many of the capabilities of a protein can be 
understood by examining the chemical properties of its constituent 
amino acids. Twenty different amino acids are commonly used in 
the construction of proteins, whether from a virus or a human. 
There are two aspects of amino acid structure to consider: that which 
is common to all of them and that which is unique to each. We will 
begin with the shared properties. 

All amino acids have a carboxyl group and an amino group, 
which are separated from each other by a single carbon atom, the 
a-carbon (FIGURE 2.24a,b). In a neutral aqueous solution, the «-carboxyl 
group loses its proton and exists in a negatively charged state (—COO ), 
and the a-amino group accepts a proton and exists in a positively 
charged state (NH3) (Figure 2.24b). We saw on page 43 that carbon 
atoms bonded to four different groups can exist in two configura- 
tions (stereoisomers) that cannot be superimposed on one another. 
Amino acids also have asymmetric carbon atoms. With the exception 
of glycine, the «-carbon of amino acids bonds to four different groups 
so that each amino acid can exist in either a D or an L form 
(FIGURE 2.25). Amino acids used in the synthesis of a protein on a 
ribosome are always L-amino acids. The “selection” of L-amino acids 
must have occurred very early in cellular evolution and has been con- 
served for billions of years. Microorganisms, however, use D-amino 
acids in the synthesis of certain small peptides, including those of the 
cell wall and several antibiotics (e.g., gramicidin A). 

During the process of protein synthesis, each amino acid 
becomes joined to two other amino acids, forming a long, continu- 
ous, unbranched polymer called a polypeptide chain. The amino 
acids that make up a polypeptide chain are joined by peptide 
bonds that result from the linkage of the carboxyl group of one 
amino acid to the amino group of its neighbor, with the elimination 
of a molecule of water (Figure 2.24c). A polypeptide chain composed. 
of a string of amino acids joined by peptide bonds has the following 
backbone: 
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FIGURE 2.24 Amino acid structure. Ball-and-stick model (a) and 
chemical formula (b) of a generalized amino acid in which R can be any of a 
number of chemical groups (see Figure 2.26). (c) The formation of a 
peptide bond occurs by the condensation of two amino acids, drawn here 
in the uncharged state. In the cell, this reaction occurs on a ribosome as an 
amino acid is transferred from a carrier (a tRNA molecule) onto the end of 
the growing polypeptide chain (see Figure 11.49). 
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FIGURE 2.25 Amino acid stereoisomerism. Because the a-carbon of all 
amino acids except glycine is bonded to four different groups, two 
stereoisomers can exist. The D and L forms of alanine are shown. 
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The “average” polypeptide chain contains about 450 amino acids. 
The longest known polypeptide, found in the muscle protein titin, 
contains more than 30,000 amino acids. Once incorporated into a 
polypeptide chain, amino acids are termed residues. The residue on 
one end of the chain, the N-terminus, contains an amino acid with a 
free (unbonded) «-amino group, whereas the residue at the opposite 
end, the C-terminus, has a free a-carboxyl group. In addition to 
amino acids, many proteins contain other types of components that 
are added after the polypeptide is synthesized. These include carbo- 
hydrates (to form glycoproteins), metal-containing groups (to form 
metalloproteins), and organic groups (e.g., flavoproteins). 


The Properties of the Side Chains 


The backbone, or main chain, of the polypeptide is composed of that 
part of each amino acid that is common to all of them. The side chain 
or R group (Figure 2.24), bonded to the a-carbon, is highly variable 
among the 20 building blocks, and it is this variability that ultimately 
gives proteins their diverse structures and activities. If the various 
amino acid side chains are considered together, they exhibit a large 
variety of structural features, ranging from fully charged to hydropho- 
bic, and they can participate in a wide variety of covalent and nonco- 
valent bonds. As discussed in the following chapter, the side chains of 
the “active sites” of enzymes can facilitate (catalyze) many different 
organic reactions. The assorted characteristics of the side chains of the 
amino acids are important in both intramolecular interactions, which 
determine the structure and activity of the molecule, and intermolec- 
ular interactions, which determine the relationship of a polypeptide 
with other molecules, including other polypeptides (page 60). 

Amino acids are classified on the character of their side chains. 
They fall roughly into four categories: polar and charged, polar and 
uncharged, nonpolar, and those with unique properties (FIGURE 2.26). 


1. Polar, charged (Fig. 2.26a). Amino acids of this group include 
aspartic acid, glutamic acid, lysine, and arginine. These four 
amino acids contain side chains that become fully charged; that 
is, the side chains contain relatively strong organic acids and 
bases. The ionization reactions of glutamic acid and lysine are 
shown in FIGURE 2.27. At physiologic pH, the side chains of these 
amino acids are almost always present in the fully charged state. 
Consequently, they are able to form ionic bonds with other 
charged species in the cell. For example, the positively charged 
arginine residues of histone proteins are linked by ionic bonds to 
the negatively charged phosphate groups of DNA (see Figure 2.3). 
Histidine is also considered a polar, charged amino acid, although 
in most cases it is only partially charged at physiologic pH. In 
fact, because of its ability to gain or lose a proton in physiologic 
pH ranges, histidine is a particularly important residue in the 
active site of many proteins (as in Figure 3.13). 


2. Polar, uncharged (Fig. 2.26b). The side chains of these amino 
acids have a partial negative or positive charge and thus can form 
hydrogen bonds with other molecules including water. These 
amino acids are often quite reactive. Included in this category are 
asparagine and glutamine (the amides of aspartic acid and glu- 
tamic acid), threonine, serine, and tyrosine. 


3. Nonpolar (Fig. 2.26c). The side chains of these amino acids are 
hydrophobic and are unable to form electrostatic bonds or inter- 
act with water. The amino acids of this category are alanine, valine, 
leucine, isoleucine, tryptophan, phenylalanine, and methionine. 
The side chains of the nonpolar amino acids generally lack oxygen 


and nitrogen. They vary primarily in size and shape, which allows 
one or another of them to pack tightly into a particular space 
within the core of a protein, associating with one another as the 
result of van der Waals forces and hydrophobic interactions. 


4. The other three amino acids—glycine, proline, and cysteine 
(Fig. 2.26d)—have unique properties that separate them from the 
others. The side chain of glycine consists of only a hydrogen 
atom, and glycine is a very important amino acid for just this 
reason. Owing to its lack of a side chain, glycine residues provide 
a site where the backbones of two polypeptides (or two segments 
of the same polypeptide) can approach one another very closely. 
In addition, glycine is more flexible than other amino acids and 
allows parts of the backbone to move or form a hinge. Proline is 
unique in having its &-amino group as part of a ring (making it 
an imino acid). Proline is a hydrophobic amino acid that does not 
readily fit into an ordered secondary structure, such as an o helix 
(page 54), often producing kinks or hinges. Cysteine contains a 
reactive sulfhydryl (—SH) group and is often covalently linked to 
another cysteine residue, as a disulfide (—SS—) bridge. 
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Disulfide bridges often form between two cysteines that are 
distant from one another in the polypeptide backbone or even in 
two separate polypeptides. Disulfide bridges help stabilize the intri- 
cate shapes of proteins, particularly those present outside of cells 
where they are subjected to added physical and chemical stress. For 
example, hair is made of a large number of filaments composed of 
the cysteine-rich protein keratin, cross-linked together by disulfide 
bridges. When someone gets a “perm” to make their hair curlier, the 
hairdresser puts the hair into curlers and then adds a reducing agent 
that breaks the disulfide bridges, letting the keratin filaments slide 
past each other. When the reducing agent is washed out, disulfide 
bridges re-form, locking the keratin in the new positions and caus- 
ing the hair to be permanently locked into the shape imparted by the 
curlers. The same principle is used in hair-straightening treatments 
except that in this case the hairs are pulled out straight before the 
reducing agent is applied. 

Not all of the amino acids described in this section are found in 
all proteins, nor are the various amino acids distributed in an equiva- 
lent manner. A number of other amino acids are also found in pro- 
teins, but they arise by alterations to the side chains of the 20 basic 
amino acids after their incorporation into a polypeptide chain. For 
this reason they are called posttranslational modifications 
(PTMs). Dozens of different types of PTMs have been documented. 
The most widespread and important PTM is the reversible addition of 
a phosphate group to a serine, threonine, or tyrosine residue. Lysine 
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Properties of side chains (R groups): 


Hydrophilic side chains act as acids or bases which tend to be fully charged (+ or -) under physiologic conditions. 
Side chains form ionic bonds and are often involved in chemical reactions. 
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Properties of side chains: 
Hydrophilic side chains tend to have partial + or - charge allowing them to participate in chemical reactions, form 
H-bonds, and associate with water. 
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Properties of side chains: 
Hydrophobic side chain consists almost entirely of C and H atoms. These amino acids tend to form the inner core of 
soluble proteins, buried away from the aqueous medium. They play an important role in membranes by associating 
with the lipid bilayer. 
Side chains with unique properties 
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Side chain consists only of hydrogen 
atom and can fit into either a hydrophilic 
or hydrophobic environment. Glycine often 
resides at sites where two polypeptides 
come into close contact. 


Though side chain has polar, uncharged 
character, it has the unique property of 
forming a covalent bond with another 
cysteine to form a disulfide link. 


Though side chain has hydrophobic 
character, it has the unique property 
of creating kinks in polypeptide 
chains and disrupting ordered 
secondary structure. 


FIGURE 2.26 The chemical structure of amino acids. These 20 amino acids represent those most commonly found in proteins and, more specifically, 
those encoded by DNA. Other amino acids occur as the result of a modification to one of those shown here. The amino acids are arranged into four groups 
based on the character of their side chains, as described in the text. All molecules are depicted as free amino acids in their ionized state as they would exist 
in solution at neutral pH. 
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FIGURE 2.27 The ionization of charged, polar amino acids. (a) The side 
chain of glutamic acid loses a proton when its carboxylic acid group ionizes. 
The degree of ionization of the carboxyl group depends on the pH of the 
medium: the greater the hydrogen ion concentration (the lower the pH), the 
smaller the percentage of carboxyl groups that are present in the ionized 
state. Conversely, a rise in pH leads to an increased ionization of the proton 
from the carboxyl group, increasing the percentage of negatively charged 
glutamic acid side chains. The pH at which 50 percent of the side chains are 
ionized and 50 percent are unionized is called the pK, which is 4.4 for the 
side chain of free glutamic acid. At physiologic pH, virtually all of the 
glutamic acid residues of a polypeptide are negatively charged. (b) The side 
chain of lysine becomes ionized when its amino group gains a proton. The 
greater the hydroxyl ion concentration (the higher the pH), the smaller the 
percentage of amino groups that are positively charged. The pH at which 
50 percent of the side chains of lysine are charged and 50 percent are 
uncharged is 10.0, which is the pK for the side chain of free lysine. At 
physiologic pH, virtually all of the lysine residues of a polypeptide are 
positively charged. Once incorporated into a polypeptide, the pK of a 
charged group can be greatly influenced by the surrounding environment. 


acetylation is another widespread and important PTM affecting thou- 
sands of proteins in a mammalian cell. PTMs can generate dramatic 
changes in the properties and function of a protein, most notably by 
modifying its three-dimensional structure, level of activity, localiza- 
tion within the cell, life span, and/or its interactions with other mol- 
ecules. The presence or absence of a single phosphate group on a key 
regulatory protein has the potential to determine whether a cell will 
behave as a cancer cell or a normal cell. Because of PT'Ms, a single 
polypeptide can exist as a number of distinct biological molecules. 
The ionic, polar, or nonpolar character of amino acid side chains 
is very important in protein structure and function. Most soluble 
(i.e., nonmembrane) proteins are constructed so that the polar 
residues are situated at the surface of the molecule where they can 
associate with the surrounding water and contribute to the protein's 
solubility in aqueous solution (FIGURE 2.28a). In contrast, the non- 
polar residues are situated predominantly in the core of the molecule 
(Figure 2.28b). The hydrophobic residues of the protein interior are 
often tightly packed together, creating a type of three-dimensional 
jigsaw puzzle in which water molecules are generally excluded. 
Hydrophobic interactions among the nonpolar side chains of these 


(b) 


FIGURE 2.28 Disposition of hydrophilic and hydrophobic amino acid 
residues in the soluble protein cytochrome c. (a) The hydrophilic side 
chains, which are shown in green, are located primarily at the surface of the 
protein where they contact the surrounding aqueous medium. (b) The 
hydrophobic residues, which are shown in red, are located primarily 

within the center of the protein, particularly in the vicinity of the central 
heme group. 

Source: Illustration, Irving Geis. Image from the Irving Geis Collection/ 
Howard Hughes Medical Institute. Rights owned by HHMI. Reproduction 
by permission only. 


residues are a driving force during protein folding (page 61) and 
contribute substantially to the overall stability of the protein. In 
many enzymes, reactive polar groups project into the nonpolar inte- 
rior, giving the protein its catalytic activity. For example, a nonpolar 
environment can enhance ionic interactions between charged 
groups that would be lessened by competition with water in an aque- 
ous environment. Some reactions that might proceed at an imper- 
ceptibly slow rate in water can occur in millionths of a second within 
the protein. 


REVIEW | 


1. What are the major properties that distinguish 
different amino acids from one another? What roles 
do these differences play in the structure and function 
of proteins? 


2. What are the properties of glycine, proline, and 


cysteine that distinguish these amino acids? 
e_____——_ 


2.9 Primary and Secondary 
Structures of Proteins 


Nowhere in biology is the intimate relationship between form and 
function better illustrated than with proteins. The structure of most 
proteins is completely defined and predictable. Each amino acid in 
one of these giant macromolecules is located at a specific site within 
the structure, giving the protein the precise shape and reactivity 
required for the job at hand. Protein structure can be described at 
several levels of organization, each emphasizing a different aspect 
and each dependent on different types of interactions. Customarily, 
four such levels are described: primary, secondary, tertiary, and qua- 
ternary. The first, primary structure, concerns the amino acid 
sequence of a protein, whereas the latter three levels concern the 
organization of the molecule in space. To understand the mecha- 
nism of action and biological function of a protein it is essential to 
know how that protein is constructed. 


Primary Structure 


The primary structure of a polypeptide is the specific linear sequence 
of amino acids that constitute the chain. With 20 different building 
blocks, the number of different polypeptides that can be formed is 
20", where n is the number of amino acids in the chain. Because 
most polypeptides contain well over 100 amino acids, the variety of 
possible sequences is essentially unlimited. The information for the 
precise order of amino acids in every protein that an organism can 
produce is encoded within the genome of that organism. 

As we will see later, the amino acid sequence provides the infor- 
mation required to determine a protein’s three-dimensional shape 
and thus its function. The sequence of amino acids, therefore, is all- 
important, and changes that arise in the sequence as a result of 
genetic mutations in the DNA may not be readily tolerated. The ear- 
liest and best-studied example of this relationship is the change in the 
amino acid sequence of hemoglobin that causes the disease sickle cell 
anemia. This severe, inherited anemia results solely from a single 


lum 


FIGURE 2.29 Scanning electron micrograph of a red blood cell from a 
person with sickle cell anemia. Compare with the micrograph of a normal 
red blood cell of Figure 4.32a. 


Source: Courtesy of J. T. Thornwaite, B. F. Cameron, and R. C. Leif. 


change in amino acid sequence within the hemoglobin molecule: 
A nonpolar valine residue is present where a charged glutamic acid is 
normally located. This change in hemoglobin structure can have a 
dramatic effect on the shape of red blood cells, converting them from 
disk-shaped cells to sickle-shaped cells (FIGURE 2.29), which tend to 
clog small blood vessels, causing pain and life-threatening crises. Not 
all amino acid changes have such a dramatic effect, as evidenced by 
the differences in amino acid sequence in the same protein among 
related organisms. The degree to which changes in the primary 
sequence are tolerated depends on the degree to which the shape of 
the protein or the critical functional residues are disturbed. 

The first amino acid sequence of a protein was determined by 
Frederick Sanger and co-workers at Cambridge University in the 
early 1950s. Beef insulin was chosen for the study because of its avail- 
ability and its small size—two polypeptide chains of 21 and 30 amino 
acids each. The sequencing of insulin was a momentous feat in the 
newly emerging field of molecular biology. It revealed that proteins, 
the most complex molecules in cells, have a definable substructure 
that is neither regular nor repeating, unlike those of polysaccharides. 
Each particular polypeptide, whether insulin or some other species, 
has a precise sequence of amino acids that does not vary from one 
molecule to another. With the advent of techniques for rapid DNA 
sequencing (see Section 18.22), the primary structure of a polypep- 
tide can be deduced from the nucleotide sequence of the encoding 
gene. In the past few years, the complete sequences of the genomes of 
hundreds of organisms, including humans, have been determined. 
This information will eventually allow researchers to learn about 
every protein that an organism can manufacture. However, translat- 
ing information about primary sequence into knowledge of higher 
levels of protein structure remains a formidable challenge. 


Secondary Structure 


All matter exists in space and therefore has a three-dimensional 
shape. Proteins are formed by linkages among vast numbers of atoms; 
consequently their shape is complex. The term conformation refers 
to the three-dimensional arrangement of the atoms ofa molecule, that 
is, to their spatial organization. Secondary structure describes the 
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secondary structure were carried out by Linus Pauling and Robert 
Corey of the California Institute of Technology. By studying the struc- 
ture of simple peptides consisting of a few amino acids linked together, 
Pauling and Corey concluded that polypeptide chains exist in pre- 
ferred conformations that provide the maximum possible number of 
hydrogen bonds between neighboring amino acids. 

Two conformations were proposed. In one conformation, the 
backbone of the polypeptide assumed the form ofa cylindrical, twist- 
ing spiral called the alpha (a) helix (FIGURE 2.30a). The backbone 
lies on the inside of the helix, and the side chains project outward. 
The helical structure is stabilized by hydrogen bonds between the 
atoms of one peptide bond and those situated just above and below it 
along the spiral (Figure 2.30b). The X-ray diffraction patterns of 
actual proteins produced during the 1950s bore out the existence of 
the o helix, first in the protein keratin found in hair and later in vari- 
ous oxygen-binding proteins, such as myoglobin and hemoglobin (see 
Figure 2.35). Surfaces on opposite sides of an o helix may have con- 
trasting properties. In water-soluble proteins, the outer surface of an 
a helix often contains polar residues in contact with the solvent, 
whereas the surface facing inward typically contains nonpolar side 
chains. 


3.6 
residues 


(a) 


FIGURE 2.30 The alpha helix. (a) The helical path around a central axis 
taken by the polypeptide backbone in a region of « helix. Each complete 
(360°) turn of the helix corresponds to 3.6 amino acid residues. The distance 
along the axis between adjacent residues is 15 A. (b) The arrangement of 
the atoms of the backbone of the a helix and the hydrogen bonds that form 
between amino acids. Because of the helical rotation, the peptide bonds of 
every fourth amino acid come into close proximity. The approach of the 
carbonyl group (C=O) of one peptide bond to the imine group (H—N) of 
another peptide bond results in the formation of hydrogen bonds between 
them. The hydrogen bonds (orange bars) are essentially parallel to the axis 
of the cylinder and thus hold the turns of the chain together. 

Source: (a, b) Illustration, Irving Geis. Image from the Irving Geis Collection/ 
Howard Hughes Medical Institute. Rights owned by HHMI. Reproduction by 
permission only. 


The second conformation proposed by Pauling and Corey was 
the beta (fp) sheet, which consists of several segments of a polypep- 
tide lying side by side (FIGURE 2.31a). Unlike the coiled, cylindrical 
form of the « helix, the backbone of each segment of polypeptide (or 
6 strand) in a B sheet assumes a folded or pleated conformation 
(Figure 2.31a). Like the « helix, the B sheet is also characterized by a 
large number of hydrogen bonds, but these are oriented perpendicu- 
lar to the long axis of the polypeptide chain and project across from 
one part of the chain to another (Figure 2.31b). The strands ofa B sheet 
can be arranged either parallel to one another, with neighboring 
strands running in the same direction, or antiparallel, with neighboring 
strands running in opposite directions. Figure 2.31b shows an antiparallel 
6 sheet. Like the « helix, the B sheet has also been found in many 
different proteins. Because f strands are highly extended, the f sheet 
resists pulling (tensile) forces. Silk is composed of a protein contain- 
ing an extensive amount of f sheet; silk fibers are thought to owe their 
strength to this architectural feature. Remarkably, a single fiber of spi- 
der silk, which may bea tenth the thickness of a human hair, is roughly 
five times stronger than a steel fiber of comparable weight. 

Those portions of a polypeptide chain not organized into an 
ot helix or a B sheet may consist of hinges, turns, loops, or finger-like 
extensions. Often, these are the most flexible portions of a polypep- 
tide chain and the sites of greatest biological activity. For example, 
antibody molecules are known for their specific interactions with 
other molecules (antigens); these interactions are mediated by a 
series of loops at one end of the antibody molecule (see Figures 17.15 
and 17.16). The various types of secondary structures are most 
simply depicted as shown in FIGURE 2.32: o helices are represented 
by helical ribbons, f strands as flattened arrows, and connecting seg- 
ments as thinner strands. 


(a) 


(b) \ 


FIGURE 2.31 The 6 sheet. (a) Each polypeptide of a B sheet assumes an 
extended but pleated conformation referred to as a B strand. The pleats 
result from the location of the a-carbons above and below the plane of the 
sheet. Successive side chains (R groups in the figure) project upward and 
downward from the backbone. The distance along the axis between 
adjacent residues is 3.5 A. (b) A B sheet consists of a number of B strands 
that lie parallel to one another and are joined together by a regular array of 
hydrogen bonds between the carbonyl and imine groups of the neighbor- 
ing backbones. Neighboring segments of the polypeptide backbone may 
lie either parallel (in the same N-terminal > C-terminal direction) or 
antiparallel (in the opposite N-terminal — C-terminal direction). 

Source: (b): Illustration, Irving Geis. Image from the Irving Geis Collection/ 
Howard Hughes Medical Institute. Rights owned by HHMI. Reproduction by 
permission only. 


FIGURE 2.32 A ribbon model of ribonuclease. The regions of « helix are 
depicted as spirals and f strands as flattened ribbons with the arrows indicating 
the N-terminal > C-terminal direction of the polypeptide. Those segments of 
the chain that do not adopt a regular secondary structure (i.e., an « helix or B 
strand) consist largely of loops and turns. Disulfide bonds are shown in yellow. 
Source: Hand drawn by Jane S. Richardson. 


REVIEW | 


1. How are the properties of an a helix different from a 
B strand? How are they similar? 


e—____ 


2.10 Tertiary Structure of Proteins 


The next level above secondary structure is tertiary structure, which 
describes the conformation of the entire polypeptide. Whereas sec- 
ondary structure is stabilized primarily by hydrogen bonds between 
atoms that form the peptide bonds of the backbone, tertiary struc- 
ture is stabilized by an array of noncovalent bonds between the 
diverse side chains of the protein. Secondary structure is largely lim- 
ited to a small number of conformations, but tertiary structure is 
virtually unlimited. 

The detailed tertiary structure of a protein is usually determined 
using the technique of X-ray crystallography. In this technique 
(which is described in more detail in Sections 3.6 and 18.16), a crystal 
of the protein is bombarded by a thin beam of X-rays, and the radia- 
tion that is scattered (diffracted) by the electrons of the protein’s 
atoms is allowed to strike a radiation-sensitive detector, forming an 
image of spots, such as those of FIGURE 2.33. These patterns of spots 
do not directly show the structure of the protein, but computer pro- 
grams based on the mathematics of diffraction can then be used to 
derive the structure responsible for producing the pattern. Tertiary 
structure can also be determined by nuclear magnetic resonance 
(NMR) spectroscopy, a method in which proteins in solution are 
placed in a powerful magnetic field and probed with radio waves to 
produce a spectrum (FIGURE 2.34a) which provides information 
about the distances between atoms. As with X-ray crystallography, 
the spectrum itself does not directly show the protein structure, but 


FIGURE 2.33 An X-ray diffraction pattern of myoglobin. The pattern of 
spots is produced as a beam of X-rays is diffracted by the atoms in the 
protein crystal, causing the X-rays to strike the film at specific sites. Information 
derived from the position and intensity (darkness) of the spots can be used to 
calculate the positions of the atoms in the protein that diffracted the beam, 
leading to complex structures such as that shown in Figure 2.35. 

Source: Courtesy of John C. Kendrew. 


computer programs can derive the structures most likely to give the 
observed spectrum (Figure 2.34b). The two methods, X-ray crystal- 
lography and NMR, have different strengths and weaknesses: X-ray 
crystallography can provide higher resolution structures for larger 
proteins but is limited by the need to get any given protein to form 
pure crystals. NMR does not require crystallization, can provide 
information about dynamic changes in protein structure, and can 
rapidly reveal drug binding sites on a protein, but the method becomes 
increasingly difficult to apply as the size of the protein increases. 

For many years it was presumed that all proteins had a fixed 
three-dimensional structure, which gave each protein its unique 
properties and specific functions. It came as a surprise to discover 
over the past decade or so that many proteins of higher organisms 
contain sizable segments that lack a defined conformation. Examples 
of proteins containing these types of unstructured (or disordered) 
segments can be seen in the models of the PrP protein in Figure 1 on 
page 63 and the histone tails in Figure 12.13c. The disordered regions 
in these proteins are depicted as dashed lines in the images, convey- 
ing the fact that these segments of the polypeptide (like pieces of spa- 
ghetti) can occupy many different positions and, thus, cannot be 
studied by X-ray crystallography. Disordered segments tend to have a 
predictable amino acid composition, being enriched in charged and 
polar residues and deficient in hydrophobic residues. You might be 
wondering whether proteins lacking a fully defined structure could 
be engaged in a useful function. In fact, the disordered regions of 
such proteins play key roles in vital cellular processes, often binding 
to DNA or to other proteins. Remarkably, these segments often 
undergo a physical transformation once they bind to an appropriate 
partner and are then seen to possess a defined, folded structure. 

Most proteins can be categorized on the basis of their overall 
conformation as being either fibrous proteins, which have an 
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FIGURE 2.34 NMR spectroscopy reveals tertiary structure without crystalization. (a) An NMR spectrum of the membrane-spanning helix protein 
sensory rhodopsin II. (b) Solving the structure of sensory rhodopsin II by NMR, showing thirty computed solutions consistent with the measured NMR 
spectra. The fact that all solutions agree on the overall shape indicates that we should have high confidence in the solution. 


Source: From Antoine Guatier et al., Nature Struct. Mol. Biol., 17, 768. 2010. 


elongated shape, or globular proteins, which have a compact 
shape. Most proteins that act as structural materials outside living 
cells are fibrous proteins, such as collagens and elastins of connective 
tissues, keratins of hair and skin, and silk. These proteins resist pull- 
ing or shearing forces to which they are exposed. In contrast, most 
proteins within the cell are globular proteins. 


Myoglobin: The First Globular Protein 
Whose Tertiary Structure Was Determined 


The polypeptide chains of globular proteins are folded and twisted 
into complex shapes. Distant points on the linear sequence of amino 
acids are brought next to each other and linked by various types of 
bonds. The first glimpse at the tertiary structure of a globular pro- 
tein came in 1957 through the X-ray crystallographic studies of John 
Kendrew and his colleagues at Cambridge University using X-ray 
diffraction patterns such as that shown in Figure 2.33. The protein 
they reported on was myoglobin. 

Myoglobin functions in muscle tissue as a storage site for oxy- 
gen; the oxygen molecule is bound to an iron atom in the center of a 
heme group. (The heme is an example of a prosthetic group, i.e., 
a portion of the protein that is not composed of amino acids, which 
is joined to the polypeptide chain after its assembly on the ribo- 
some.) It is the heme group of myoglobin that gives most muscle 
tissue its reddish color. The first report on the structure of myoglo- 
bin provided a low-resolution profile sufficient to reveal that the 
molecule was compact (globular) and that the polypeptide chain was 
folded back on itself in a complex arrangement. There was no evi- 
dence of regularity or symmetry within the molecule, such as that 
revealed in the earlier description of the DNA double helix. 

The earliest crude profile of myoglobin revealed eight rod-like 
stretches of « helix ranging from 7 to 24 amino acids in length. 


Altogether, approximately 75 percent of the 153 amino acids in the 
polypeptide chain are in the «-helical conformation. This is an unu- 
sually high percentage compared with that for other proteins that 
have since been examined. No f sheet was found. Subsequent analy- 
ses of myoglobin using additional X-ray diffraction data provided a 
much more detailed picture of the molecule (FIGURES 2.35 and 
3.16). For example, it was shown that the heme group is situated 
within a pocket of hydrophobic side chains that promotes the bind- 
ing of oxygen without the oxidation (loss of electrons) of the iron 
atom. Myoglobin contains no disulfide bonds; the tertiary structure 
of the protein is held together exclusively by noncovalent interac- 
tions. All of the noncovalent bonds thought to occur between side 
chains within proteins—hydrogen bonds, ionic bonds, and van der 
Waals forces—have been found (FIGURE 2.36). Unlike myoglobin, 
most globular proteins contain both « helices and B sheets. Most 
importantly, these early landmark studies revealed that each protein 
has a unique tertiary structure that can be correlated with its amino 
acid sequence and its biological function. 


Tertiary Structure May Reveal 
Unexpected Similarities 
between Proteins 


Similarity in primary sequence is often used to decide whether two 
proteins may have similar structure and function. Sometimes, however, 
proteins that appeared to be unrelated at the primary sequence level 
have been found to have similar tertiary structures. Because it is the 
tertiary structure that determines the interactions and enzymatic 
activity of a protein, such structural similarity indicates that these 
proteins may have similar functions. FIGURE 2.37 shows two pro- 
teins, actin from eukaryotic cells and MreB from bacteria, whose 
primary sequence shows no similarity but whose tertiary structures 


FIGURE 2.35 The three-dimensional structure of myoglobin. The tertiary 
structure of myoglobin. Most of the amino acids are part of « helices, shown 
in red. The nonhelical regions occur primarily as turns, where the polypeptide 
chain changes direction, shown in green. The position of the heme is 
indicated in yellow and blue with the coordinated iron atom in orange. 
Source: This work is licensed under the Creative Commons 
Attribution-ShareAlike 3.0 License. 


van der Waals forces 


Hydrogen bond 


lonic bond 


FIGURE 2.36 Types of noncovalent bonds maintaining the 
conformation of proteins. 


are clearly related. As discussed in Section 9.7, both proteins form 
cytoskeletal filaments. 


Protein Domains 


Unlike myoglobin, most eukaryotic proteins are composed of two 
or more spatially distinct modules, or domains, that fold inde- 
pendent of one another. For example, the mammalian enzyme 
phospholipase C, shown in the central part of FIGURE 2.38, consists 


actin 


FIGURE 2.37 Different sequence, similar structure. Two proteins, actin 
and MreB, whose primary sequences are completely different but that share 


a common tertiary structure. 
Source: From Bill Wickstead and Keith Gull, J. Cell Biol. 194:513-525, Fig. 1. 


of four distinct domains, colored differently in the drawing. The 
different domains of a polypeptide often represent parts that func- 
tion in a semi-independent manner. For example, they might bind 
different factors, such as a coenzyme and a substrate or a DNA 
strand and another protein, or they might move relatively inde- 
pendent of one another. Protein domains are often identified with a 
specific function. For example, proteins containing a PH domain 
bind to membranes containing a specific phospholipid, whereas 


Bacterial 
phospholipase C 


Troponin C 


Mammalian 
phospho- 
lipase C 


Synapto- 
tagmin 


Recoverin 


FIGURE 2.38 Proteins are built of structural units, or domains. The 
mammalian enzyme phospholipase C is constructed of four domains, 
indicated in different colors. The catalytic domain of the enzyme is shown in 
blue. Each of the domains of this enzyme can be found independently in 
other proteins as indicated by the matching color. 

Source: From Liisa Holm and Chris Sander, Structure 5:167, © 2007, with 
permission from Elsevier. 
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residue in another protein. The functions of a newly identified pro- 
tein can often be predicted by the domains of which it is made. 

Many polypeptides containing more than one domain are 
thought to have arisen during evolution by the fusion of genes that 
encoded different ancestral proteins, with each domain representing 
a part that was once a separate molecule. Each domain of the mam- 
malian phospholipase C molecule, for example, has been identified 
as ahomologous unit in another protein (Figure 2.38). Some domains 
have been found only in one or a few proteins. Other domains have 
been shuffled widely about during evolution, appearing in a variety 
of proteins whose other regions show little or no evidence of an evo- 
lutionary relationship. Shuffling of domains creates proteins with 
unique combinations of activities. On average, mammalian proteins 
tend to be larger and contain more domains than proteins of less 
complex organisms, such as fruit flies and yeast. 


Dynamic Changes within Proteins 


Although X-ray crystallographic structures possess exquisite detail, 
they are static images frozen in time. Proteins, in contrast, are not 
static and inflexible, but capable of considerable internal movements. 
Proteins are, in other words, molecules with “moving parts.’ Because 
they are tiny, nanometer-sized objects, proteins can be greatly influ- 
enced by the energy of their environment. Random, small-scale fluc- 
tuations in the arrangement of the bonds within a protein create an 
incessant thermal motion within the molecule. Spectroscopic tech- 
niques, such as nuclear magnetic resonance (NMR), can monitor 
dynamic movements within proteins, and they reveal shifts in hydro- 
gen bonds, waving movements of external side chains, and the full 
rotation of the aromatic rings of tyrosine and phenylalanine residues 
about one of the single bonds. The important role that such move- 
ments can play in a protein’s function is illustrated in studies of ace- 
tylcholinesterase, the enzyme responsible for degrading acetylcholine 
molecules that are left behind following the transmission of an 
impulse from one nerve cell to another (Section 4.18). When the ter- 
tiary structure of acetylcholinesterase was first revealed by X-ray 
crystallography, there was no obvious pathway for acetylcholine mol- 
ecules to enter the enzyme’s catalytic site, which was situated at the 
bottom of a deep gorge in the molecule. In fact, the narrow entrance 
to the site was completely blocked by the presence of a number of 
bulky amino acid side chains. Using high-speed computers, research- 
ers have been able to simulate the random movements of thousands 
of atoms within the enzyme, a feat that cannot be accomplished using 
experimental techniques. These molecular dynamic (MD) simula- 
tions indicated that movements of side chains within the protein 
would lead to the rapid opening and closing of a “gate” that would 
allow acetylcholine molecules to diffuse into the enzyme’s catalytic 
site (FIGURE 2.39). The X-ray crystallographic structure of a protein 
(ie., its crystal structure) can be considered an average structure, or 
“ground state.” A protein can undergo dynamic excursions from the 
ground state and assume alternate conformations that are accessible 
based on the amount of energy that the protein contains. 
Predictable (nonrandom) movements within a protein that are 
triggered by the binding of a specific molecule are described as con- 
formational changes. Conformational changes typically involve 
the coordinated movements of various parts of the molecule. A com- 
parison of the polypeptides depicted in Figure 3a and 3b on page 69 
shows the dramatic conformational change that occurs in a bacterial 
protein (GroEL) when it interacts with another protein (GroES). 


FIGURE 2.39 Dynamic movements within the enzyme acetylcholinest- 
erase. A portion of the enzyme is depicted here in two different conforma- 
tions: (1) a closed conformation (left) in which the entrance to the catalytic 
site is blocked by the presence of aromatic rings that are part of the side 
chains of tyrosine and phenylalanine residues (shown in purple) and (2) an 
open conformation (right) in which the aromatic rings of these side chains 
have swung out of the way, opening the “gate” to allow acetylcholine 
molecules to enter the catalytic site. These images are constructed using 
computer programs that take into account a host of information about the 
atoms that make up the molecule, including bond lengths, bond angles, 
electrostatic attraction and repulsion, van der Waals forces, etc. Using this 
information, researchers are able to simulate the movements of the various 
atoms over a very short time period, which provides images of the 
conformations that the protein can assume. An animation of this image can 
be found on the Web at http://mccammon.ucsd.edu 

Source: Courtesy of J. Andrew Mccammon. 


Virtually every activity in which a protein takes part is accompanied 
by conformational changes within the molecule (see http:// 
molmovdb.org to watch examples). The conformational change in 
the protein myosin that occurs during muscle contraction is shown 
in Figure 9.52. In this case, binding of myosin to an actin molecule 
leads to a small (20°) rotation of the head of the myosin, which 
results ina 50 to 100 A movement of the adjacent actin filament. The 
importance of this dynamic event can be appreciated if you consider 
that the movements of your body result from the additive effect of 
millions of conformational changes taking place within the contrac- 
tile proteins of your muscles. 


REVIEW | 


1. What are some of the differences between X-ray 
cyrstallography and NMR for determining protein 
structure? What can NMR tell us that X-ray 
crystallography cannot? 


2.11 Quaternary Structure 
of Proteins 


Whereas many proteins such as myoglobin are composed of only one 
polypeptide chain, most are found in complexes of more than one chain, 
or subunit. The subunits may be linked by covalent disulfide bonds, but 
most often they are held together by noncovalent bonds as occur 


typically between hydrophobic “patches” on the complementary surfaces 
of neighboring polypeptides. Protein complexes are said to have quater- 
nary structure. Depending on the protein, the polypeptide chains may 
be identical or nonidentical. A protein complex composed of two identi- 
cal subunits is described as a homodimer, whereas a protein complex 
composed of two nonidentical subunits is a heterodimer. A ribbon draw- 
ing of a homodimeric protein is depicted in FIGURE 2.40a. The two sub- 
units of the protein are drawn in different colors, and the hydrophobic 
residues that form the complementary sites of contact are indicated. 


The Structure of Hemoglobin 


The best-studied multisubunit protein is hemoglobin, the O,-carrying 
protein of red blood cells. A molecule of human hemoglobin con- 
sists of two a-globin and two B-globin polypeptides (Figure 2.40b), 
each of which binds a single molecule of oxygen. Elucidation of the 
three-dimensional structure of hemoglobin by Max Perutz of 
Cambridge University in 1959 was one of the early landmarks in the 


(b) 


FIGURE 2.40 Proteins with quaternary structure. (a) Drawing of trans- 
forming growth factor-B2 (TGF-B2), a protein that is a dimer composed of two 
identical subunits. (b) Drawing of a hemoglobin molecule, which consists of 
two a-globin chains and two f-globin chains (a heterotetramer) joined by 
noncovalent bonds. When the four globin polypeptides are assembled into a 
complete hemoglobin molecule, the kinetics of O2 binding and release are 
quite different from those exhibited by isolated polypeptides. This is because 
the binding of O, to one polypeptide causes a conformational change in the 
other polypeptides that alters their affinity for O2 molecules. 

Source: (a) From www.resb.org; (6) Illustration, Irving Geis. Image from Irving 
Geis reproduced with permission from AAAS. Collection/ Howard Hughes 
Medical Institute. Rights owned by HHMI. Reproduced by permission only. 


study of molecular biology. Perutz demonstrated that each of the 
four globin polypeptides of a hemoglobin molecule has a tertiary 
structure similar to that of myoglobin, a fact that strongly suggested 
the two proteins had evolved from a common ancestral polypeptide 
with a common O,-binding mechanism. Perutz also compared the 
structure of the oxygenated and deoxygenated versions of hemo- 
globin. In doing so, he discovered that the binding of oxygen was 
accompanied by the movement of the bound iron atom closer to the 
plane of the heme group. This seemingly inconsequential shift in 
position of a single atom pulled on an « helix to which the iron is 
connected, which in turn led to a series of increasingly larger move- 
ments within and between the subunits. This finding revealed for 
the first time that the complex functions of proteins may be carried 
out by means of small changes in their conformation. 


Protein-Protein Interactions 


Even though hemoglobin consists of four subunits, it is still consid- 
ered a single protein with a single function. Many examples are 
known in which different proteins, each with a specific function, 
become physically associated to form a much larger multiprotein 
complex. One of the first multiprotein complexes to be discovered 
and studied was the pyruvate dehydrogenase complex of the bacte- 
rium Escherichia coli, which consists of 60 polypeptide chains consti- 
tuting three different enzymes (FIGURE 2.41). The enzymes that 
make up this complex catalyze a series of reactions connecting two 
metabolic pathways, glycolysis and the TCA cycle (see Figure 5.7). 
Because the enzymes are so closely associated, the product of one 
enzyme can be channeled directly to the next enzyme in the sequence 
without becoming diluted in the aqueous medium of the cell. 
Multiprotein complexes that form within the cell are not necessar- 
ily stable assemblies, such as the pyruvate dehydrogenase complex. In 
fact, most proteins interact with other proteins in highly dynamic pat- 
terns, associating and dissociating depending on conditions within the 
cellat any given time. Interacting proteins tend to have complementary 
surfaces. Often a projecting portion of one molecule fits into a pocket 


(a) (b) 


FIGURE 2.41 Pyruvate dehydrogenase: a multiprotein complex. 

(a) Three-dimensional reconstruction of the bacterial pyruvate 
dehydrogenase E1E2 subcomplex determined by electron microscopy. Its 
molecular mass is 11 million daltons. (6) Molecular structure of the pyruvate 
dehydrogenase complex determined by fitting crystal structures of subunit 
proteins into the large-scale structure seen in electron microscopy. The core 
of the complex consists of dihydrolipoyl acetyltransferase molecules 
(green). Pyruvate dehydrogenase tetramers (blue) form an outer shell 
around the core. The full pyruvate dehydrogenase complex contains a third 
protein that is not resolved here, and is even larger. 


Source: From Jacqueline Milne et al., 2002. Embo J. 21:5587-98. 
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tact, their interaction is stabilized by noncovalent bonds. 

The reddish-colored object in FIGURE 2.42a is called an SH3 
domain, and it is found as part of more than 200 different proteins 
involved in molecular signaling. The surface of an SH3 domain con- 
tains shallow hydrophobic “pockets” that become filled by comple- 
mentary “knobs” projecting from another protein (Figure 2.425). 
A large number of different structural domains have been identified 
that, like SH3, act as adaptors to mediate interactions between pro- 
teins. In many cases, protein-protein interactions are regulated by 
modifications, such as the addition of a phosphate group to a key 
amino acid, which act as a switch to turn on or off the proteins ability 
to bind a protein partner. As more and more complex molecular 
activities have been discovered, the importance of interactions 
among proteins has become increasingly apparent. For example, 
such diverse processes as DNA synthesis, ATP formation, and RNA 
processing are all accomplished by “molecular machines” that consist 
of a large number of interacting proteins, some of which form stable 
relationships and others transient liaisons. Several hundred different 
protein complexes have been purified in large-scale studies on yeast. 


(b) 


FIGURE 2.42 Protein-protein interactions. (a) A model illustrating the 
complementary molecular surfaces of portions of two interacting proteins. 
The reddish-colored molecule is an SH3 domain of the enzyme PI3K, whose 
function is discussed in Chapter 15. This domain binds specifically to a 
variety of proline-containing peptides, such as the one shown in the 
space-filling model at the top of the figure. The proline residues in the 
peptide, which fit into hydrophobic pockets on the surface of the enzyme, 
are indicated. The polypeptide backbone of the peptide is colored yellow, 
and the side chains are colored green. (b) Schematic model of the interaction 
between an SH3 domain and a peptide showing the manner in which certain 
residues of the peptide fit into hydrophobic pockets in the SH3 domain. 


Source: From Hongtao Yu and Stuart Schreiber, Cell 76:940, © 1994, with 
permission from Elsevier. Courtesy of Hongtao Yu and Stuart Schreiber. 
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1. Describe the difference between primary, secondary, 
tertiary, and quaternary structure. 


2.12 Protein Folding 


The elucidation of the tertiary structure of myoglobin in the late 1950s 
led to an appreciation for the complexity of protein architecture. An 
important question immediately arose: How does such a complex, 
folded, asymmetric organization arise in the cell? The first insight into 
this problem began with a serendipitous observation in 1956 by 
Christian Anfinsen at the National Institutes of Health. Anfinsen was 
studying the properties of ribonuclease A, a small enzyme that con- 
sists of a single polypeptide chain of 124 amino acids with four 
disulfide bonds linking various parts of the chain. The disulfide bonds 
of a protein are typically broken (reduced) by adding a reducing agent, 
such as mercaptoethanol, which converts each disulfide bridge to a 
pair of sulfhydryl (—SH) groups (see drawing, page 50). To make all 
of the disulfide bonds accessible to the reducing agent, Anfinsen 
found that the molecule had to first be partially unfolded. The unfold- 
ing or disorganization of a protein is termed denaturation, and it 
can be brought about by a variety of agents, including detergents, 
organic solvents, radiation, heat, and compounds such as urea and 
guanidine chloride, all of which interfere with the various interactions 
that stabilize a protein’s tertiary structure. 

When Anfinsen treated ribonuclease molecules with mercapto- 
ethanol and concentrated urea, he found that the preparation lost all 
of its enzymatic activity, which would be expected if the protein mol- 
ecules had become totally unfolded. When he removed the urea and 
mercaptoethanol from the preparation, he found, to his surprise, that 
the molecules regained their normal enzymatic activity. The active 
ribonuclease molecules that had re-formed from the unfolded pro- 
tein were indistinguishable both structurally and functionally from 
the correctly folded (i.e., native) molecules present at the beginning 
of the experiment (FIGURE 2.43). After extensive study, Anfinsen 
concluded that the linear sequence of amino acids contained all of the 
information required for the formation of the polypeptide's three- 
dimensional conformation. Ribonuclease, in other words, is capable 
of self-assembly. As discussed in Chapter 3, events tend to progress 
toward states of lower energy. According to this concept, the tertiary 
structure that a polypeptide chain assumes after folding is the acces- 
sible structure with the lowest energy, which makes it the most ther- 
modynamically stable structure that can be formed by that chain. It 
would appear that evolution selects for those amino acid sequences 
that generate a polypeptide chain capable of spontaneously arriving 
at a meaningful native state in a biologically reasonable time period. 


Dynamics of Protein Folding 


There have been numerous controversies in the study of protein fold- 
ing. Many of these controversies stem from the fact that the field is char- 
acterized by highly sophisticated experimental, spectroscopic, and 
computational procedures that are required to study complex molecu- 
lar events that typically occur on a microsecond timescale. These efforts 
have often yielded conflicting results and have generated data that are 
open to more than one interpretation. For the sake of simplicity, we 
will restrict the discussion to “simple” proteins, such as ribonuclease, 
that consist of a single domain. One fundamental issue that has been 


Unfolding 
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FIGURE 2.43 Denaturation and refolding of ribonuclease. A native 
ribonuclease molecule (with intramolecular disulfide bonds indicated) is 
reduced and unfolded with B-mercaptoethanol and 8 M urea. After removal 
of these reagents, the protein undergoes spontaneous refolding. 

Source: From C. J. Epstein, R. F. Goldberger, and C. B. Anfinsen, Cold 
Spring Harbor Symp. Quant. Biol. 28:439, 1963. Reprinted with permission 
from Cold Spring Harbor Laboratory Press. 


extensively debated is whether all of the members of a population of 
unfolded proteins of a single species fold along a similar pathway or fold 
by means of a diverse set of routes that somehow converge upon the 
same native state. Recent studies that have simulated folding with the 
aid of high speed computers have suggested that both views are right, 
and that proteins initially explore a wide range of different conforma- 
tions when they first begin to fold but eventually funnel down into an 
increasingly restricted set of possible configurations (see Figure 2.45). 

Another issue that has been roundly debated concerns the types 
of events that occur at various stages during the folding process. In the 
course depicted in FIGURE 2.44a, protein folding is initiated by inter- 
actions among neighboring residues that lead to the formation of 
much of the secondary structure of the molecule. Once the « helices 
and B sheets are formed, subsequent folding is driven by hydrophobic 
interactions that bury nonpolar residues together in the central core 
of the protein. According to an alternate scheme shown in Figure 
2.44b, the first major event in protein folding is the hydrophobic col- 
lapse of the polypeptide to form a compact structure in which the 
backbone adopts a native-like topology. Only after this collapse does 
significant secondary structure develop. Recent studies indicate that 
the two pathways depicted in Figure 2.44 lie at opposite extremes and 
that most proteins probably fold by a middle-of-the-road scheme in 
which secondary structure formation and compaction occur simulta- 
neously. These early folding events lead to the formation of a partially 
folded, transient structure that resembles the native protein but lacks 
many of the specific interactions between amino acid side chains that 
are present in the fully folded molecule (FIGURE 2.45). 

If the information that governs folding is embedded in a pro- 
tein’s amino acid sequence, then it should be possible to predict the 
tertiary structure of the protein just from its sequence. Such de novo 
structural prediction has been a holy grail in protein science and still 
remains an unsolved problem. However, if a protein of interest is 
closely related at the primary sequence level with another protein 
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FIGURE 2.44 Two alternate pathways by which a newly synthesized or 
denatured protein could achieve its native conformation. Curled segments 
represent « helices, and arrows represent f strands. 


FIGURE 2.45 Along the folding pathway. The image on the left shows 
the native tertiary structure of the enzyme acyl-phosphatase. The image on 
the right is the transition structure, which represents the state of the 
molecule at the top of an energy barrier that must be crossed if the protein 
is going to reach the native state. The transition structure consists of 
numerous individual lines because it is a set (ensemble) of closely related 
structures. The overall architecture of the transition structure is similar to 
that of the native protein, but many of the finer structural features of the 
fully folded protein have yet to emerge. Conversion of the transition state 
to the native protein involves completing secondary structure formation, 
tighter packing of the side chains, and finalizing the burial of hydrophobic 
side chains from the aqueous solvent. 

Source: From K. Lindorff-Larsen, et al, Trends Biochem. Sci. 30:14, 2005, Fig. 1B. 
© 2005, with permission from Elsevier. Image from Christopher Dobson. 


whose tertiary structure is known, then it does become possible to 
make a reasonable guess about the tertiary structure of the unknown 
protein by aligning the amino acids of the unknown protein onto the 
corresponding amino acids in the protein whose structure is known, 
a process known as threading. The fact that primary sequence 
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FIGURE 2.46 The role of molecular chaperones in encouraging 
protein folding. The steps are described in the text. (Other families of 
chaperones are known but are not discussed.) 


determines the folding of a protein means that alterations in this 
sequence have the potential to change the way a protein folds, leading 
to an abnormal tertiary structure. In fact, many mutations responsi- 
ble for inherited disorders have been found to alter a protein’s three- 
dimensional structure. In some cases, the consequences of protein 
misfolding can be fatal. Two examples of fatal neurodegenerative 
diseases that result from abnormal protein folding are discussed in 
Section 2.13. 


The Role of Molecular Chaperones 


Not all proteins are able to assume their final tertiary structure by a 
simple process of self-assembly. This is not because the primary struc- 
ture of these proteins lacks the required information for proper folding, 
but rather because proteins undergoing folding have to be prevented 
from interacting nonselectively with other molecules in the crowded 
compartments of the cell. Several families of proteins have evolved 
whose function is to help unfolded or misfolded proteins achieve their 
proper three-dimensional conformation. These “helper proteins” are 
called molecular chaperones, and they selectively bind to short 
stretches of hydrophobic amino acids that tend to be exposed in non- 
native proteins but buried in proteins having a native conformation. 
FIGURE 2.46 depicts the activities of two families of molecular 
chaperones that operate in the cytosol of eukaryotic cells. Molecular 
chaperones are involved in a multitude of activities within cells, 
ranging from the import of proteins into organelles (see Figure 
8.47a) to the prevention and reversal of protein aggregation. We will 
restrict the discussion to their actions on newly synthesized proteins. 


Native 
polypeptide 


Chaperonin 
(TRiC) 


Polypeptide chains are synthesized on ribosomes by the addition of 
amino acids, one at a time, beginning at the chain’s N-terminus (step 1, 
Figure 2.46). Chaperones of the Hsp70 family bind to elongating 
polypeptide chains as they emerge from an exit channel within the 
large subunit of the ribosome (step 2). Hsp70 chaperones are thought 
to prevent these partially formed polypeptides (i.e., nascent polypep- 
tides) from binding to other proteins in the cytosol, which would 
cause them either to aggregate or misfold. Once their synthesis has 
been completed (step 3), many of these proteins are simply released 
by the chaperones into the cytosol where they spontaneously fold 
into their native state (step 4). Other proteins are repeatedly bound 
and released by chaperones until they finally reach their fully folded 
state. Many of the larger polypeptides are transferred from Hsp70 
proteins to a different type of chaperone called a chaperonin (step 5). 
Chaperonins are cylindrical protein complexes that contain cham- 
bers in which newly synthesized polypeptides can fold without inter- 
ference from other macromolecules in the cell. TRiC is a chaperonin 
thought to assist in the folding of up to 15 percent of the polypep- 
tides synthesized in mammalian cells. The discovery and mecha- 
nism of action of Hsp70 and chaperonins are discussed in depth in 
the Experimental Pathways in Section 2.14 on page 67. 
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1. Given that proteins act as molecular machines, explain 
why conformational changes are so important in 
protein function. 


—— 


2.13 THE HUMAN PERSPECTIVE 


Protein Misfolding Can Have Deadly Consequences 


In April 1996 a paper was published in the medical journal Lancet that 
generated widespread alarm in the populations of Europe. The 
paper described a study of 10 persons afflicted with Creutzfeld- 
Jakob disease (CJD), a rare, fatal disorder that attacks the brain, 
causing a loss of motor coordination and dementia. Like numerous 
other diseases, CJD can occur as an inherited disease that runs in 
certain families or as a sporadic form that appears in individuals 
who have no family history of the disease. Unlike virtually every 


other inheritable disease, however, CJD can also be acquired. Until 
recently, persons who had acquired CJD had been recipients of 
organs or organ products that were donated by a person with undi- 
agnosed CJD. The cases described in the 1996 Lancet paper had 
also been acquired, but the apparent source of the disease was 
contaminated beef that the infected individuals had eaten years 
earlier. The contaminated beef was derived from cattle raised in 
England that had contracted a neurodegenerative disease that 


caused the animals to lose motor coordination and develop 
demented behavior. The disease became commonly known as 
“mad cow disease.” Patients who have acquired CJD from eating 
contaminated beef can be distinguished by several criteria from 
those who suffer from the classical forms of the disease. To date, 
roughly 200 people have died of CJD acquired from contaminated 
beef, and the numbers of such deaths have been declining. 

A disease that runs in families can invariably be traced to a 
faulty gene, whereas diseases that are acquired from a contam- 
inated source can invariably be traced to an infectious agent. 
How can the same disease be both inherited and infectious? 
The answer to this question has emerged gradually over the past 
several decades, beginning with observations by D. Carleton 
Gajdusek in the 1960s on a strange malady that once afflicted 
the native population of Papua, New Guinea. Gajdusek showed 
that these islanders were contracting a fatal neurodegenerative 
disease—which they called “kuru”—during a funeral ritual in 
which they ate the brain tissue of a recently deceased relative. 
Autopsies of the brains of patients who had died of kuru showed 
a distinct pathology, referred to as spongiform encephalopathy, 
in which certain brain regions were riddled with microscopic holes 
(vacuolations), causing the tissue to resemble a sponge. 

It was soon shown that the brains of islanders suffering from 
kuru were strikingly similar in microscopic appearance to the 
brains of persons afflicted with CJD. This observation raised an 
important question: Did the brain of a person suffering from CJD, 
which was known to be an inherited disease, contain an infectious 
agent? In 1968, Gajdusek showed that when extracts prepared 


On the surface, this would suggest that the epidemic has run its course, but 
there are several reasons for public health officials to remain concerned. For 
one, studies of tissues that had been removed during surgeries in England 
indicate that thousands of people are likely to be infected with the disease 
without exhibiting symptoms (discussed in Science 335:411, 2012). Even if 
these individuals never develop clinical disease, they remain potential carriers 
who could pass CJD on to others through blood transfusions. In fact, at least 
two individuals are believed to have contracted CJD after receiving blood 
from a donor harboring the disease. These findings underscore the need to 
test blood for the presence of the responsible agent (whose nature is 
discussed below). 


from a biopsy of the brain of a person who had died from CJD 
were injected into a suitable laboratory animal, that animal did 
indeed develop a spongiform encephalopathy similar to that of 
kuru or CJD. Clearly, the extracts contained an infectious agent, 
which at the time was presumed to be a virus. 

In 1982, Stanley Prusiner of the University of California, San 
Francisco, published a paper suggesting that, unlike viruses, the 
infectious agent responsible for CJD lacked nucleic acid and instead 
was composed solely of protein. He called the protein a prion. This 
“protein only” hypothesis, as it is called, was originally met with 
considerable skepticism, but subsequent studies by Prusiner and 
others have provided overwhelming support for the proposal. It was 
presumed initially that the prion protein was an external agent— 
some type of virus-like particle lacking nucleic acid. Contrary to 
this expectation, the prion protein was soon shown to be encoded 
by a gene (called PRNP) within the cell's own chromosomes. The 
gene is expressed within normal brain tissue and encodes a protein 
designated PrP© (standing for prion protein cellular) that resides at 
he surface of nerve cells. The precise function of PrP remains a mys- 
tery. A modified version of the protein (designated PrP*°, standing 
for prion protein scrapie) is present in the brains of humans with CJD. 
Unlike the normal PrP©, the modified version of the protein accumu- 
ates within nerve cells, forming aggregates that kill the cells. 

In their purified states, PrP’ and PrP*° have very different 
physical properties. PrP© remains as a monomeric molecule that 
is soluble in salt solutions and is readily destroyed by protein- 
digesting enzymes. In contrast, PrP°° molecules interact with one 
another to form insoluble fibrils that are resistant to enzymatic 
digestion. Based on these differences, one might expect these 
two forms of the PrP protein to be composed of distinctly differ- 
ent sequences of amino acids, but this is not the case. The two 
forms can have identical amino acid sequences, but they differ in 
the way the polypeptide chain folds to form the three-dimensional 
protein molecule (FIGURE 1). Whereas a PrP© molecule consists 
largely of a-helical segments and interconnecting coils, the core 
of a PrPS* molecule consists largely of B sheet. 

It is not hard to understand how a mutant polypeptide might 
be less stable and more likely to fold into the abnormal PrP*¢ 
conformation, but how is such a protein able to act as an infectious 


(a) 


(b) 


FIGURE 1 A contrast in structure. (a) Tertiary structure of the normal (PrP°) protein as determined by NMR spectroscopy. The orange portions 
represent a-helical segments, and the blue portions are short B strands. The yellow dotted line represents the N-terminal portion of the polypeptide, 


which lacks defined structure. (b) A proposed model of the abnormal, infectious (PrPS°) prion protein, which consists largely of B-sheet. The actual 


tertiary structure of the prion protein has not been determined. The two molecules shown in this figure are formed by polypeptide chains that can be 


identical in amino acid sequence but fold very differently. As a result of the differences in folding, PrP© remains soluble, whereas PrP*< produces 


aggregates that kill the cell. (The two molecules shown in this figure are called conformers because they differ only in conformation.). 


Source: (a) From Adriana Verschoor, et al., J. Cell Biol., Vol. 133 (cover 3), 1996; by copyright permission of Rockefeller University Press; (b) Reprinted 


from S.B. Prusiner, Trends Biochem. Sci. 21:483,1996 Copyright 1996, with permission from Elsevier. 
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agent? According to the prevailing hypothesis, an abnormal prion 
molecule (PrP*°) can bind to a normal protein molecule (PrP©) and 
cause the normal protein to fold into the abnormal form. This 
conversion can be shown to occur in the test tube: Addition of 
PrPS to a preparation of PrP© can convert the PrP© molecules into 
the PrP** conformation. According to this hypothesis, the appear- 
ance of the abnormal protein in the body—whether as a result 
of a rare misfolding event in the case of sporadic disease or by 
exposure to contaminated beef—starts a chain reaction in which 
normal protein molecules in the cells are gradually converted to 
the misshapen prion form as they are recruited into growing insol- 
uble fibrils. The precise mechanism by which prions lead to neu- 
rodegeneration remains unclear. 

CJD is a rare disease caused by a protein with unique infec- 
tive properties. Alzheimer’s disease (AD), on the other hand, is a 
common disorder that strikes as many as 10 percent of individuals 
who are at least 65 years of age and perhaps 40 percent of individ- 
uals who are 80 years or older. Persons with AD exhibit memory 
loss, confusion, and a loss of reasoning ability. CJD and AD share 
a number of important features. Both are fatal neurodegenerative 
diseases that can occur in either an inherited or sporadic form. 
Like CJD, the brain of a person with Alzheimer's disease contains 
fibrillar deposits of an insoluble material referred to as amyloid 
(FIGURE 2).* In both diseases, the fibrillar deposits result from the 
self-association of a polypeptide composed predominantly of 
6 sheet. There are also many basic differences between the two 
diseases: the proteins that form the disease-causing aggregates 
are unrelated, the parts of the brain that are affected are distinct, 
and the protein responsible for AD is not considered to be an 
infectious agent (i.e., it does not spread in a contagious pattern 
from one person to another, although it may spread from cell to 
cell within the brain). 


2it should be noted that the term amyloid is not restricted to the abnormal 
protein found in AD. Many different proteins are capable of assuming an 
abnormal conformation that is rich in B-sheet, which causes the protein 
monomers to aggregate into characteristic amyloid fibrils that bind certain 
dyes. Amyloid fibrils are defined by their molecular structure in which the 
B-strands are oriented perpendicular to the long axis of the fibrils. The 
PrPS*-forming fibrils of prion diseases are also described as amyloid. 


Amyloid plaque 


Neurofibrillary 
tangle (NFT) 


(a) 


Over the past two decades, research on AD has been dom- 
inated by the amyloid hypothesis, which contends that the dis- 
ease is caused by the production of a molecule, called the 
amyloid B-peptide (Ag). AB is originally part of a larger protein 
called the amyloid precursor protein (APP), which spans the 
nerve cell membrane. The Af peptide is released from the APP 
molecule following cleavage by two specific enzymes, B-secre- 
tase and y-secretase (FIGURE 3). The length of the AB peptide 
is somewhat variable. The predominant species has a length of 
40 amino acids (designated as AB40), but a minor species with 
two additional hydrophobic residues (designated as AB42) is also 
produced. Both of these peptides can exist in a soluble form that 
consists predominantly of a helices, but AB42 has a tendency to 
spontaneously refold into a very different conformation that con- 
tains considerable B sheet. It is the misfolded AB42 version of the 
molecule that has the greatest potential to cause damage to the 
brain. AB42 tends to self-associate to form small complexes (olig- 
omers) as well as large aggregates that are visible as fibrils in the 
electron microscope. These amyloid fibrils are deposited outside 
of the nerve cells in the form of extracellular amyloid plaques 
(Figure 2). Although the issue is far from settled, a body of evi- 
dence suggests that it is the soluble oligomers that are most toxic 
to nerve cells, rather than the insoluble aggregates. Cultured 
nerve cells, for example, are much more likely to be damaged 
by the presence of soluble intracellular AB oligomers than by 
either AB monomers or extracellular fibrillar aggregates. In the 
brain, the AB oligomers appear to attack the synapses that con- 
nect one nerve cell to another and eventually lead to the death 
of the nerve cells. Persons who suffer from an inherited form of 
AD carry a mutation that leads to an increased production of the 
Af42 peptide. Overproduction of AB42 can be caused by posses- 
sion of extra copies (duplications) of the APP gene, by mutations 
in the APP gene, or by mutations in genes (PSEN1, PSEN2) that 
encode subunits of y-secretase. Individuals with such mutations 
exhibit symptoms of the disease at an early age, typically in their 
50s. The fact that all mutations associated with these inherited, 
early-onset forms of AD lead to increased production of AB42 is 
the strongest argument favoring amyloid formation as the under- 
lying basis of the disease. The strongest argument against the 


Normal Alzheimer’s 


FIGURE 2 Alzheimer's disease. (a) The defining characteristics of brain tissue from a person who died of Alzheimer's disease. (b) Amyloid plaques 
containing aggregates of the AB peptide appear extracellularly (between nerve cells), whereas neurofibrillary tangles (NFTs) appear within the cells 
themselves. NFTs, which are discussed at the end of the Human Perspective, are composed of misfolded tangles of a protein called tau that is involved 
in maintaining the microtubule organization of the nerve cell. Both the plaques and tangles have been implicated as a cause of the disease. 

Source: (a) © Thomas Deerinck, NCMIR/Photo Researchers, Inc.; (b) © American Health Assistance Foundation. 
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FIGURE 3 Formation of the AB peptide. The Af peptide is carved 
from the amyloid precursor protein (APP) as the result of cleavage by 
two enzymes, f-secretase (also called BACE1) and y-secretase. It is 
interesting that APP and the two secretases are all proteins that span the 
membrane. Cleavage of APP occurs inside the cell (probably in the 
endoplasmic reticulum), and the AB product is ultimately secreted into 
the space outside of the cell. The y-secretase can cut at either of two 
sites in the APP molecule, producing either AB40 or AB42 peptides, the 
latter of which is primarily responsible for production of the amyloid 
plaques seen in Figure 2. y-Secretase is a multisubunit enzyme that 
cleaves its substrate at a site within the membrane. 


amyloid hypothesis is the weak correlation that can exist bet- 
ween the number and size of amyloid plaques in the brain and 
the severity of the disease. Elderly persons who show little or no 
sign of memory loss or dementia can have relatively high levels of 
amyloid deposits in their brain and those with severe disease can 
have little or no amyloid deposition. 

All of the drugs currently on the market for the treatment 
of AD are aimed only at management of symptoms; none has 
any effect on stopping disease progression. With the amyloid 
hypothesis as the guiding influence, researchers have followed 
three basic strategies in the pursuit of new drugs for the preven- 
tion and/or reversal of mental decline associated with AD. These 
strategies are (1) to prevent the formation of the AB42 peptide 
in the first place; (2) to remove the AB42 peptide (or the amyloid 
deposits it produces) once it has been formed; and (3) to pre- 
vent the interaction between AB molecules, thereby preventing 
the formation of both oligomers and fibrillar aggregates. Before 
examining each of these strategies, we can consider how investi- 
gators can determine what type of drugs might be successful in 
the prevention or treatment of AD. 

One of the best approaches to the development of treat- 
ments for human diseases is to find laboratory animals, particu- 
larly mice, that develop similar diseases, and use these animals to 
test the effectiveness of potential therapies. Animals that exhibit 
a disease that mimics a human disease are termed animal models. 
For whatever reason, the brains of aging mice show no evidence 
of the amyloid deposits found in humans, and, up until 1995, there 
was no animal model for AD. Then, in that year, researchers found 
that they could create a strain of mice that developed amyloid 
plaques in their brain and performed poorly at tasks that required 


memory. They created this strain by genetically engineering the 
mice to carry a mutant human APP gene, one responsible for 
causing AD in families. These genetically engineered (transgenic) 
mice have proven invaluable for testing potential therapies for 
AD. The greatest excitement in the field of AD therapeutics has 
centered on the second strategy mentioned above, and we can 
use these investigations to illustrate some of the steps required in 
the development of a new drug. 

In 1999, Dale Schenk and his colleagues at Elan Pharmaceuticals 
published an extraordinary finding. They had discovered that the 
formation of amyloid plaques in mice carrying the mutant human 
APP gene could be blocked by repeatedly injecting the animals 
with the very same substance that causes the problem, the aggre- 
gated AB42 peptide. In effect, the researchers had immunized (i-e., 
vaccinated) the mice against the disease. When young (6-week- 
old) mice were immunized with AB42, they failed to develop the 
amyloid brain deposits as they grew older. When older (13-month- 
old) mice whose brains already contained extensive amyloid 
deposits were immunized with the AB42, a significant fraction of 
the fibrillar deposits was cleared out of the nervous system. Even 
more importantly, the immunized mice performed better than their 
nonimmunized littermates on memory-based tests. 

The dramatic success of these experiments on mice, combined 
with the fact that the animals showed no ill effects from the 
immunization procedure, led government regulators to quickly 
approve a Phase | clinical trial of the AB42 vaccine. A Phase | 
clinical trial is the first step in testing a new drug or procedure in 
humans and usually comes after years of preclinical testing on cul- 
tured cells and animal models. Phase | tests are carried out on a 
small number of subjects and are designed to monitor the safety 
of the therapy and the optimal dose of the drug rather than its 
effectiveness against the disease. 

None of the subjects in two separate Phase | trials of the AB 
vaccine showed any ill-effects from the injection of the amyloid 
peptide. As a result, the investigators were allowed to proceed to 
a Phase II clinical trial, which involves a larger group of subjects 
and is designed to obtain a measure of the effectiveness of the 
procedure (or drug). This particular Phase II trial was carried out 
as a randomized, double-blind, placebo-controlled study. In this 
type of study: 


1. The patients are randomly divided into two groups that are 
treated similarly except that one group is given the curative 
factor (protein, antibodies, drugs, etc.) being investigated and 
the other group is given a placebo (an inactive substance that 
has no therapeutic value). 


2. The study is double-blinded, which means that neither the 
researchers nor patients know who is receiving treatment and 
who is receiving the placebo. 


The Phase ll trial for the AB vaccine began in 2001 and enrolled 
more than 350 individuals in the United States and Europe who 
had been diagnosed with mild to moderate AD. After receiving 
two injections of synthetic B-amyloid (or a placebo), 6 percent of 
the subjects experienced a potentially life-threatening inflamma- 
tion of the brain. The inflammation in most of these patients was 
successfully treated with steroids, but the trial was discontinued. 
More recently, vaccination trials have been conducted using frag- 
ments of AB protein that do not induce inflammation but the 
results of these trials are not yet known. 

Once it had become apparent that vaccination of patients 
with AB42 had inherent risks, it was decided to pursue a safer 
form of immunization therapy, which is to administer antibodies 
directed against AB that have been produced outside the body. 
This type of approach is known as passive immunization because 
the person does not produce the therapeutic antibodies. Passive 
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immunization with an anti-AB42 antibody (called bapineuzumab) 
had already proven capable of restoring memory function in 
transgenic mice and was quickly shown to be safe, and appar- 
ently effective, in Phase | and Il clinical trials. The last step 
before government approval is a Phase III trial, which typically 
employs large numbers of subjects (a thousand or more at sev- 
eral research centers) and compares the effectiveness of the 
new treatment against standard approaches. The first results 
of the Phase III trials on bapineuzumab were reported in 2008 
and were disappointing: There was little or no evidence that the 
antibody provided benefits in preventing the progression of the 
disease. Another antibody known as Solanezumab, which rec- 
ognizes a different part of the AB protein, has been the sub- 
ject of ongoing Phase Ill trials and so far the results indicate 
effectiveness in restoring cognitive function in patients with 
mild Alzheimer’s symptoms, but no effect in patients with more 
severe symptoms. Given the impact of AD on human health 
and the large amount of money that could be earned from this 
type of drug, pharmaceutical companies are willing to take the 
risk that one of these immunologic strategies will exhibit some 
therapeutic value. 

Meanwhile, a comprehensive analysis of some of the patients 
who had been vaccinated with AB42 in the original immunization 
trial from 2001 was also reported in 2008. Analysis of this patient 
group indicated that the AB42 vaccination had had no effect on 
preventing disease progression. It was particularly striking that in 
several of these patients who had died of severe dementia, there 
were virtually no amyloid plaques left in their brains. This finding 
strongly suggests that removal of amyloid deposits in a patient 
already suffering the symptoms of mild-to-moderate dementia 
does not stop disease progression. These results can be inter- 
preted in more than one way. One interpretation is that the amy- 
loid deposits are not the cause of the symptoms of dementia. 
An alternate interpretation is that irreversible toxic effects of the 
deposits had already occurred by the time immunization had 
begun and it was too late to reverse the disease course using 
treatments that remove existing amyloid deposits. It is important 
to note, in this regard, that the formation of amyloid deposits in 
the brain begins 10 or more years before any clinical symptoms of 
AD are reported. It is possible that, if these treatments had started 
earlier, the symptoms of the disease might never have appeared. 
The first clinical trial of this type (known as a “preventive trial”) 
was begun in 2012 as several hundred individuals who would nor- 
mally be destined to develop early-onset AD (due to mutations 
in the PSEN17 gene) were treated with an anti-AB antibody in the 
hopes of blocking the future buildup of amyloid and preventing 
the disease. Additional trials using larger numbers of patients 
who do not carry any known AD predisposing mutations have also 
been started, using the new generation antibody Solanezumab. 
Preventive trials which test the ability of a treatment to slow the 
onset of a disease that normally can take decades to manifest 
itself naturally take a long time to carry out, and so the results are 
not yet known. 

Clearly, preventive treatments make the most sense for 
patients who might be on the verge of developing disease. 
Recent advances in brain-imaging procedures now allow clini- 
cians to observe amyloid deposits in the brains of individuals 
long before any symptoms of AD have developed (FIGURE 4). 
Based on these studies, it may be possible to begin preventive 
treatments in persons who are at very high risk of developing AD 
before they develop symptoms. 

Drugs have also been developed that follow the other two 
strategies outlined above. Alzhemed and scyllo-inositol are 
two small molecules that bind to AB peptides and block molec- 
ular aggregation and fibril formation. Clinical trials have failed 


FIGURE 4 A neuroimaging technique that reveals the presence of 
amyloid in the brain. These PET (positron emission tomography) scans 
show the brains of two individuals that have ingested a radioactive 
compound, called flutemetamol, that binds to amyloid deposits and 
appears red in the image. The left panels show a healthy brain and the 
right panels show a brain from a patient with AD, revealing extensive 
amyloid build-up. Amyloid deposits in the brain can be detected with this 
technique in persons who show no evidence of cognitive dysfunction. 
Such symptom-free individuals are presumed to be at high risk of going 
on to develop AD. Those who lack such deposits can be considered at 
very low risk of the disease in the near future. 

Source: Courtesy of Ken Garber, Nature Biotechnology 30, 575 (2012) 
doi:10.1038/nbt0712-575 Published online 10 July 2012. 


to demonstrate that either drug is effective in stopping disease 
progression in patients with mild to moderate AD. The third 
strategy outlined above is to stop production of AB peptides. 
This can be accomplished by inhibiting either B- or y-secretase, 
because both enzymes are required in the pathway that cleaves 
the APP precursor to release the internal peptide (Figure 3). 
Pharmaceutical companies have had great difficulty developing 
a B-secretase inhibitor that is both potent and small enough to 
enter the brain from the bloodstream, however one such inhibitor, 
known as MK-8931, is currently in Phase Ill clinical trials. A number 
of potent y-secretase inhibitors have been developed that block 
the production of all AB peptides, both in cultured nerve cells and 
in transgenic AD mice. But there is a biological problem that has 
to be overcome with this class of inhibitors. In addition to cleav- 
ing APP, y-secretase activity is also required in a key signaling 
pathway involving a protein called Notch. Two of the most prom- 
ising y-secretase inhibitors, flurizan and semagacestat, have both 
failed to show any benefit in stopping AD progression. In addition 
to its lack of efficacy, semagacestat caused adverse side effects 
that were probably a result of blockade of the Notch pathway. The 
goal of drug designers is to develop a compound (e.g., begaces- 
tat) that blocks APP cleavage but does not interfere with cleavage 
of Notch. 

Taken collectively, the apparent failure of all of these drugs, 
aimed at various steps in the formation of AB-containing aggregates 
and amyloid deposition, has left the field of AD therapeutics without 
a clear plan for the future. Some pharmaceutical companies 
are continuing to develop new drugs aimed at blocking the 


formation of amyloid aggregates, whereas others are moving 
in different directions. These findings also raise a more basic 
question: Is the AB peptide even part of the underlying mech- 
anism that leads to AD? It hasn't been mentioned, but AB is not 
the only misfolded protein found in the brains of persons with 
AD. Another protein called tau, which functions as part of a nerve 
cell’s cytoskeleton (Section 9.2), can develop into bundles of 
tangled cellular filaments called neurofibrillary tangles (or NFTs) 
(Figure 2) that interfere with the movement of substances down 
the length of the nerve cell. NFTs form when the tau molecules 
in nerve cells become excessively phosphorylated. Mutations in 
the gene that encodes tau have been found to cause a rare form 
of dementia (called FTD), which is characterized by the formation 
of NFTs. Thus, NFTs have been linked to dementia, but they have 
been largely ignored as a causative factor in AD pathogenesis, 
due primarily to the fact that the transgenic AD mouse models 
discussed above do not develop NFTs. If one extrapolates the 
results of these mouse studies to humans, they suggest that NFTs 
are not required for the cognitive decline that occurs in patients 
with AD. At the same time, however, autopsies of the brains of 
humans who died of AD suggest that NFT burden correlates 
much better with cognitive dysfunction and neuronal loss than 
does the concentration of amyloid plaques. Given that mutations 
in genes in the AB pathway are clearly a cause of AD, and yet it 
is the NFT burden that correlates with cognitive decline, it would 
appear that both AB and NFTs must be involved in AD etiology. 
Many researchers believe that AB deposition somehow leads 


to NFT formation, an idea known as the “amyloid cascade hy- 
pothesis,” but the mechanism by which this might occur remains 
unknown. In fact, one of the most promising new drugs at the 
time of this writing is one that acts on NFTs rather than B-amyloid. 
In this case, the drug methylthioninium chloride, which dissolves 
NFTs, was tested on a group of more than 300 patients with mild 
to moderate AD in a Phase II trial. The drug was found to reduce 
mental decline over a period of one year by an average of 81 
percent compared to patients receiving a placebo. A modified 
version of the drug, known as leuco methylthioninium, is now 
being tested inlarger Phase Ill studies for both AD and FTD, but 
results are not yet available. Other compounds that inhibit one 
of the enzymes (GSK-3) that adds phosphate groups to the tau 
protein are also being investigated as AD therapeutics. Clinical 
trials of one GSK-3 inhibitor, valproate, have been stopped due to 
adverse effects. (The results of studies on these and other treat- 
ments can be examined at www.alzforum.org/dis/tre/drc) 

It is evident from this discussion that a great deal of work 
on AD has been based on transgenic mice carrying human AD 
genes. These animals have served as the primary preclinical sub- 
jects for testing AD drugs, and they have been used extensively in 
basic research that aims to understand the disease mechanisms 
responsible for the development of AD. But many questions have 
been raised as to how accurately these animal models mimic the 
disease in humans, particularly the sporadic human cases in which 
affected individuals lack the mutant genes that cause the animals 
to develop the corresponding disorder. 


2.14 EXPERIMENTAL PATHWAYS 


Chaperones—Helping Proteins Reach Their Proper Folded State 


In 1962, F. M. Ritossa, an Italian biologist studying the development 
of the fruit-fly Drosophila, reported a curious finding.'! When the 
temperature at which fruit-fly larvae were developing was raised 
from the normal 25°C to 32°C, a number of new sites on the giant 
chromosomes of the larval cells became activated. As we will 
see in Chapter 10, the giant chromosomes of these insect larvae 
provide a visual exhibit of gene expression (see Figure 10.8). 
The results suggested that increased temperature induced the 
expression of new genes, a finding that was confirmed a decade 
later with the characterization of several proteins that appeared 
in larvae following temperature elevation.” It was soon found that 
this response, called the heat-shock response, was not confined 
to fruit flies, but can be initiated in many different cells from vir- 
tually every type of organism—from bacteria to plants and mam- 
mals. Closer examination revealed that the proteins produced 
during the response were found not only in heat-shocked cells, 
but also at lower concentration in cells under normal conditions. 
What is the function of these so-called heat-shock proteins (hsps)? 
The answer to this question was gradually revealed by a series of 
seemingly unrelated studies. 

We will see on page 79 that some complex, multisubunit 
structures, such as a bacterial ribosome or a tobacco mosaic virus 
particle, can self-assemble from purified subunits. It was demon- 
strated in the 1960s that the proteins that make up bacteriophage 
particles (see Figure 1.22c) also possess a remarkable ability to 


self-assemble, but they are generally unable to form a complete, 
functional virus particle by themselves in vitro. Experiments on 
phage assembly in bacterial cells confirmed that phages require 
bacterial help. It was shown in 1973, for example, that a certain 
mutant strain of bacteria, called GroE, could not support the 
assembly of normal phages. Depending on the type of phage, 
the head or the tail of the phage particle was assembled incor- 
rectly.*4 These studies suggested that a protein encoded by the 
bacterial chromosome participated in the assembly of viruses, 
even though this host protein was not a component of the final 
virus particles. Because it obviously did not evolve as an aid for 
virus assembly, the bacterial protein required for phage assem- 
bly had to play some role in the cell's normal activities, but the 
precise role remained obscure. Subsequent studies revealed that 
the GroE site on the bacterial chromosome actually contains two 
separate genes, GroEL and GroES, that encode two separate pro- 
teins GroEL and GroES. Under the electron microscope, the puri- 
fied GroEL protein appeared as a cylindrical assembly consisting 
of two disks. Each disk was composed of seven subunits arranged 

symmetrically around the central axis (FIGURE 1).°° 
Several years later, a study on pea plants hinted at the 
existence of a similar assembly-promoting protein in the chloro- 
plasts of plants.’ Rubisco is a large protein in chloroplasts that 
catalyzes the reaction in which CO molecules taken up from the 
atmosphere are covalently linked to organic molecules during 
continued 
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FIGURE 1 An early model of the GroEL complex built according to 
data from electron microscopy and molecular-weight determination. 
The complex is seen to consist of two disks, each composed of 

seven identical subunits arranged symmetrically around a central axis. 
Subsequent studies showed the complex contains two internal chambers. 


Source: From T. Hohn et al., J. Mol. Biol. 129:371, © 1979, with 
permission of Elsevier. 


photosynthesis (Section 6.9). Rubisco comprises 16 subunits: 
8 small subunits (molecular mass of 14,000 daltons) and 8 large 
subunits (55,000 daltons). It was found that large Rubisco sub- 
units, synthesized inside the chloroplast, are not present in an 
independent state, but are associated with a huge protein assem- 
bly consisting of identical subunits of 60,000 daltons (60 kDa) 
molecular mass. In their paper, the researchers considered the 
possibility that the complex formed by the large Rubisco subunits 
and the 60-kDa polypeptides was an intermediate in the assem- 
bly of a complete Rubisco molecule. 

A separate line of investigation on mammalian cells also 
revealed the existence of proteins that appeared to assist the 
assembly of multisubunit proteins. Like Rubisco, antibody mole- 
cules consist of a complex of two different types of subunits, smaller 
light chains and larger heavy chains. Just as the large subunits of 
Rubisco become associated with another protein not found in the 
final complex, so too do the heavy chains of an antibody com- 
plex.® This protein, which associates with newly synthesized heavy 
chains, but not with heavy chains that are already bound to light 
chains, was named binding protein, or BiP. BiP was subsequently 
found to have a molecular mass of 70,000 daltons (70 kDa). 

To this point, we have been discussing two lines of investiga- 
tion: one concerned with the heat-shock response and the other 
with proteins that promote protein assembly. These two fields 
came together in 1986, when it was shown that one of the pro- 
teins that figured most prominently in the heat-shock response, 
a protein that had been named heat-shock protein 70 (hsp70) 
because of its molecular mass, was identical to BiP, the protein 
implicated in the assembly of antibody molecules.’ 

Even before the discovery of the heat-shock response, the 
structure of proteins was known to be sensitive to temperature, 
with a small rise in temperature causing these delicate molecules 
to begin to unfold. Unfolding exposes hydrophobic residues that 
were previously buried in the protein’s core. Just as fat molecules 
in a bowl of soup are pushed together into droplets, so too are 
proteins with hydrophobic patches on their surface. Consequently, 
when a cell is heat shocked, soluble proteins become denatured 
and form aggregates. A report in 1985 demonstrated that, fol- 
lowing temperature elevation, newly synthesized hsp70 mole- 
cules enter cell nuclei and bind to aggregates of nuclear proteins, 
where they act like molecular crowbars to promote disaggrega- 
tion.'° Because of their role in assisting the assembly of proteins 


by preventing undesirable interactions, hsp70 and related 
molecules were named molecular chaperones." 

It was soon demonstrated that the bacterial heat-shock 
protein GroEL and the Rubisco assembly protein in plants were 
homologous proteins. In fact, the two proteins share the same 
amino acids at nearly half of the more than 500 residues in their 
respective molecules.'* The fact that the two proteins—both 
members of the Hsp60 chaperone family—have retained so 
many of the same amino acids reflects their similar and essential 
function in the two types of cells. But what was that essential 
function? At this point it was thought that their primary function 
was to mediate the assembly of multisubunit complexes, such as 
Rubisco. This view was changed in 1989 by experiments study- 
ing molecular chaperones in mitochondria by Arthur Horwich of 
Yale University and F.-Ulrich Hartl, Walter Neupert, and their col- 
leagues at the University of Munich.'*"4 It was known that newly 
made mitochondrial proteins produced in the cytosol had to cross 
the outer mitochondrial membranes in an unfolded, extended, 
monomeric form. A mutant was found that altered the activity 
of another member of the Hsp60 chaperone family that resided 
inside mitochondria. In cells containing this mutant chaperone, 
proteins that were transported into mitochondria failed to fold 
into their active forms. Even proteins that consisted of a single 
polypeptide chain failed to fold into their native conformation. 
This finding changed the perception of chaperone function from 
a notion that they assist assembly of already folded subunits into 
larger complexes, to our current understanding that they assist 
polypeptide chain folding within the crowded confines of the cell. 

The results of these and other studies indicated the presence 
in cells of at least two major families of molecular chaperones: 
the Hsp70 chaperones, such as BiP, and the Hsp60 chaperones 
(which are also called chaperonins), such as Hsp60, GroEL, and 
the Rubisco assembly protein. We will focus on the Hspé0 chap- 
eronins, such as GroEL, which are best understood. 

As first revealed in 1979, GroEL is a huge molecular complex 
of 14 polypeptide subunits arranged in two stacked rings resem- 
bling a double doughnut.** Fifteen years after these first electron 
micrographs were taken, the three-dimensional structure of the 
GroEL complex was determined by X-ray crystallography.'® The 
study revealed the presence of a central cavity within the GroEL 
cylinder. Subsequent studies demonstrated that this cavity was 
divided into two separate chambers. Each chamber was situated 
within the center of one of the rings of the GroEL complex and 
was large enough to enclose a polypeptide undergoing folding. 

Electron microscopic studies also provided information 
about the structure and function of a second protein, GroES, 
which acts in conjunction with GroEL. Like GroEL, GroES is a ring- 
like protein with seven subunits arrayed symmetrically around a 
central axis. GroES, however, consists of only one ring, and its 
subunits are much smaller (10,000 daltons) than those of GroEL 
(60,000 daltons). GroES is seen as a cap or dome that fits on top 
of either end of a GroEL cylinder (FIGURE 2). The attachment of 
GroES to one end of GroEL causes a dramatic conformational 
change in the GroEL protein that markedly increases the volume 
of the enclosed chamber at that end of the complex." 

The importance of this conformational change has been 
revealed in remarkable detail by X-ray crystallographic studies 
in the laboratories of Arthur Horwich and Paul Sigler at Yale 
University.'” As shown in FIGURE 3, the binding of the GroES cap 
is accompanied by a 60° rotation of the apical (red) domain of the 
subunits that make up the GroEL ring at that end of the GroEL 
cylinder. The attachment of GroES does more than trigger a confor- 
mational change that enlarges the GroEL chamber. Before attach- 
ment of GroES, the inner wall of the GroEL chamber has exposed 
hydrophobic residues that give the lining a hydrophobic character. 


FIGURE 2. Reconstructions of GroEL based on high-resolution electron 
micrographs taken of specimens that had been frozen in liquid ethane and 
examined at -170°C. The image on the left shows the GroEL complex, 
and that on the right shows the GroEL complex with GroES, which appears 
as a dome on one end of the cylinder. It is evident that the binding of the 
GroES is accompanied by a marked change in conformation of the apical 
end of the proteins that make up the top GroEL ring (arrow), which results 
in a marked enlargement of the upper chamber. 


Source: From S. Chen, et al., Courtesy of Helen R. Saibil, Nature 371:263, 
© 1994, reprinted by permission from Macmillan Publishers Limited. 


Nonnative polypeptides also have exposed hydrophobic residues 
that become buried in the interior of the native polypeptide. 
Because hydrophobic surfaces tend to interact, the hydrophobic 
lining of the GroEL cavity binds to the surface of nonnative poly- 
peptides. Binding of GroES to GroEL buries the hydrophobic res- 
idues of the GroEL wall and exposes a number of polar residues, 
thereby changing the character of the chamber wall. As a result 
of this change, a nonnative polypeptide that had been bound to 
the GroEL wall by hydrophobic interactions is displaced into the 
space within the chamber. Once freed from its attachment to the 
chamber wall, the polypeptide is given the opportunity to con- 
tinue its folding in a protected environment. After about 15 sec- 
onds, the GroES cap dissociates from the GroEL ring, and the 
polypeptide is ejected from the chamber. If the polypeptide has 
not reached its native conformation by the time it is ejected, it can 
rebind to the same or another GroEL, and the process is repeated. 
A model depicting some of the steps thought to occur during 
GroEL-GroES-assisted folding is shown in FIGURE 4. 

Approximately 250 of the roughly 2400 proteins present in the 
cytosol of an E. coli cell normally interact with GroEL.'® How is it 
possible for a chaperone to bind so many different polypeptides? 
The GroEL binding site consists of a hydrophobic surface formed 
largely by two « helices of the apical domain that is capable of 
binding virtually any sequence of hydrophobic residues that might 
be accessible in a partially folded or misfolded polypeptide.'? A 
comparison of the crystal structure of the unbound GroEL mol- 
ecule with that of GroEL bound to several different peptides 
revealed that the binding site on the apical domain of a GroEL 
subunit can locally adjust its positioning when bound to different 
partners. This finding indicates that the binding site has structural 
flexibility that allows it to adjust its shape to fit the shape of the 
particular polypeptide with which it has to interact. 

A number of studies have also suggested that GroEL does 
more than simply provide a passive chamber in which proteins 
can fold without outside interference. In one study, site-directed 
mutagenesis was utilized to modify a key residue, Tyr71 of GroES, 
whose side chain hangs from the ceiling of the folding chamber.”° 
Because of its aromatic ring, tyrosine is a modestly hydrophobic 
residue (Figure 2.26). When Tyr71 was replaced by a positively 
or negatively charged amino acid, the resulting GroEL-GroES 
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FIGURE 3 Conformational change in GroEL. (a) The model on the 
left shows a surface view of the two rings that make up the GroEL 
chaperonin. The drawing on the right shows the tertiary structure of one 
of the subunits of the top GroEL ring. The polypeptide chain can be 
seen to fold into three domains. (b) When a GroES ring (arrow) binds to 
the GroEL cylinder, the apical domain of each GroEL subunit of the 
adjacent ring undergoes a dramatic rotation of approximately 60° with 
the intermediate domain (shown in green) acting like a hinge. The effect 
of this shift in parts of the polypeptide is a marked elevation of the 
GroEL wall and enlargement of the enclosed chamber. 


Source: From Z. Xu, A.L. Horwich, and P. B. Sigler, Nature 388:744, 
1997. © 1997, reprinted with permission Macmillan Publishers, Limited. 


variant exhibited an increased ability to assist the folding of one 
specific foreign polypeptide, the green fluorescent protein (GFP). 
However, substitutions for Tyr71 that improved the ability of 
GroES-GroEL to increase GFP folding made the chaperonin less 
competent to help its natural substrates fold. Thus as the chap- 
eronin became more and more specialized to interact with GFP, it 
lost its general ability to assist folding of proteins having an unre- 
lated structure. This finding suggests that individual amino acids 
in the wall of the folding chamber may participate somehow in 
the folding reaction. Data from another study has suggested that 
binding of a nonnative protein to GroEL is followed by a forced 
unfolding of the substrate protein.2’ FRET (fluorescence reso- 
nance energy transfer) is a technique (discussed in Section 18.3) 
that allows researchers to determine the distance between differ- 
ent parts of a protein molecule at different times during a given 
process. In this study, investigators found that the protein under- 
going folding, in this case Rubisco, bound to the apical domain 
of the GroEL ring in a relatively compact state. The compact 
nature of the bound protein was revealed by the close proximity 
to one another of the FRET tags, which were attached to amino 
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FIGURE 4 A schematic illustration of the proposed steps that occur during the GroEL-GroES-assisted folding of a polypeptide. The GroEL is seen to 
consist of two chambers that have equivalent structures and functions and that alternate in activity. Each chamber is located within one of the two rings 
that make up the GroEL complex. The nonnative polypeptide enters one of the chambers (step 1) and binds to hydrophobic sites on the chamber wall. 
Binding of the GroES cap produces a conformational change in the wall of the top chamber, causing the enlargement of the chamber and release of 
the nonnative polypeptide from the wall into the encapsulated space (step 2). After about 15 seconds have elapsed, the GroES dissociates from the 
complex and the polypeptide is ejected from the chamber (step 3). If the polypeptide has achieved its native conformation, as has the molecule on the 
left, the folding process is complete. If, however, the polypeptide is only partially folded, or is misfolded, it will rebind the GroEL chamber for another 
round of folding. (Note: As indicated, the mechanism of GroEL action is driven by the binding and hydrolysis of ATP, an energy-rich molecule whose 


function is discussed at length in the following chapter.) 


Source: A. L. Horwich, et al., Proc. Nat'l. Acad. Sci. U.S.A. 96:11037, 1999. 


acids located at opposite ends of the Rubisco chain. Then, during 
the conformational change that enlarges the volume of the 
GroEL cavity (Figure 3), the bound Rubisco protein was forcibly 
unfolded, as evidenced by the increased distance between the 
two tagged ends of the molecule. This study suggests that the 
Rubisco polypeptide is taken completely back to the unfolded 
state, where it is given the opportunity to refold from scratch. This 
action should help prevent the nonnative protein from becoming 
trapped permanently in a misfolded state. In other words, each 
individual visit to a GroEL-GroES chamber provides an all-or- 
none attempt to reach the native state, rather than just one stage 
in a series of steps in which the protein moves closer to the native 
state with each round of folding. Recent reviews of molecular 
chaperones can be found in References 22-23. 

Keep in mind that molecular chaperones do not convey 
information for the folding process but instead prevent proteins 
from veering off their correct folding pathway and finding them- 
selves in misfolded or aggregated states. Just as Anfinsen discov- 
ered decades ago, the three-dimensional structure of a protein is 
determined by its amino acid sequence. 
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2.15 Proteomics and Interactomics 


With all of the attention on genome sequencing in recent years, it is 
easy to lose sight of the fact that genes are primarily information 
storage units, whereas proteins orchestrate cellular activities. Genome 
sequencing provides a kind of “parts list.” The human genome prob- 
ably contains between 20,000 and 22,000 genes, each of which can 
potentially give rise to a variety of different proteins.” To date, only a 
fraction of these molecules have been characterized. 


Proteomics 


The entire inventory of proteins that is produced by an organism, 
whether human or otherwise, is known as that organism's proteome. 
The term proteome is also applied to the inventory of all proteins that 
are present in a particular tissue, cell, or cellular organelle. Because of 
the sheer numbers of proteins that are currently being studied, inves- 
tigators have sought to develop techniques that allow them to deter- 
mine the properties or activities of a large number of proteins in a 
single experiment. A new term—proteomics—was coined to 
describe the expanding field of protein biochemistry. This term car- 
ries with it the concept that advanced technologies and high-speed 
computers are used to perform large-scale studies on diverse arrays of 
proteins. This is the same basic approach that has proven so successful 
over the past decade in the study of genomes. But the study of prot- 
eomics is inherently more difficult than the study of genomics because 
proteins are more difficult to work with than DNA. In physical terms, 
one gene is pretty much the same as all other genes, whereas each 
protein has unique chemical properties and handling requirements. 
In addition, small quantities of a particular DNA segment can be 
expanded greatly using readily available enzymes, whereas protein 
quantities cannot be increased. This is particularly troublesome when 
one considers that many of the proteins regulating important cellular 
processes are present in only a handful of copies per cell. 

Traditionally, protein biochemists have sought to answer a 
number of questions about particular proteins. These include: What 
specific activity does the protein demonstrate in vitro, and how does 
this activity help a cell carry out a particular function such as cell 
locomotion or DNA replication? What is the protein’s three-dimen- 
sional structure? When does the protein appear in the development 
of the organism and in which types of cells? Where in the cell is it 
localized? Is the protein modified after synthesis by the addition of 
chemical groups (e.g., phosphates or sugars) and, if so, how does this 
modify its activity? How much of the protein is present, and how 
long does it survive before being degraded? Does the level of the 
protein change during physiologic activities or as the result of dis- 
ease? Which other proteins in the cell does it interact with? Biologists 
have been attempting to answer these questions for decades but, for 
the most part, they've been doing it one protein at a time. Proteomics 
researchers attempt to answer similar questions on a more compre- 
hensive scale using large-scale (or high-throughput) techniques to 
catalog the vast array of proteins produced by a particular cell. 

The key technology in proteomics is mass spectrometry, a 
method to probe the chemical structure of an unknown sample. As 


°There are a number of ways that a single gene can give rise to more than one poly- 
peptide. Two of the most prominent mechanisms, alternative splicing and posttrans- 
lational modification, are discussed in other sections of the text. It can also be noted 
that many proteins have more than one distinct function. Even myoglobin, which has 
long been studied as an oxygen-storage protein, has recently been shown to be 
involved in the conversion of nitric oxide (NO) to nitrate (NO3). 


discussed in further detail in Section 18.15, mass spectrometry isa 71 


technique to determine the precise mass of a molecule or fragment 
of a molecule, which can then be used to identify that molecule. 
Suppose that we wanted to identify an unknown protein that we 
have in a test tube. The protein is first digested into peptides with the 
enzyme trypsin. When these peptides are introduced into a mass 
spectrometer, they are converted into gaseous ions and separated 
according to their mass/charge (m/z) ratio. The results are displayed 
as a series of peaks of known m/z ratio, such as that shown in 
FIGURE 2.47. The pattern of peaks constitutes a highly characteristic 
peptide mass fingerprint of that protein. But how can a protein be 
identified based on its peptide mass fingerprint? 

The answer comes from genomics. Once a genome has been 
sequenced, the amino acid sequences of encoded proteins can be 
predicted. This list of “virtual proteins” can then be subjected to a 
theoretical trypsin digestion and the masses of the resulting virtual 
peptides calculated and entered into a database. Once this has been 
done, the actual peptide masses of a purified protein obtained by the 
mass spectrometer can be compared using a computer to the masses 
predicted by theoretical digests of all polypeptides encoded by the 
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FIGURE 2.47 Identifying proteins by mass spectrometry. A protein is 
isolated from a cell or other sample and subjected to digestion by the 
enzyme trypsin. The peptide fragments are then introduced into a mass 
spectrometer where they are ionized and separated according to their 
mass/charge (m/z) ratio. The separated peptides appear as a pattern of 
peaks whose precise m/z ratio is indicated. A comparison of these ratios to 
those obtained by a theoretical digest of virtual proteins encoded by the 
genome allows researchers to identify the protein being studied. In this 
case, the MS spectrum is that of horse myoglobin lacking its heme group. 


Source: Data reprinted from J. R. Yates, Methods Enzymol. 271:353, 1996. 
Copyright 1996, with permission from Elsevier. 
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jected to mass spectrometry can be directly identified based on this 
type of database search. Mass spectrometers are not restricted to 
handling one purified protein at a time, but are also capable of 
analyzing proteins present in complex mixtures (Section 18.15). By 
analyzing the total protein extracted from a cell by mass spectrometry, 
it is possible to determine a list of all the proteins present in the cell. 
This list is known as the proteome of the cell. Proteomic studies are 
particularly useful when two different samples are compared to see 
how the protein composition changes over time. For example, the 
proteome can be analyzed before and after the secretion of a hor- 
mone within the body, after taking a drug, or during a particular 
disease. The changes in the abundance of different proteins during 
such transitions can provide valuable clues about how cell function 
is changing in a given situation. 

Proteomics is playing an increasingly important role in advanc- 
ing the practice of medicine. It is thought that most human diseases 
leave telltale patterns (or biomarkers) among the thousands of pro- 
teins present in the blood or other bodily fluids. The simplest way to 
determine whether a particular biomarker protein characteristic of a 
disease is present in a blood or urine sample, and how much of that 
protein is present, is to measure the protein's interaction with a spe- 
cific antibody. This is the basis, for example, of the PSA test used in 
routine screening of men for prostate cancer. PSA is a protein that is 
found in the blood of normal men, but is present at elevated levels in 
individuals with prostate cancer. PSA levels are determined by meas- 
uring the amount of protein in the blood that binds to anti-PSA anti- 
bodies. The most challenging hurdle in developing this type of 
diagnostic test is knowing what protein will act as the most reliable 
biomarker. This is where proteomics comes into play. Many efforts 
have been made to compare the proteins present in the blood of 
healthy individuals with those present in the blood of persons suffer- 
ing from various diseases, especially cancer. Initially, the results of 
these biomarker searches were unreliable in that the findings of one 
research group could not be duplicated by the efforts of other groups. 
The primary difficulty stems from the fact that human blood serum is 
such a complex solution containing thousands of proteins that range 
in abundance over 9 or 10 orders of magnitude. But as proteomic and 
mass spectrometry technology improves, the information about com- 
plex samples becomes richer and richer. Already, proteomics has been 
used to reveal important biomarkers for disease. For example, the 
OVA] blood test for ovarian cancer, which detects a collection of bio- 
markers using antibody-based tests, was invented using data from 
proteomic analysis of a large number of patient samples. The OVA1 
test is mainly used for cancers that have already been detected, in 
order to provide more information about the tumor before surgery. It 
is hoped that, one day, it will be possible to use a single blood test to 
reveal the existence of early-stage heart, liver, or kidney disease that 
can be treated before it becomes a life-threatening condition. 

Protein separation and mass spectrometric techniques don't tell 
us anything about a protein’s function. Researchers have been work- 
ing to devise techniques that allow protein function to be determined 
on a large scale, rather than one protein at a time. Several new tech- 
nologies have been developed to accomplish this mission; we will 
consider only one—the use of genome-wide RNA interference 
(RNAi) screens. RNAi is a cellular process by which cells produce 
small RNAs (called siRNAs) that bind to specific mRNAs and inhibit 
the translation of these mRNAs into proteins. This phenomenon and 
its use are discussed in detail in Sections 11.10 and 18.25. For the 
present purpose we will simply note that researchers can synthesize a 


collection (library) of siRNAs that are capable of inhibiting the trans- 
lation of virtually any mRNA that is produced by a genome. Each 
mRNA represents the expression of a specific gene encoding a par- 
ticular protein; therefore, one can find out which proteins are involved 
in a particular cellular process by determining which siRNAs inter- 
fere wth that process. RNAi can be used in large-scale screens of the 
whole genome and can also be used to test the function of individual 
genes or sets of genes. The combination of RNAi with proteomics is 
extremely powerful. As soon as a proteomic analysis suggests that the 
level of a particular protein changes during a process of interest, the 
expression of that protein can be blocked by an appropriate siRNA to 
ask whether its function is relevant for that process. 


Interactomics 


Most investigators who study protein-protein interactions want to 
know whether one protein they are working with, call it protein X, inter- 
acts physically with another protein, call it protein Y. Several techniques 
can be used to test for an interaction between any particular pair of 
proteins, as discussed in Section 18.12. However, in recent years, a num- 
ber of research teams have set out to study protein-protein interactions 
on a global scale. For example, one might want to know all of the interac- 
tions that occur among the 6000 or so proteins encoded by the genome 
of the budding yeast Saccharomyces cerevisiae. As is true for an increas- 
ing number of organisms, the entire genome of this yeast has been 
sequenced, and virtually every gene within the genome is available as an 
individual DNA segment that can be cloned and used as desired. Testing 
for potential interactions between all of these proteins requires methods 
that can be easily automated and performed by robots, but that are also 
reliable and sensitive enough to detect most of the true interactions 
without producing too many false interactions in the process. One such 
method that has been widely adopted is TAP-tag mass spectrometry. In 
this approach, DNA from a gene of interest is fused to DNA encoding a 
protein tag called a TAP tag that is easily purified using affinity chroma- 
tography methods (See the Experimental Walkthrough Video: Purification 
of proteins using the TAP tag and Chapter 18). This TAP-tagged gene is 
expressed in a cell and then the cell is broken open and the TAP-tagged 
protein is purified, carrying any interacting proteins along with it. The 
set of proteins that are co-purified with the TAP-tagged protein is then 
identified by mass spectrometry as in Figure 2.47. This process is 
repeated for each gene in the genome, eventually producing a map that 
shows all of the proteins that co-purify with each other and therefore 
presumably interact inside the cell. This complete set of interactions is 
called the “interactome’” of the cell. 

The results from large-scale protein-protein interaction studies 
can be presented in the form of a network, such as that shown in 
FIGURE 2.48. This figure displays the potential binding partners of 
the various yeast proteins that contain an SH3 domain (see Figure 
2.42a) and illustrates the complexities of such interactions at the 
level of an entire organism. Those proteins that have multiple bind- 
ing partners, such as Las17 (situated near the center of Figure 2.48), 
are referred to as hubs of the protein interaction network. Hub pro- 
teins are more likely than non-hub proteins to be essential proteins, 
that is, proteins that the organism cannot survive without. Some hub 
proteins have several different binding interfaces and are capable of 
binding a number of different binding partners at the same time. In 
contrast, other hubs have a single binding interface, which is capable 
of binding several different partners, but only one at a time. Examples 
of each of these types of hub proteins are illustrated in FIGURE 2.49. 
The hub protein depicted in Figure 2.49a plays a central role in the 
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FIGURE 2.48 A network of protein-protein interactions. Each red line 
represents an interaction between two yeast proteins, which are indicated 
by the named black dots. In each case, the arrow points from an SH3 
domain protein to a target protein with which it can bind. The 59 interac- 
tions depicted here were detected using two different types of techniques 
that measure protein-protein interactions. (See Nature 417:399-403, for 
discussion of the validity of protein-protein interaction studies.) 

Source: From A.H.Y. Tong, et al., Science 295:323, 2002, Copyright © 2002. 
Reprinted with permission from AAAS. 


process of gene expression, while that shown in Figure 2.49b plays an 
equally important role in the process of cell division. Overall, it is 
estimated that, on average, each protein encoded in the genome of a 
eukaryotic organism interacts with about five different protein part- 
ners. According to this estimate, human proteins would engage in 
roughly 100,000 different interactions. 

It is important to remember, when viewing these network dia- 
grams, that any large-scale automated experiment will contain some 
errors. For interactome data, some true interactions will be missed, 
for example because the interaction is too weak to survive the puri- 
fication process, or because the TAP tag disrupts an interacting 
domain. Likewise, some of the interactions reported in the analysis 
may not actually represent true interactions in the cell. For example 
if a protein is present as a contaminant when purifying a particular 
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protein complex, it would be incorrectly reported as interacting with 73 


the TAP-tagged protein. It is therefore best to view interactome data- 
sets as showing potential interactions that can serve as a guide for 
future experiments using more direct means. 

Aside from obtaining a lengthy list of potential interactions, what 
do we learn about cellular activities from these types of large-scale 
studies? Genome-sequencing projects have provided scientists with 
the amino acid sequences of a huge number of proteins whose very 
existence was previously unknown. What do these proteins do? One 
approach to determining a protein’s function is to identify the proteins 
with which it associates. If, for example, a known protein has been 
shown to be involved in DNA replication, and an unknown protein is 
found to interact with the known protein, then it is likely that the 
unknown protein is also part of the cell’s DNA-replication machinery. 
Thus, regardless of their limitations, these large-scale protein interac- 
tion studies provide the starting point to infer the function of 
unknown genes based on their possible interaction partners. 


REVIEW | 


1. Which of the two methods discussed in this section, 
proteomics or interactomics, gives us information 
about the primary structure of proteins? Which gives 
information about quaternary structure of protein 
complexes? 
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2.16 Protein Engineering 


Advances in molecular biology have created the opportunity to design 
and mass-produce novel proteins that are different from those made 
by living organisms. It is possible with current DNA-synthesizing 
techniques to create an artificial gene that can be used in the produc- 
tion of a protein having any desired sequence of amino acids. 
Polypeptides can also be synthesized from “scratch” using chemical 
techniques. This latter strategy allows researchers to incorporate build- 
ing blocks other than the 20 amino acids that normally occur in nature. 


FIGURE 2.49 Protein-protein interactions of hub proteins. (a) The enzyme RNA polymerase II, which synthesizes messenger RNAs in the cell, binds a 
multitude of other proteins simultaneously using multiple interfaces. (b) The enzyme Cdc28, which phosphorylates other proteins as it regulates the cell division 
cycle of budding yeast. Cdc28 binds a number of different proteins (CIn1-Cln3) at the same interface, which allows only one of these partners to bind at a time. 


Source: From Damien Devos and Robert B. Russell, Curr. Opin. Struct. Biol. 17:373, © 2007, with permission of Elsevier. 
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The problem with these types of engineering efforts is in know- 
ing which of the virtually infinite variety of possible proteins one 
could manufacture might have some useful function. Consider, for 
example, a pharmaceutical company that wanted to manufacture a 
therapeutic protein that would bind to the surface of the AIDS or 
influenza virus. Assume that computer simulation programs could 
predict the shape such a protein should have to bind to the viral sur- 
face. What sequence of amino acids strung together would produce 
such a protein? The answer requires detailed insight into the rules 
governing the complex relationship between a protein’s primary 
structure and its tertiary structure. FIGURE 2.50 illustrates that pro- 
tein biochemists now have the knowledge that allows them to con- 
struct a protein capable of binding to the surface of another protein, 
in this case the hemagglutinin (HA) protein that was present in the 
reconstructed 1918 influenza virus (page 24). HA protein is used by 
the virus to gain entry to a human cell, so that inhibition of HA 
could, in theory, prevent viral infection. Figure 2.50a shows the HA 
protein in proximity to a small engineered protein (in green). This 
engineered protein is capable of binding to a hydrophobic patch on 
the surface of the HA protein with high affinity. Figure 2.50b shows 
a closer view of the interface between the targeted portion of the HA 
protein (gray) and the binding surface of the designed protein (pur- 
ple). It can be seen that side chains from the designed protein inter- 
act in highly specific ways with sites on the o helix of HA. This 
peptide was found to inhibit the function of HA protein, potentially 
pointing the way to future efforts to developing HA binding peptides 
as antiviral drugs. 


Production of Novel Proteins 


You might think that designing a protein from “scratch” that is capa- 
ble of catalyzing a given chemical reaction—that is, designing an 
enzyme—would be far beyond the capability of present-day biotech- 
nology. Given the magnitude of the task, it came as a surprise when 
researchers reported in 2008 that they had successfully designed and 
produced artificial proteins that were capable of catalyzing two differ- 
ent organic reactions, neither of which was catalyzed by any known 
natural enzyme. One of the reactions involved breaking a carbon- 
carbon bond, and the other the transfer of a proton from a carbon 
atom. These protein architects began by choosing a catalytic mecha- 
nism that might accelerate each chosen reaction and then used 
computer-based calculations to construct an idealized space in which 
amino acid side chains were positioned (forming an active site) to 
accomplish the task. They then searched among known protein struc- 
tures to find ones that might serve as a framework or scaffold that 
could hold the active site they had designed. To transform the 
computer models into an actual protein, they used computational 
techniques to generate DNA sequences that had the potential to 
encode such a protein. The proposed DNA molecules were synthesized 
and introduced into bacterial cells where the proteins were manufac- 
tured. The catalytic activities of the proteins were then tested. Those 
proteins that showed the greatest promise were then subjected to a 
process of test-tube evolution; the proteins were mutated to create a 
new generation of altered proteins, which could in turn be screened 
for enhanced activity. Eventually, the team obtained proteins that 
could accelerate the rates of reaction as much as one million times 
that of the uncatalyzed reaction. While this is not a rate of enhance- 
ment that would fill a natural enzyme with pride, it is a remarkable 
accomplishment for a team of biochemists. It suggests, in fact, that 
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FIGURE 2.50 The computational design of a protein that is capable of 
binding specifically to the surface of another protein. (a) The computa- 
tionally designed protein is shown in green and its target protein (the HA 
protein from the H1N1 1918 influenza virus) is shown on the left in multiple 
colors. The predicted structure of the designed protein fits closely with that 
of the actual protein that was generated from the predicted sequence. 

(b) The actual interface of the targeted hydrophobic helix of the HA protein 
(gray) and the designed protein (purple). Side chains of the designed 
protein are seen to interact with sites on the HA helix. 

Source: (a) From Sarel J. Fleishman, et al., Science 332:820, 2011, image 
courtesy of David Baker; (6) From Bryan S. Der and Brian Kuhlman, Science 
332:801, 2011, both © 2011, reprinted with permission of AAAs. 


scientists will ultimately be able to construct proteins from scratch 
that will be capable of catalyzing virtually any chemical reaction. 

An alternate approach to the production of novel proteins has 
been to modify those that are already produced by cells. Recent 
advances in DNA technology have allowed investigators to isolate an 
individual gene from human chromosomes, to alter its information 
content in a precisely determined way, and to synthesize the modi- 
fied protein with its altered amino acid sequence. This technique, 
which is called site-directed mutagenesis (Section 18.25), has 
many different uses, both in basic research and in applied biology. If, 
for example, an investigator wants to know about the role of a par- 
ticular residue in the folding or function of a polypeptide, the gene 
can be mutated in a way that substitutes an amino acid with different 
charge, hydrophobic character, or hydrogen-bonding properties. 
The effect of the substitution on the structure and function of the 


modified protein can then be determined. As we will see throughout 
this textbook, site-directed mutagenesis has proven invaluable in the 
analysis of the specific functions of minute parts of virtually every 
protein of interest to biologists. 

Site-directed mutagenesis is also used to modify the structure of 
clinically useful proteins to bring about various physiological effects. 
For example, the drug Somavert, which was approved by the FDA in 
2003, is a modified version of human growth hormone (GH) con- 
taining several alterations. GH normally acts by binding to a receptor 
on the surface of target cells, which triggers a physiological response. 
Somavert competes with GH in binding to the GH receptor, but 
interaction between drug and receptor fails to trigger the cellular 
response. Somavert is prescribed for the treatment of acromegaly, a 
disorder that results from excess production of growth hormone. 


Structure-Based Drug Design 


The production of new proteins is one clinical application of recent 
advances in molecular biology; another is the development of new 
drugs that act by binding to known proteins, thereby inhibiting their 
activity. Drug companies have access to chemical “libraries” that 
contain millions of different organic compounds that have been 
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either isolated from plants or microorganisms or chemically synthe- 75 


sized. One way to search for potential drugs is to expose the protein 
being targeted to combinations of these compounds and determine 
which compounds, if any, happen to bind to the protein with reason- 
able affinity. An alternate approach, called structure-based drug 
design, relies upon knowledge of the structure of the protein target. 
If the tertiary structure of a protein has been determined, research- 
ers can use computers to design “virtual” drug molecules whose size 
and shape might allow them to fit into the apparent cracks and crev- 
ices of the protein, rendering it inactive. 

We can illustrate both of these approaches by considering the 
development of the drug Gleevec (generic name Imatinib), as 
depicted in FIGURE 2.51a. Introduction of Gleevec into the clinic has 
revolutionized the treatment of a number of relatively rare cancers, 
most notably that of chronic myelogenous leukemia (CML). As dis- 
cussed at length in Chapters 15 and 16, a group of enzymes called 
tyrosine kinases are often involved in the transformation of normal 
cells into cancer cells. Tyrosine kinases catalyze a reaction in which 
a phosphate group is added to specific tyrosine residues within a 
target protein, an event that may activate or inhibit the target pro- 
tein. The development of CML is driven almost single-handedly by 
the presence of an overactive tyrosine kinase called ABL. 
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FIGURE 2.51 Development of a protein-targeting drug, such as 
Gleevec. (a) Typical steps in drug development. In step 1, a protein (e.g., 
ABL) has been identified that plays a causative role in the disease. This 
protein is a likely target for a drug that inhibits its enzymatic activity. In 

step 2, the protein is incubated with thousands of compounds in a search 
for ones that bind with reasonable affinity and inhibit its activity. In step 3, 
one such compound (e.g, 2-phenylaminopyrimidine in the case of ABL) has 
been identified. In step 4, knowledge of the structure of the target protein is 
used to make derivatives of the compound (e.g., Gleevec) that have greater 
binding affinity and thus can be used at lower concentrations. In step 5, the 


compound in question is tested in preclinical experiments for toxicity and 
efficacy (level of effectiveness) in vivo. Preclinical experiments are typically 
carried out on cultured human cells (step 5a) (e.g., those from patients with 
CML) and laboratory animals (step 5b) (e.g., mice carrying transplants of 
human CML cells). If the drug appears safe and effective in animals, the 
drug is tested in clinical trials (step 6) as discussed on page 65. (b) The 
structure of Gleevec. The blue portion of the molecule indicates the 
structure of the compound 2-phenylaminopyrimidine that was initially 
identified as an ABL kinase inhibitor. (c,d) The structure of the catalytic 
domain of ABL in complex (c) with Gleevec (shown in yellow) and (d) with a 
second-generation inhibitor called Dasatinib. Gleevec binds to the inactive 
conformation of the protein, whereas Dasatinib binds to the active confor- 
mation. Both binding events block the activity that is required for the cell's 
cancerous phenotype. Dasatinib is effective against most cancer cells that 
have become resistant to the action of Gleevec. 


Source: (c & d): From Ellen Weisberg et al., courtesy of James D. Griffin, 
Nature Revs. Cancer 7:353, 2007 © 2007, reprinted by permission from 
Macmillan Publishers Limited. 
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During the 1980s researchers identified a compound called 
2-phenylaminopyrimidine that was capable of inhibiting tyrosine 
kinases. This compound was discovered by randomly screening a 
large chemical library for compounds that exhibited this particular 
activity (Figure 2.51a). As is usually the case in these types of blind 
screening experiments, 2-phenylaminopyrimidine would not have 
madea very effective drug. For one reason, it was only a weak enzyme 
inhibitor, which meant it would have had to be used in very large 
quantities. 2-Phenylaminopyrimidine is described as a lead com- 
pound, a starting point from which usable drugs might be developed. 
Beginning with this lead molecule, compounds of greater potency 
and specificity were synthesized using structure-based drug design. 
One of the compounds to emerge from this process was Gleevec 
(Figure 2.51b), which was found to bind tightly to the inactive form 
of the ABL tyrosine kinase and prevent the enzyme from becoming 
activated, which is a necessary step if the cell is to become cancerous. 
The complementary nature of the interaction between the drug and 
its enzyme target is shown in Figure 2.51c. Preclinical studies dem- 
onstrated that Gleevec strongly inhibited the growth in the labora- 
tory of cells from CML patients and that the compound showed no 
harmful effects in tests in animals. In the very first clinical trial of 
Gleevec, virtually all of the CML patients went into remission after 
taking once-daily doses of the compound. Gleevec has gone on to 
become the primary drug prescribed for treatment of CML, but this 
is not the end of the story. Many patients taking Gleevec eventually 
experience a recurrence of their cancer when the ABL kinase mutates 
to become resistant to the drug. In such cases, the cancer can con- 
tinue to be suppressed by treatment with more recently designed 
drugs that are capable of inhibiting Gleevec-resistant forms of the 
ABL kinase. One of these newer (second-generation) ABL kinase 
inhibitors is shown bound to the protein in Figure 2.51d. 
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1. What are some of the ways that knowledge of protein 


structure can help with developing new drugs? 
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2.17 Protein Adaptation 
and Evolution 


Adaptations are traits that improve the likelihood that an organism 
will survive in a particular environment. Proteins are biochemical 
adaptations that are subject to natural selection and evolutionary 
change in the same way as other types of characteristics, such as eyes 
or skeletons. This is best revealed by comparing evolutionarily 
related (homologous) proteins in organisms living in very different 
environments. For example, the proteins of halophilic (salt-loving) 
archaebacteria possess amino acid substitutions that allow them to 
maintain their solubility and function at very high cytosolic salt con- 
centrations (up to 4 M KCl). Unlike its counterpart in other organ- 
isms, the surface of the halophilic version of the protein malate 
dehydrogenase, for example, is coated with aspartic and glutamic 
acid residues whose carboxyl groups can compete with the salt for 
water molecules (FIGURE 2.52). 

Homologous proteins isolated from different organisms can 
exhibit virtually identical shapes and folding patterns, but show strik- 
ingly divergent amino acid sequences. The greater the evolutionary 
distance between two organisms, the greater the difference in the 


FIGURE 2.52 Distribution of polar, charged amino acid residues in the 
enzyme malate dehydrogenase from a halophilic archaebacterium. Red 
balls represent acidic residues, and blue balls represent basic residues. The 
surface of the enzyme is seen to be covered with acidic residues, which 
gives the protein a net charge of -156, and promotes its solubility in 
extremely salty environments. For comparison, a homologous protein from 
the dogfish, an ocean-dwelling shark, has a net charge of +16. 

Source: From O. Dym, M. Mevarech, and J. L. Sussman, Science 267:1345, 
© 1995, reprinted with permission of AAAS. 


amino acid sequences of their proteins. In some cases, only a few key 
amino acids located in a critical portion of the protein will be present 
in all of the organisms from which that protein has been studied. In one 
comparison of 226 globin sequences, only two residues were found to 
be absolutely conserved in all of these polypeptides; one is a histidine 
residue that plays a key role in the binding and release of O,. These 
observations indicate that the secondary and tertiary structures of pro- 
teins change much more slowly during evolution than their primary 
structures. This does not mean that the conformation of a protein can- 
not be affected in a major way by simple changes in primary structure. 
An example of such a change is shown in FIGURE 2.53. In this case, an 
amino acid substitution was experimentally introduced into a protein 
that completely altered the conformation of a small domain within a 
large protein molecule. The polypeptide on the left, which has a leucine 
at position 45, has a conformation consisting of a bundle of three 
a helices, whereas the polypeptide on the right, which has a tyrosine at 
this position, has a conformation that contains a single o helix and a 
four-stranded B-sheet. Ifa mutation having an effect of this magnitude 
happened to occur in nature, it might result in the formation of a pro- 
tein with new functional properties and thus could be responsible for 
generating the ancestral form of an entirely new family of proteins. 

We have seen how evolution has produced different versions of 
proteins in different organisms, but it has also produced different 
versions of proteins in individual organisms. Take a particular pro- 
tein with a given function, such as globin or collagen. Several differ- 
ent versions of each of these proteins are encoded by the human 
genome. In most cases, different versions of a protein, which are 
known as isoforms, are adapted to function in different tissues or at 
different stages of development. For example, humans possess six dif- 
ferent genes encoding isoforms of the cytoskeletal protein actin. Two 
of these isoforms are found in smooth muscle, one in skeletal muscle, 
one in heart muscle, and two in virtually all other types of cells. 

Now that a large number of amino acid sequences and tertiary 
structures of proteins have been reported, it is clear that most proteins 
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FIGURE 2.53 The dramatic effect on conformation that can result from 
a single amino acid substitution. |n this case the switch between a leucine 
and a tyrosine at a critical position within this 56-amino acid polypeptide 
chain results in a transformation of the entire fold of the backbone of this 
polypeptide. This single substitution causes 85 percent of the amino acid 
residues to change their secondary structure. The spatial disposition of the 
two alternate side chains, which brings about this conformational shift, is 
shown in red in the model structures. The N-terminal amino acids are 
shown in orange and the C-terminal amino acids in blue. 

Source: From Patrick A. Alexander et al., Courtesy of Philip N. Bryan, Proc. 
Nat'l Acad. Sci. U.S.A. 106:21153, 2009, Fig. 6. © 2009 National Academy 
of Sciences. 


are members of much larger families (or superfamilies) of related 
molecules. The genes that encode the various members of a protein 
family are thought to have arisen from a single ancestral gene that 
underwent a series of duplications during the course of evolution (see 
Figure 10.23). Over long periods of time, the nucleotide sequences of 
the various copies diverge from one another to generate proteins with 
related structures. Many protein families contain a remarkable vari- 
ety of proteins that have evolved diverse functions. The expansion of 
protein families is responsible for much of the protein diversity 
encoded in the genomes of today’s complex plants and animals. 


REVIEW | 


1. Cana single amino acid change alter the secondary or 
tertiary structure of a protein? 
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2.18 Nucleic Acids 


Nucleic acids are macromolecules constructed out of long chains 
(strands) of monomers called nucleotides. Nucleic acids function 
primarily in the storage and transmission of genetic information, but 
they may also have structural or catalytic roles. There are two types 
of nucleic acids found in living organisms, deoxyribonucleic acid 
(DNA) and ribonucleic acid (RNA). DNA serves as the genetic 
material of all cellular organisms, though RNA carries out that role 
for many viruses. In cells, information stored in DNA is used to 
govern cellular activities through the formation of RNA messages. In 
the present discussion, we will examine the basic structure of nucleic 
acids using single-stranded RNA as the representative molecule. 


We will look at the structure of DNA in Chapter 10, where it can be 77 


tied to its central role in the chemical basis of life. 

Each nucleotide in a strand of RNA consists of three parts 
(FIGURE 2.54a): (1) a five-carbon sugar, ribose; (2) a nitrogenous base 
(so called because nitrogen atoms form part of the rings of the mol- 
ecule); and (3) a phosphate group. The sugar and nitrogenous base 
together form a nucleoside, so that the nucleotides of an RNA strand 
are also known as ribonucleoside monophosphates. The phosphate is 
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FIGURE 2.54 Nucleotides and nucleotide strands of RNA. 

(a) Nucleotides are the monomers from which strands of nucleic acid are 
constructed. A nucleotide consists of three parts: a sugar, a nitrogenous 
base, and a phosphate. The nucleotides of RNA contain the sugar ribose, 
which has a hydroxyl group bonded to the second carbon atom. In 
contrast, the nucleotides of DNA contain the sugar deoxyribose, which has 
a hydrogen atom rather than a hydroxyl group attached to the second 
carbon atom. Each nucleotide is polarized, having a 5’ end (corresponding 
to the 5’ side of the sugar) and a 3’ end. (b) Nucleotides are joined 
together to form a strand by covalent bonds that link the 3’ hydroxyl group 
of one sugar with the 5’ phosphate group of the adjoining sugar. 
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FIGURE 2.55 Nitrogenous bases in nucleic acids. Of the four standard 
bases found in RNA, adenine and guanine are purines, and uracil and 
cytosine are pyrimidines. In DNA, the pyrimidines are cytosine and thymine, 
which differs from uracil by a methyl group attached to the ring. 


linked to the 5’ carbon of the sugar, and the nitrogenous base is 
attached to the sugar’s 1’ carbon. During the assembly of a nucleic 
acid strand, the hydroxyl group attached to the 3’ carbon of the sugar 
of one nucleotide becomes linked by an ester bond to the phosphate 
group attached to the 5’ carbon of the next nucleotide in the chain. 
Thus the nucleotides of an RNA (or DNA) strand are connected by 
sugar-phosphate linkages (Figure 2.54b), which are described as 
3'-5'-phosphodiester bonds because the phosphate atom is esterified 
to two oxygen atoms, one from each of the two adjoining sugars. 

A strand of RNA (or DNA) contains four different types of nucle- 
otides distinguished by their nitrogenous base. Two types of bases 
occur in nucleic acids: pyrimidines and purines (FIGURE 2.55). 
Pyrimidines are smaller molecules, consisting ofa single ring; purines 
are larger, consisting of two rings. RNAs contain two different purines, 
adenine and guanine, and two different pyrimidines, cytosine and 
uracil. In DNA, uracil is replaced by thymine, a pyrimidine with an 
extra methyl group attached to the ring (Figure 2.55). 

Although RNAs consist of a continuous single strand, they often 
fold back on themselves to produce molecules having extensive double- 
stranded segments and complex three-dimensional structures. This is 
illustrated by the two RNAs shown in FIGURE 2.56. The RNA whose 
secondary structure is shown in Figure 2.56a is a component of the 
small subunit of the bacterial ribosome (see FIGURE 2.57). Ribosomal 
RNAs are not molecules that carry genetic information; rather, they 
serve as structural scaffolds on which the proteins of the ribosome can 
be attached and as elements that recognize and bind various soluble 
components required for protein synthesis. One of the ribosomal 
RNAs of the large subunit acts as the catalyst for the reaction by which 
amino acids are covalently joined during protein synthesis. RNAs hav- 
ing a catalytic role are called RNA enzymes, or ribozymes. Figure 
2.56b depicts the tertiary structure of the so-called hammerhead 
ribozyme, which is able to cleave its own RNA strand. In both examples 
shown in Figure 2.56, the double-stranded regions are held together by 
hydrogen bonds between the bases. This same principle is responsible 
for holding together the two strands of a DNA molecule. 

Nucleotides are not only important as building blocks of nucleic 
acids, they also have important functions in their own right. Most of 
the energy being put to use at any given moment in any living organism 
is derived from the nucleotide adenosine triphosphate (ATP). 
The structure of ATP and its key role in cellular metabolism are 


(b) 


FIGURE 2.56 RNAs can assume complex shapes. (a) This ribosomal 
RNA is an integral component of the small ribosomal subunit of a bacte- 
rium. In this two-dimensional profile, the RNA strand is seen to be folded 
back on itself in a highly ordered pattern so that most of the molecule is 
double-stranded. (6b) This hammerhead ribozyme, as it is called, is a small 
RNA molecule from a viroid (page 25). The helical nature of the double- 
stranded portions of this RNA can be appreciated in this three-dimensional 
model of the molecule. 

Source: From William G. Scott, et al., Cell 81:993, © 1995, with permission 
from Elsevier. 


discussed in the following chapter. Guanosine triphosphate (GTP) 
is another nucleotide of enormous importance in cellular activities. 
GTP binds to a variety of proteins (called G proteins) and acts as a 
switch to turn on their activities (see Figure 11.49 for an example). 


FIGURE 2.57 Reconstruction of a ribosome from the cytoplasm of a 
wheat germ cell. This reconstruction is based on high-resolution electron 
micrographs and shows the two subunits of this eukaryotic ribosome, the 
small (40S) subunit on the left and the large (60S) subunit on the right. The 
internal structure of a ribosome is discussed in Sections 11.16 and 11.17. 


Source: From Adriana Verschoor, et al., J. Cell Biol., Vol. 133 (cover 3), 
1996; by copyright permission of Rockefeller University Press. 


REVIEW | 


1. Describe the structure of nucleotides and the manner 
in which these monomers are joined to form a 
polynucleotide strand. Why would it be overly 
simplistic to describe RNA as a single-stranded 
nucleic acid? 


2. Which macromolecules are polymers? What is the 
basic structure of each type of monomer? How do the 
various monomers of each type of macromolecule vary 
among themselves? 


—— 


2.19 The Formation of Complex 
Macromolecular Structures 


To what degree can the lessons learned from the study of protein 
architecture be applied to more complex structures in the cell? Can 
structures, such as membranes, ribosomes, and cytoskeletal ele- 
ments, which consist of different types of subunits, also assemble by 
themselves? How far can subcellular organization be explained sim- 
ply by having the pieces fit together to form the most stable arrange- 
ment? The assembly of cellular organelles is poorly understood, but 
it is apparent from the following examples that different types of 
subunits can self-assemble to form higher-order arrangements. 


The Assembly of Tobacco Mosaic 
Virus Particles 


The most convincing evidence that a particular assembly process is 
self-directed is the demonstration that the assembly can occur out- 
side the cell (in vitro) under physiological conditions when the only 
macromolecules present are those that make up the final structure. In 
1955, Heinz Fraenkel-Conrat and Robley Williams of the University 


of California, Berkeley, demonstrated that TMV particles, which 79 


consist of one long RNA molecule (approximately 6600 nucleotides) 
wound within a helical capsule made of 2130 identical protein sub- 
units (see Figure 1.21), were capable of self-assembly. In their experi- 
ments, they purified TMV RNA and protein separately, mixed them 
together under suitable conditions, and recovered mature, infective 
particles after a short period of incubation. Clearly the two compo- 
nents contain all the information necessary for particle formation. 


The Assembly of Ribosomal Subunits 


Ribosomes, like TMV particles, are made of RNA and protein. Unlike 
the simpler TMV, ribosomes contain several different types of RNA 
and a considerable collection of different proteins. All ribosomes, 
regardless of their source, are composed of two subunits of different 
size. Although ribosomal subunits are often depicted in drawings as 
symmetric structures, in fact they have a highly irregular shape, as 
indicated in Figure 2.57. The large (or 50S) ribosomal subunit of bac- 
teria contains two molecules of RNA and approximately 32 different 
proteins. The small (or 30S) ribosomal subunit of bacteria contains one 
molecule of RNA and 21 different proteins. The structure and function 
of the ribosome are discussed in detail in Sections 11.16 and 11.17. 
One of the milestones in the study of ribosomes came in the 
mid-1960s, when Masayasu Nomura and his co-workers at the 
University of Wisconsin succeeded in reconstituting complete, fully 
functional 30S bacterial subunits by mixing the 21 purified proteins 
of the small subunit with purified small-subunit ribosomal RNA. 
Apparently, the components of the small subunit contain all the 
information necessary for the assembly of the entire particle. Analysis 
of the intermediates that form at different stages during reconstitu- 
tion in vitro indicates that subunit assembly occurs in a sequential 
step-by-step manner that closely parallels the process in vivo. At least 
one of the proteins of the small subunit (S16) appears to function 
solely in ribosome assembly; deletion of this protein from the recon- 
stitution mixture greatly slowed the assembly process but did not 
block the formation of fully functional ribosomes. Reconstitution of 
the large subunit of the bacterial ribosome was accomplished in the 
following decade. It should be kept in mind that although it takes 
approximately 2 hours at 50°C to reconstitute the ribosome in vitro, 
the bacterium can assemble the same structure in a few minutes at 
temperatures as low as 10°C. It may be that the bacterium uses some- 
thing that is not available to the investigator who begins with purified 
components. Assembly of the ribosome within the cell, for example, 
may include the participation of accessory factors that function in 
protein folding, such as the chaperones described in the Experimental 
Pathways. In fact, the formation of ribosomes within a eukaryotic cell 
requires the transient association of many proteins that do not end up 
in the final particle, as well as the removal of approximately half the 
nucleotides of the large ribosomal RNA precursor (Section 11.4). As 
a result, the components of the mature eukaryotic ribosome no 
longer possess the information to reconstitute themselves in vitro. 


REVIEW | 


1. What type of evidence suggests that bacterial 
ribosomal subunits are capable of self-assembly, but 
eukaryotic subunits are not? 


2. What evidence would indicate that a particular 
ribosomal protein had a role in ribosome function 
but not assembly? 
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12, 


. In the polypeptide backbone —C 


. The pH of cola beverages is around 3, while the pH of battery 


acid is around 1. Which has a higher concentration of protons? 
By how much? If the concentration of protons in wine is approxi- 
mately one-tenth that in cola, what is the pH of wine? 
(See Quantitative Tutorial Video). 


. Sickle cell anemia results from a substitution of a valine for a 


glutamic acid. Would you expect a similar effect if the mutation 
were to have placed a leucine at that site? An aspartic acid? 


. Of the following amino acids, glycine, isoleucine, and lysine, 


which would you expect to be the most soluble in an acidic 
aqueous solution? Which the least? 


. How many structural isomers could be formed from a molecule 


with the formula Cs5H19? CyHg? 


Glyceraldehyde is the only three-carbon aldotetrose, and it can 
exist as two stereoisomers. What is the structure of dihydroxyace- 
tone, the only ketotriose? How many stereoisomers does it form? 


. Bacteria are known to change the kinds of fatty acids they pro- 


duce as the temperature of their environment changes. What 
types of changes in fatty acids would you expect as the tempera- 
ture drops? Why would this be adaptive? 


C—N—C—C—_N—C—C 


NHb, identify the a-carbons. 


Which of the following are true? Increasing the pH of a solution 
would (1) suppress the dissociation of a carboxylic acid, (2) increase 
the charge on an amino group, (3) increase the dissociation of a 
carboxylic acid, (4) suppress the charge on an amino group. 


Which of the four classes of amino acids has side chains with the 
greatest hydrogen-bond-forming potential? Which has the great- 
est potential to form ionic bonds? Hydrophobic interactions? 


If the three enzymes of the pyruvate dehydrogenase complex 
existed as physically separate proteins rather than as a complex, 
what effect might this have on the rate of reactions catalyzed by 
these enzymes? 

Would you agree that neither ribonuclease nor myoglobin had 
quaternary structure? Why or why not? 

How many different tripeptides are possible? How many car- 
boxyl terminals of polypeptide chains are present in a molecule 
of hemoglobin? (See Quantitative Tutorial Video). 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


You have isolated a pentapeptide composed of four glycine resi- 
dues and one lysine residue that resides at the C-terminus of the 
peptide. Using the information provided in the legend of Figure 
2.27, if the pK of the side chain of lysine is 10 and the pK of the 
terminal carboxyl group is 4, what is the structure of the peptide 
at pH 7? At pH 12? 

The side chains of glutamic acid (pK 4.3) and arginine (pK 12.5) can 
form an ionic bond under certain conditions. Draw the relevant 
portions of the side chains and indicate whether an ionic bond 
could form at the following: (a) PH 4; (b) pH 7; (c) pH 12; (d) pH 13. 


Would you expect a solution of high salt to be able to denature 
ribonuclease? Why or why not? 


You have read in the Human Perspective that (1) mutations in the 
PRNP gene can make a polypeptide more likely to fold into 
the PrP®* conformation, thus causing CJD and (2) exposure to 
the PrP®* prion can lead to an infection that also causes CJD. 
How can you explain the occurrence of rare sporadic cases of the 
disease in persons who have no genetic propensity for it? 


Persons who are born with Down syndrome have an extra (third) 
copy of chromosome #21 in their cells. Chromosome #21 con- 
tains the gene that encodes the APP protein. Why do you sup- 
pose that individuals with Down syndrome typically develop 
Alzheimer's disease at an early age? 


We saw on page 77 how evolution has led to the existence of 
protein families composed of related molecules with similar 
functions. A few examples are also known where proteins with 
very similar functions have primary and tertiary structures that 
show no evidence of evolutionary relationship. Subtilisin and 
trypsin, for example, are two protein-digesting enzymes (pro- 
teases) that show no evidence they are homologous despite the 
fact that they utilize the same mechanism for attacking their sub- 
strates. How can this coincidence be explained? 


Would you agree with the statement that many different amino 
acid sequences can fold into the same basic tertiary structure? 
What data can you cite as evidence for your position. 


In the words of one scientist: “The first question any structural biol- 
ogist asks upon being told that a new [protein] structure has been 
solved is no longer ‘What does it look like?; it is now What does it 
look like?’” What do you suppose he meant by this statement? 
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Bioenergetics, Enzymes, 
and Metabolism 


YOU ARE NOT WHAT YOU EAT 


The human body is a machine of almost unimaginable 
complexity. Each day, our body expends around 

2000 kcal of energy, which we use to power our move- 
ment, metabolism, and thought. We obtain this energy 
from the foods that we eat, but unlike a car engine that 
can convert a chemical fuel (gasoline) directly into 
mechanical force, our bodies perform a complex series 
of biochemical reactions to convert the nutrients in food 
into the universal currency of biological energy—the 
small molecule ATP. This web of reactions is orches- 
trated by an array of enzymes that funnel the biomol- 
ecules that we ingest into reactions that produce ATP for 
energy as well as reactions that build up the biochemi- 
cals we need as building blocks for cells and tissues. 


Different people have very different eating habits. Low 
carb diets, vegan diets, paleo diets—each has dramatic 
differences in the biochemical composition of the food. 
Yet the people who eat these different diets look the 
same, behave the same, and contain all the same cells 
and molecules. If you eat corn every day, you don’t turn 
into a corn plant. How is the chemical composition of 
our bodies held constant when our intake of food can 
be so variable? Again the answer lies in the network of 
biochemical reactions, orchestrated by enzymes that can 
convert one type of biochemical into another. We aren't 


what we eat: We are what our enzymes make us. Painting by Giuseppe Arcimboldo: Rudolf II of Habsburg as Vertumnus. 
Source: Painting by GiuseppeArcimboldo 
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3.1 The Laws of Thermodynamics 


A living cell bustles with activity. Macromolecules of all types are 
assembled from raw materials, waste products are produced and 
excreted, genetic instructions flow from the nucleus to the cyto- 
plasm, vesicles are moved along the secretory pathway, ions are 
pumped across cell membranes, and so forth. To maintain such a 
high level of activity, a cell must acquire and expend energy. The 
study of the various types of energy transformations that occur in 
living organisms is referred to as bioenergetics. 

Energy is defined as the capacity to do work, that is, the capac- 
ity to change or move something. Thermodynamics is the study of 
the changes in energy that accompany events in the universe. In the 
following sections, we will focus on a set of concepts that allow us to 
predict the direction that events will take and whether an input of 
energy is required to cause the event to happen. However, thermo- 
dynamic measurements provide no help in determining how rapidly 
a specific process will occur or the mechanism used by the cell to 
carry out the process. 


The First Law of Thermodynamics 


The first law of thermodynamics is the law of conservation of 
energy. It states that energy can neither be created nor destroyed. 
Energy can, however, be converted (transduced) from one form to 


FIGURE 3.1 Examples of energy 


transduction. (a) Conversion of 


mechanical energy to electrical energy 
in a wind-turbine, (b) storage of 
electrical energy in a chemical form 
within a battery, (c) conversion of 
electrical energy to mechanical energy 
by an electric, motor, in this case 
driving a fan, (d) conversion of 
chemical energy to mechanical energy 
by muscles, (e) conversion of chemical 
energy to electrical energy in an 
electric eel, which can generate 500 
volts across its electric organs, 

(f) conversion of chemical energy to 
light in bioluminescent organisms. 


Source: (a) pedrosala / Shutterstock. (e) 


another. The transduction of mechanical energy to electrical energy 
occurs in the large wind-turbines now being used to generate energy 
from wind (FIGURE 3.1a), and chemical energy is converted to elec- 
trical energy by gas-powered generators. Energy can also be stored— 
for example, electrical energy can be stored in chemical form using a 
rechargeable battery (Figure 3.1b), and mechanical energy can be 
stored in the spring of a wind-up toy. Stored energy can be trans- 
ported from one place to another—for example, the stored chemical 
energy in gasoline can be carried long distances in tanker trucks. 
Stored energy can then be utilized by transducing it from its storage 
form to some other useful form—for example, electrical energy 
stored in a battery can be converted into mechanical work by an 
electric motor (Figure 3.1c). 

Cells are also capable of energy transduction, storage, and trans- 
port. As discussed in later chapters, the chemical energy stored in cer- 
tain biological molecules, such as ATP, is converted to mechanical 
energy when muscles contract (Figure 3.1d), to electrical energy when 
ions flow across a membrane (Figure 3.1e), or to thermal energy when 
we shiver in the cold. The most important energy transduction in the 
biological world is the conversion of sunlight into chemical energy— 
the process of photosynthesis—which provides the fuel that directly 
or indirectly powers the activities of nearly all forms of life.' 


‘Several communities of organisms are known that are independent of photosynthe- 
sis. These include communities that reside in the hydrothermal vents at the bottom of 
the ocean floor that depend on energy obtained by bacterial chemosynthesis. 


A number of animals, including fireflies and luminous fish, are able 
to convert chemical energy back into light (Figure 3.1f). Regardless 
of the transduction process, however, the total amount of energy in 
the universe remains constant. 

To discuss energy transformations involving matter, we need to 
divide the universe into two parts: the system under study and the 
remainder of the universe, which we will refer to as the surroundings. 
A system can be defined in various ways: It may be a certain space in 
the universe or a certain amount of matter. For example, the system 
may be a living cell. The changes in a system's energy that occur dur- 
ing an event are manifested in two ways—as a change in the heat 
content of the system and in the performance of work. Even though 
the system may lose or gain energy, the first law of thermodynamics 
indicates that the loss or gain must be balanced by a corresponding 
gain or loss in the surroundings, so that the amount in the universe 
as a whole remains constant. The energy of the system is termed the 
internal energy (E), and its change during a transformation is AE 
(delta E). One way to describe the first law of thermodynamics is 
that AE = Q— W, where Q is the heat energy and W is the mechan- 
ical work energy. If a system pushes or pulls against a force F over a 
distance d, then the work done during that movement is W = Fd. 
This definition fits with our intuitive ideas about the word work. 
Imagine lifting a heavy object: If you have to lift it twice as high, you 
do twice as much work, and you also do twice as much work if the 
weight is twice as heavy so that the force of gravity is twice as large. 

Depending on the process, the internal energy of the system at 
the end can be greater than, equal to, or less than its internal energy 
at the start, depending on its relationship to its surroundings 
(FIGURE 3.2). In other words, AE can be positive, zero, or negative. 
Consider a system to be the contents of a reaction vessel. As long as 
there is no change in pressure or volume of the contents, there is no 
work being done by the system on its surroundings, or vice versa. In 
that case, the internal energy at the end of the transformation will be 
greater than that at the beginning if heat is absorbed and less if heat is 
released. Reactions that lose heat are termed exothermic, and ones 
that gain heat are endothermic, and there are many reactions of both 
types. Because AE for a particular reaction can be positive or negative, 
it gives us no information as to the likelihood that a given event will 


(a) (b) 


occur. To determine the likelihood of a particular transformation, we 83 


need to consider some additional concepts. 


The Second Law of Thermodynamics 


The second law of thermodynamics expresses the concept that 
events in the universe have direction; they tend to proceed “down- 
hill” from a state of higher energy to a state of lower energy. Thus, in 
any energy transformation, there is a decreasing availability of 
energy for doing additional work. Rocks fall off cliffs to the ground 
below, and once at the bottom, their ability to do additional work is 
reduced; it is very unlikely that they will lift themselves back to the 
top of the cliff. Similarly, opposite charges normally move together, 
not apart, and heat flows from a warmer to a cooler body, not the 
reverse. Such events are said to be spontaneous, a term that indi- 
cates they are thermodynamically favorable and can occur without 
the input of external energy. 

The concept of the second law of thermodynamics was formu- 
lated originally for heat engines, and the law carried with it the idea 
that it is thermodynamically impossible to construct a perpetual- 
motion machine. In other words, it is impossible for a machine to be 
100 percent efficient, which would be required if it were to continue 
functioning without the input of external energy. Some of the energy 
is inevitably lost as the machine carries out its activity. A similar rela- 
tionship holds true for living organisms. For example, when a giraffe 
browses on the leaves of a tree or a lion preys on the giraffe, much of 
the chemical energy in the food never becomes available to the ani- 
mal having the meal. 

The loss of available energy during a process is a result of a ten- 
dency for the randomness, or disorder, of the universe to increase 
every time there is a transfer of energy. This gain in disorder is meas- 
ured by the term entropy, and the loss of available energy is equal to 
TAS, where AS is the change in entropy between the initial and final 
states. Entropy is associated with the random movements of particles 
of matter, which, because they are random, cannot be made to accom- 
plish a directed work process. According to the second law of thermo- 
dynamics, every event is accompanied by an increase in the entropy 
of the universe. When a sugar cube is dropped into a cup of hot water, 


FIGURE 3.2 A change in a system’s internal 
energy. In this example, the system will be defined 
as a particular leaf of a plant. (a) During the day, 
sunlight is absorbed by photosynthetic pigments in 
the leaf's chloroplasts and used to convert CO; into 
carbohydrates, such as the glucose molecule shown 
in the drawing (which is subsequently incorporated 
into sucrose or starch). As the cell absorbs light, its 
internal energy increases; the energy present in the 
remainder of the universe has to decrease. (b) At 
night, the energy relationship between the cell and 
its surroundings is reversed as the carbohydrates 
produced during the day are oxidized to CO, in the 
mitochondria and the energy is used to run the cell's 
nocturnal activities. 
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FIGURE 3.3 Events are accompanied by an increase in 
the entropy of the universe. (a) A sugar cube contains 
sucrose molecules in a highly ordered arrangement in which 
their freedom of movement is restricted. As the cube 
dissolves, the freedom of movement of the sucrose 
molecules is greatly increased, and their random movement 
causes them to become equally distributed throughout the 
available space. Once this occurs, there will be no further 
tendency for redistribution, and the entropy of the system is 
at a maximum. (6) Sugar molecules spread randomly through 
a solution can be returned to an ordered state, but only if the 
entropy of the surroundings is increased, as occurs when the 
more ordered water molecules of the liquid phase become 
disordered following evaporation. 


for example, there is a spontaneous shift of the mole- 
cules from an ordered state in the crystal to a much 
more disordered condition when the sugar molecules 
are spread throughout the solution (FIGURE 3.3a). As 
the molecules of the sugar cube dissolve into solution, 
their freedom of movement increases, as does the 
entropy of the system. The change from a concentrated 
to a dispersed state results from the random movements 
of the molecules. The sugar molecules eventually spread 
themselves equally through the available volume because 
the state of uniform distribution is the most probable 
state. 

The release of heat, for example, from the oxida- 
tion of glucose within a cell or from the friction gener- 
ated as blood flows through a vessel, is another example 
of an increase in entropy. The release of thermal energy by living 
organisms increases the rate of random movements of atoms and 
molecules; it cannot be redirected to accomplish additional work. 
Because the energy of molecular and atomic movements increases 
with temperature, so too does the entropy. It is only at absolute zero 
(0K), when all movements cease, that the entropy is zero. 

As with other spontaneous events, we must distinguish between 
the system and its surroundings. The second law of thermodynamics 
indicates only that the total entropy in the universe must increase; 
the disorder within one part of the universe (the system) can decrease 
at the greater expense of its surroundings. The dissolved sugar in 
Figure 3.3a can decrease in entropy; it can be recrystallized by evap- 
orating off the water (Figure 3.3b). The consequence of this process, 
however, is an even greater increase in the entropy of the surround- 
ings. The increased freedom of movement of the water molecules in 
the gaseous phase more than balances the decrease in freedom of the 
molecules of the sugar crystals. 

Life operates on a similar principle. Living organisms are able to 
decrease their own entropy by increasing the entropy of their envi- 
ronment. Entropy is decreased in an organism when relatively sim- 
ple molecules, such as amino acids, are ordered into more complex 
molecules, such as the protein myoglobin in a muscle cell. For this to 
occur, however, the entropy of the environment must increase, 
which is accomplished as complex, ordered molecules such as glyco- 
gen stored in liver or muscle tissue are converted into heat and 
smaller, less ordered compounds (such as CO, and H,0) are released 
to the environment. It is this feature of metabolism that allows living 
organisms to maintain such a highly ordered and improbable state— 
at least for a while. 


Liquid phase 
water molecule 


= Sucrose 
molecules 


Gas phase 
water molecule 


Another measure of the energy state of a living organism is pro- 
vided by the information content of its macromolecules. Information 
is a subject that is difficult to define but easy to recognize. Information 
can be measured in terms of the ordered arrangement of a structure's 
subunits. For example, proteins and nucleic acids, in which the spe- 
cific linear sequence of the subunits is highly ordered, are low in 
entropy and high in information content. Maintaining a state of high 
information content (low entropy) requires the input of energy. 
Consider just one molecule of DNA located in one cell in your liver. 
That cell has dozens of different proteins whose sole job is to patrol the 
DNA, looking for damage and repairing it (discussed in Section 13.8). 
Nucleotide damage in an active cell can be so great that, without this 
expenditure of energy, the information content of DNA would 
rapidly deteriorate. 


REVIEW | 


1. Describe the differences between the first and second 
laws of thermodynamics and how, when taken 
together, they are able to describe the direction of 
events that take place in the universe. 


2. How is the maintenance of the ordered living state 
consistent with the second law of thermodynamics? 


3. Describe two examples in which the entropy of a 
system decreases and two examples in which the 
entropy of a system increases. 


eo 


3.2 Free Energy 


Together, the first and second laws of thermodynamics indicate that 
the energy of the universe is constant, but the entropy continues to 
increase toward a maximum. The concepts inherent in the first two 
laws were combined by the American chemist, J. Willard Gibbs, in 
1878 into the expression AH = AG + TAS, where AG (delta G) is the 
change in free energy, that is, the change during a process in the 
energy available to do work; AH is the change in enthalpy or total 
energy content of the system (equivalent to AE for our purposes), 
T is the absolute temperature (K = °C + 273), and AS is the change 
in the entropy of the system. The equation states that the total energy 
change is equal to the sum of the changes in useful energy (AG) and 
energy that is unavailable to do further work (TAS). 

When rearranged and written as AG=AH—TAS, the above 
equation provides a measure of the spontaneity of a particular pro- 
cess. It allows one to predict the direction in which a process will 
proceed and the extent to which that process will occur. All spontane- 
ous energy transformations must have a negative AG; that is, the pro- 
cess must proceed toward a state of lower free energy. The magnitude 
of AG indicates the maximum amount of energy that can be passed 
on for use in another process but tells us nothing about how rapidly 
the process will occur. Processes that can occur spontaneously, that 
is, processes that are thermodynamically favored (have a —AG), are 
described as exergonic. In contrast, if the AG for a given process is 
positive, then it cannot occur spontaneously. Such processes are 
thermodynamically unfavorable and are described as endergonic. 
As we will see, reactions that are normally endergonic can be made 
to occur by coupling them to energy-releasing processes. 

The signs of AH and AS for a given transformation can be posi- 
tive or negative, depending on the relation between the system and 
its surroundings. (Hence, AH will be positive if heat is gained by the 
system and negative if heat is lost; AS will be positive if the system 
becomes more disordered and negative if it becomes more ordered.) 
The counterplay between AH and AS is illustrated by the ice-water 
transformation. The conversion of water from the liquid to the solid 
state is accompanied by a decrease in entropy (AS is negative, as 
illustrated in FIGURE 3.4) and a decrease in enthalpy (AH is negative). 
In order for this transformation to occur (i.e., for AG to be negative), 
AH must be more negative than TAS, a condition that occurs only 
below 0°C. This relationship can be seen from Table 3.1, which indi- 
cates the values for the different terms if one mole of water were to 
be converted to ice at 10°C, 0° C, or —10°C. In all cases, regardless of 
the temperature, the energy level of the ice is less than that of the 
liquid (the AH is negative). However, at the higher temperature, 
the entropy term of the equation (TAS) is more negative than the 
enthalpy term; therefore, the free-energy change is positive, and 
the process cannot occur spontaneously. At 0°C, the system is in 
equilibrium; at —10°C, the solidification process is favored, that is, the 
AG is negative. 


Free-Energy Changes 
in Chemical Reactions 


Now that we have discussed the concept of free energy in general 
terms, we can apply the information to chemical reactions within the 
cell. All chemical reactions within the cell are reversible, and there- 
fore we must consider two reactions occurring simultaneously, one 
forward and the other in reverse. According to the law of mass 
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FIGURE 3.4 When water freezes, its entropy decreases because the 
water molecules of ice exist in a more ordered state. The large number of 
hydrogen bonds that are found in ice make the enthalpy of the water-ice 
transition favorable, while the decreased entropy makes the transition 
unfavorable. Since the entropic contribution to free energy also depends 
on temperature, it takes over at higher temperatures and makes liquid 
water more favorable. At lower temperatures, the entropic term of the free 
energy becomes smaller in magnitude until eventually the favorable 
enthalpy of the hydrogen bonds dominates, at which point ice becomes 
more energetically favorable than liquid water. 
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Source: This work is licensed under the Creative Commons Attribution- 
ShareAlike 3.0 License. 


TABLE 3.1 Thermodynamics of the Water-Ice Transformation 


Temp. AE AH AS TAS AG 
(°C) (cal/mol)  (cal/mol) = (cal/mol-°C) — (cal/mol) (cal/mol) 
-10 —1343 -1343 -4.9 —1292 -51 

0 —1436 -1436 —5.2 -1436 0 
+10 -1529 -1529 —5.6 —1583 154 


Source: I. M. Klotz, Energy Changes in Biochemical Reactions, Academic Press, 1967. 
Reprinted with permission from Elsevier. 


action, the rate of a reaction is proportional to the concentration of 
the reactants. Consider, for example, the following reaction: 


A+B—=C+D 


The rate of the forward reaction is directly proportional to the product 
of the molar concentrations of A and B . The rate of the forward reac- 
tion can be expressed as k,[A][B], where k, is a rate constant for the 
forward reaction. The rate of the backward reaction equals k,[C][D]. 
All chemical reactions proceed, however slowly, toward a state of equi- 
librium, that is, toward a point at which the rates of the forward and 
backward reactions are equal. At equilibrium, the same number of A 
and B molecules are converted into C and D molecules per unit time 


as are formed from them. At equilibrium, therefore, 
k,[A] [B] = k,[C][D] 


which can be rearranged to 


In other words, at equilibrium there is a predictable ratio of the 
concentration of products to the concentration of reactants. This ratio, 
which is equal to k,/k,, is termed the equilibrium constant, Keg. 
The equilibrium constant allows one to predict the direction 
(forward or reverse) in which the reaction is favored under a given 
set of conditions. Suppose, for example, that we are studying the 
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above reaction, and we have just mixed the four components (A, B, 
C, D) so that each is present at an initial concentration of 0.5 M. 


[C][D] _ [0.5)[0.5] _ 
[A][B] [0.5] [0.5] 


The direction in which this reaction will proceed depends on the 
equilibrium constant. If the K,, is greater than 1, the reaction will 
proceed at a greater rate toward the formation of the products C and D 
than in the reverse direction. For example, if the K,, happens to be 
9.0, then the concentration of the reactants and products at equilib- 
rium in this particular reaction mixture will be 0.25 M and 0.75 M, 
respectively. 


[C][D] _ [0.75][0.75] _ 
[A][B] _ [0.25] [0.25] 


If, on the other hand, the K,, is less than 1, the reverse reaction will 
proceed at a greater rate than the forward reaction, so that the con- 
centration of A and B will rise at the expense of C and D. It follows 
from these points that the net direction in which the reaction is pro- 
ceeding at any moment depends on the relative concentrations of all 
participating molecules and can be predicted from the K,,. If you 
know the K,, for the reaction, you can calculate whether the reaction 
will proceed forward or backward by writing out the equation for 
Kg plugging in the actual concentrations of all the reactants and 
products, and then comparing this value to the expected value of K 9. 
If the result is equal to K,,, then the reaction is already at equilibrium. 
If, however, the result is less than K - then this means that the reaction 
will proceed toward product formation. 

One special type of equilibrium constant bears mentioning: the 
dissociation constant, denoted Kj. The dissociation constant is a 
measure of how much two molecules tend to stick together. Given 
two molecules A and B, for example a protein A that binds a ligand B, 
the reaction is 


AB = A+B 


where AB is the complex of A bound to B. Usually this refers to a 
noncovalent interaction. For this reaction, the equilibrium constant 
is called the dissociation constant and is defined as: 

_{AIB] 


Ka = TAB] 


where[AB] denotes the concentration of A bound to B, while[B] and 
[B] are the concentrations of free (unbound) A and B. By examining 
the equation that defines Kj, you can see that the smaller K, is, the 
greater will be [AB] relative to[B] and[B]. Thus, if a pharmaceutical 
chemist wanted to rank a series of inhibitors for their ability to bind a 
particular protein, she could measure the K,, and the inhibitor with 
the smallest Kz would be the one that binds the most tightly. 

Let us return to the subject of energetics. The ratio of reactants 
to products present at equilibrium is determined by the relative free- 
energy levels of the reactants and products. As long as the total free 
energy of the reactants is greater than the total free energy of the 
products, then AG has a negative value and the reaction proceeds in 
the direction of formation of products. The greater the AG, the 
farther the reaction is from equilibrium and the more work that can 
be performed by the system. As the reaction proceeds, the difference 
in free-energy content between the reactants and products decreases 


(AG becomes less negative), until at equilibrium the difference is 
zero (AG = 0) and no further work can be obtained. 

Because AG for a given reaction depends on the reaction mix- 
ture present at a given time, it is not a useful term to compare the 
energetics of various reactions. To place reactions on a comparable 
basis and to allow various types of calculations to be made, a conven- 
tion has been adopted to consider the free-energy change that occurs 
during a reaction under a set of standard conditions. For biochemical 
reactions, the conditions are arbitrarily set at 25°C (298 K) and 1 atm 
of pressure, with all the reactants and products present at 1.0 M con- 
centration, except for water, which is present at 55.6 M, and H* at 
10°’ M (pH 7.0). The standard free-energy change (AG”) 
describes the free energy released when reactants are converted to 
products under these standard conditions. It must be kept in mind 
that standard conditions do not prevail in the cell, and therefore one 
must be cautious in the use of values for standard free-energy differ- 
ences in calculations of cellular energetics. 

The relationship between the equilibrium constant and stand- 
ard free-energy change is given by the equation 


AG*'=—RT In Kg 
When the natural log (In) is converted to log, 9, the equation becomes 


AG* =-2.303RT log Ki, 


where R is the gas constant (1.987 cal/mol - K) and T is the absolute 
temperature (298 K).° Recall that the log of 1.0 is zero. Consequently, it 
follows from the above equation that reactions having equilibrium con- 
stants greater than 1.0 have negative AG” values, which indicates that 
they can occur spontaneously under standard conditions. Reactions 
having equilibrium constants of less than 1 have positive AG® values 
and cannot occur spontaneously under standard conditions. In other 
words, given the reaction written as follows: A+B = C+D, if the 
AG*’ is negative, the reaction will go to the right when reactants and 
products are all present at 1.0 M concentration at pH 7. The greater the 
negative value, the farther to the right the reaction will proceed before 
equilibrium is reached. Under the same conditions, if the AG is posi- 
tive, the reaction will proceed to the left; that is, the reverse reaction is 
favored. The relation between AG” and K’,, is shown in Table 3.2. 


Free-Energy Changes 
in Metabolic Reactions 


One of the most important chemical reactions in the cell is the 
hydrolysis of ATP (FIGURE 3.5). In the reaction, 


ATP+H,0 > ADP +P, 


the standard free-energy difference between the products and reactants 
is -7.3kcal/ mol. Based on this information, it is evident that the 
hydrolysis of ATP is a highly favorable (exergonic) reaction, that is, 
one that tends toward a large [ADP]/[ATP] ratio at equilibrium. 
There are several reasons why this reaction is so favorable, one of 
which is evident in Figure 3.5. The electrostatic repulsion created by 


?AG° indicates standard conditions including pH 7, whereas AG° indicates standard 
conditions in 1.0 MH* (pH 0.0). The designation K 2 also indicates a reaction mix- 
ture at pH 7. 


>The right-hand side of this equation is equivalent to the amount of free energy lost as 
the reaction proceeds from standard conditions to equilibrium. 


TABLE 3.2 Relation between AG” 
and Kg, at 25°C 


AG” 
K' eg (kcal/mol) 
10° -8.2 
10* 5.5 
10° -2.7 
10! -1.4 
10° 0.0 
107! 1.4 
iO? 2.7 
10°* 5.5 
10°° 8.2 


four closely spaced negative charges on ATP“. is partially relieved by 
formation of ADP*. 

It is important that the difference between AG and AG™ be kept 
clearly in mind. The AG” is a fixed value for a given reaction and 
indicates the direction in which that reaction would proceed were 
the system at standard conditions. Because standard conditions do 
not prevail within a cell, AG® values cannot be used to predict the 
direction in which a particular reaction is proceeding at a given 
moment within a particular cellular compartment. To do this, one 
must know the AG, which is determined by the concentrations of the 
reactants and products that are present at the time. At 25°C 


[C][D] 
[A] [B] 


AG = AG*' +2.303RT log 


[C][D] 
[A] [B] 


AG = AG* + 2.303(1.987cal/mol + K)(298K) log 


[C][D] 
[A][B] 


AG = AG* +(1.4 kcal/mol) log 


where [A], [B], [C], and [D] are the actual concentrations at the time. 
Calculation of the AG reveals the direction in which the reaction 
in the cell is proceeding and how close the particular reaction in 
question is to equilibrium. For example, typical cellular concentra- 
tions of the reactants and products in the reaction for ATP hydrolysis 


might be [ATP] = 10 mM; [ADP] =1 mM; [P;] =10 mM. Substituting 87 


these values into the equation, 


AG = AG* +2.303RT log Eee 
[ATP] 
[10 *][107] 


AG = -7.3 kcal /mol + (1.4 kcal/mol) log 102) 
107 


AG =-7.3 kcal/mol +(1.4 kcal/mol)(-3) 


AG =~-11.5 kcal/mol (or — 46.2 kJ/mol) 


Thus, even though the AG” for the hydrolysis of ATP is —7.3 kcal/mol, 
the typical AG in the cell for this reaction is about —12 kcal/mol 
because a cell maintains a high[ ATP] /[ADP] ratio. This fact illustrates 
a useful way to think about AG, as a number that describes how far 
away a system is from equiblibrium. ATP is a high energy molecule 
because the ratio of ATP to ADPis far away from its equilibrium value. 
Cells carry out many reactions with positive AG values because 
the relative concentrations of reactants and products favor the pro- 
gress of the reactions. This can happen in two ways. The first illus- 
trates the important difference between AG and AG”, and the second 
reveals the manner in which reactions with positive AG” values can 
be driven in the cell by the input of stored chemical energy. 
Consider the reaction of glycolysis (see Figure 3.24) in 
which dihydroxyacetone phosphate is converted to glyceraldehyde 
3-phosphate. The AG” for this reaction is +1.8 kcal/mol, yet the for- 
mation of the product of this reaction takes place in the cell. The 
reaction proceeds because other cellular reactions maintain the ratio 
of the reactant to product above that defined by the equilibrium con- 
stant. As long as this condition holds, the AG will be negative and the 
reaction will continue spontaneously in the direction of formation of 
glyceraldehyde 3-phosphate. This brings up an important character- 
istic of cellular metabolism, namely, that specific reactions cannot be 
considered independently as if they were occurring in isolation in a 
test tube. Hundreds of reactions occur simultaneously in a cell. All 
these reactions are interrelated because a product of one reaction 
becomes a reactant for the next reaction in the sequence and so on 
throughout one metabolic pathway and into the next. To maintain 
the production of glyceraldehyde 3-phosphate at the expense of 
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FIGURE 3.5 ATP hydrolysis. Adenosine triphosphate (ATP) is hydrolyzed as part of many biochemical processes. In most reactions, as shown here, ATP is 
hydrolyzed to ADP and inorganic phosphate (P;), but in some cases (not shown) it is hydrolyzed to AMP, a compound with only one phosphate group, and 
pyrophosphate (PP). Both of these reactions have essentially the same AG” of —7.3 kcal/mol (-30.5 kJ/mol). 
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dihydroxyacetone phosphate, the reaction is situated within a meta- 
bolic pathway in such a way that the product is removed by the next 
reaction in the sequence at a sufficiently rapid rate that a favorable 
ratio of the concentrations of these two molecules is retained. 


REVIEW | 


1. Discuss the differences between AG and AG”; 
between the relative rates of the forward and reverse 
reactions when AG is negative, zero, or positive. What 
is the relationship between AG” and K.,? How can a 
cell carry out a reaction that has a +AG®’? 

2. How is it possible for a cell to maintain an [ATP]/[ADP] 
ratio greater than one? How does this ratio differ from 
that expected at equilibrium? 

3. How is it that ice formation does not take place at 
temperatures above 0°C? 


— = 


3.3 Coupling Endergonic 
and Exergonic Reactions 


Reactions with large positive AG® values are typically driven by the 
input of energy. Consider the formation of the amino acid glutamine 
from glutamic acid by the enzyme glutamine synthetase: 
Glutamicacid+NH,; — glutamine AG®° =+3.4 kcal/mol 
This endergonic reaction takes place in the cell because glutamic 


acid is actually converted to glutamine in two sequential reactions, 
both of which are exergonic: 


Ist reaction : Glutamic acid + ATP —> glutamyl phosphate + ADP 


2nd reaction : Glutamyl phosphate ++ NH; > glutamine + P, 


Overall reaction: Glutamic acid + ATP + NH; > 
glutamine + ADP + P; 


AG* =-3.9 kcal/mol 


The formation of glutamine is said to be coupled to the hydrolysis of 
ATP. As long as the AG for ATP hydrolysis is more negative than 
the AG for glutamine synthesis from glutamic acid is positive, the 
“downhill” ATP hydrolysis reaction can be used to drive the “uphill” 
synthesis of glutamine. To couple the two chemical reactions, the 
product of the first reaction becomes the reactant for the second. 
The bridge between the two reactions—glutamyl] phosphate in this 
case—is called the common intermediate. What happens, in essence, 
is that the exergonic hydrolysis of ATP is taking place in two steps. 
In the first step, glutamic acid acts as an acceptor of the phosphate 
group, which is displaced by NH, in the second step. 

The hydrolysis of ATP can be used in the cell to drive reactions 
leading to the formation of molecules such as glutamine because ATP 
levels are maintained at much higher levels (relative to those of ADP) 
than they would be at equilibrium. This can be demonstrated by the 
following calculation. As noted above, a typical cellular concentration of 
P; might be 10 mM. To calculate the equilibrium ratio of [ADP]/[ATP] 
under these conditions, we can set AG at the equilibrium value of 0 and 
solve the following equation (taken from page 87) for [ADP]/[ATP]: 


[ADP] [P;] 


AG = AG*' + (1.4 kcal/mol) log 
[ATP] 


[ADP ][L0~] 


0 =-7.3kcal/mol + (1.4 kcal/mol) log 
[ATP] 


0 =-7.3 kcal/mol +(1.4 kcal/mol 107 +log er 


[ATP] 


[ADP] 
[ATP] 


7.3 kcal/mol = (1.4 kcal/mol )(—2) + (1.4 kcal/mol)log 


_ [ADP] _10.1kcal/mol _ 
"UMP idkeliool 


ADE 4.6007 
[ATP] 


Thus, at equilibrium, the ADP concentration would be expected to 
be more than 10’ times that of ATP, but in fact, the ATP concentra- 
tions in most cells are 10 to 100 times that of ADP. This is a crucial 
point because it’s the relative concentrations of ATP and ADP that 
matter. If a cell were to contain an equilibrium mixture of ATP, ADP, 
and P;, it wouldn't matter how much ATP was present, the cell would 
have no capacity to perform work. 

ATP hydrolysis is used to drive most endergonic processes 
within the cell, including chemical reactions such as the one just 
described, the separation of charge across a membrane, the concen- 
tration of a solute, the movement of filaments in a muscle cell, and 
changes in the properties of proteins (FIGURE 3.6). ATP can be used 
for such diverse processes because its terminal phosphate group can 
be transferred to a variety of different types of molecules, including 
amino acids, sugars, lipids, and proteins. In most coupled reactions, 
the phosphate group is transferred in an initial step from ATP to 
one of these acceptors and is subsequently removed in a second step 
(see Figure 4.46 as an example). 


REVIEW | 


1. What is the role of a common intermediate in driving 
an unfavorable chemical reaction? 


3.4 Equilibrium versus 
Steady-State Metabolism 


As reactions tend toward equilibrium, the free energy available to do 
work decreases toward a minimum and the entropy increases toward 
a maximum. Thus, the farther a reaction is kept from its equilibrium 
state, the less its capacity to do work is lost to the increase in entropy. 
Cellular metabolism is essentially nonequilibrium metabolism; that 
is, it is characterized by nonequilibrium ratios of products to reac- 
tants. This does not mean that some reactions do not occur at or near 
equilibrium within cells. In fact, many of the reactions of a metabolic 
pathway may be near equilibrium (see Figure 3.25). However, at least 
one and often several reactions in a pathway are poised far from equi- 
librium, making them essentially irreversible. These are the reactions 
that keep the pathway going in a single direction. These are also the 
reactions that are subject to cellular regulation because the flow of 
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FIGURE 3.6 A few roles for ATP hydrolysis. In the cell, energy from ATP 
hydrolysis may be used to (a) separate charge across a membrane; 

(b) concentrate a particular solute within the cell; (c) drive an otherwise 
unfavorable chemical reaction; (d) slide filaments across one another, as 
occurs during the shortening of a muscle cell; (e) donate a phosphate 
group to a protein, thereby changing its properties and bringing about a 
desired response. In this case the added phosphate groups serve as 
binding sites for other proteins. 


materials through the entire pathway can be greatly increased or 
decreased by stimulating or inhibiting the activity of the enzymes 
that catalyze these reactions. 

The basic principles of thermodynamics were formulated using 
nonliving, closed systems (no exchange of matter between the system 
and its surroundings) under reversible equilibrium conditions. The 
unique features of cellular metabolism require a different perspective. 
Cellular metabolism can maintain itself at irreversible, nonequilib- 
rium conditions because unlike the environment within a test tube, 
the cell is an open system. Materials and energy are continually flowing 
into the cell from the bloodstream or culture medium. The extent of 
this input into cells from the outside becomes apparent if you simply 
hold your breath. We depend from minute to minute on an external 
source of oxygen because oxygen is a very important reactant in cel- 
lular metabolism. The continual flow of oxygen and other materials 
into and out of cells allows cellular metabolism to exist in a steady 
state (FIGURE 3.7). In a steady state, the concentrations of reactants 


Steady state 


ADP ATP 
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Equilibrium 
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ADP ATP 
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FIGURE 3.7 Steady state versus equilibrium. (a) As long as this ameba 
can continue to take in nutrients from the outside world, it can harvest the 
energy necessary to maintain concentrations of compounds at a steady 
state, which may be far from equilibrium. The steady-state concentrations 
of ATP and ADP are indicated by the colored dots and the histogram. 

(b) When the ameba dies, the concentrations of ATP and ADP (as well as 
other biochemicals) shift toward their equilibrium ratios. 


and products remain relatively constant, even though the individual 
reactions are not necessarily at equilibrium. This is not to suggest 
that the concentrations of cellular metabolites do not change. Cells 
are capable of continually adjusting the concentration of key sub- 
stances in response to changing conditions. A rise or fall in the level 
of regulatory substances, such as the hormone insulin, for example, 
can cause a dramatic surge or decline in the production of sugars, 
amino acids, or fats. Cells, in other words, exist in a state of dynamic 
nonequilibrium, in which the rates of forward and reverse reactions 
can be increased or decreased instantaneously in response to chang- 
ing conditions. 


REVIEW | 


1. What defines a closed system? Are cells closed 
systems? 


e | 


3.5 Enzymes as Biological 
Catalysts 


At the end of the nineteenth century, a debate was raging as to 
whether the process of ethanol formation required the presence of 
intact yeast cells. On one side of the debate was the organic chemist 
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Justus von Liebig, who argued that the reactions of fermentation that 
produced the alcohol were no different from the types of organic 
reactions he had been studying in the test tube. On the other side 
was the biologist Louis Pasteur, who argued that the process of fer- 
mentation could only occur within the confines of an intact, highly 
organized, living cell. 

In 1897, two years after Pasteur’s death, Hans Btichner, a bacte- 
riologist, and his brother Eduard, a chemist, were preparing “yeast 
juice’—an extract made by grinding yeast cells with sand grains 
and then filtering the mixture through filter paper. They wanted to 
preserve the yeast juice for later use. After trying to preserve the 
extract with antiseptics and failing, they attempted to protect the 
preparation from spoilage by adding sugar, the same procedure 
used to preserve jams and jellies. Instead of preserving the solution, 
the yeast juice produced gas from the sugar, and it bubbled contin- 
uously for days. After further analysis, Eduard discovered that fer- 
mentation was producing ethanol and bubbles of carbon dioxide. 
Buchner had shown that fermentation did not require the presence 
of intact cells. 

It was soon found, however, that fermentation was very differ- 
ent from the types of reactions carried out by organic chemists. 
Fermentation required the presence of a unique set of catalysts that 
had no counterpart in the nonliving world. These catalysts were 
called enzymes (after the Greek for “in yeast”). Enzymes are the 
mediators of metabolism, responsible for virtually every reaction 
that occurs in a cell. Without enzymes, metabolic reactions would 
proceed so slowly as to be imperceptible. 

The first evidence that enzymes are proteins was obtained in 
1926 by James Sumner of Cornell University, when he crystallized 
the enzyme urease from jack beans and determined its composition. 
Although this finding was not greeted with much positive acclaim at 
the time, several other enzymes were soon shown to be proteins, and 
it became accepted for the next few decades that all biological cata- 
lysts were proteins. Eventually, it became evident that certain bio- 
logical reactions are catalyzed by RNA molecules. For the sake of 
clarity, the term enzyme is still generally reserved for protein cata- 
lysts, while the term ribozyme is used for RNA catalysts. We will 
restrict the discussion in the present chapter to protein catalysts and 
discuss the properties of RNA catalysts in Chapter 11. 

Even though enzymes are proteins, many of them are conju- 
gated proteins; that is, they contain nonprotein components, called 
cofactors, which may be inorganic (metals) or organic (coen- 
zymes). Vitamins and their derivatives often function as coenzymes. 
When present, cofactors are important participants in the function- 
ing of the enzyme, often carrying out activities for which amino acids 
are not suited. In myoglobin, for example, as discussed in the previ- 
ous chapter, the iron atom of a heme group is the site where oxygen 
is bound and stored until required by cellular metabolism. 


The Properties of Enzymes 


As is true of all catalysts, enzymes exhibit the following properties: 
(1) They are required only in small amounts; (2) they are not altered 
irreversibly during the course of the reaction, and therefore each 
enzyme molecule can participate repeatedly in individual reactions; 
and (3) they have no effect on the thermodynamics of the reaction. 
This last point is particularly important. Enzymes do not supply 
energy for a chemical reaction and therefore do not determine 
whether a reaction is thermodynamically favorable or unfavorable. 


Similarly, enzymes do not determine the ratio of products to reac- 
tants at equilibrium. These are inherent properties of the reacting 
chemicals. As catalysts, enzymes can only accelerate the rate at which 
a favorable chemical reaction proceeds. 

There is no necessary relationship between the magnitude of the 
AG for a particular reaction and the rate at which that reaction takes 
place. The magnitude of AG informs us only of the difference in free 
energy between the beginning state and equilibrium. It is totally inde- 
pendent of the pathway or the time it takes to reach equilibrium. The 
oxidation of glucose, for example, is a very favorable process, as can be 
determined by the amount of energy released during its combustion. 
However, glucose crystals can be left out in room air virtually indefi- 
nitely without a noticeable conversion into less energetic materials. 
Glucose, in other words, is kinetically stable, even if it is thermody- 
namically unstable. Even if the sugar were to be dissolved, as long as 
the solution was kept sterile, it would not rapidly deteriorate. However, 
if one were to add a few bacteria, within a short period of time the 
sugar would be taken up by the cells and enzymatically degraded. 

Enzymes are remarkably adept catalysts. Catalysts employed by 
organic chemists in the lab, such as acid, metallic platinum, and mag- 
nesium, generally accelerate reactions a hundred to a thousand times 
over the noncatalyzed rate. In contrast, enzymes typically increase 
the velocity of a reaction 10° to 10'° fold or greater (Table 3.3). On the 
basis of these numbers, enzymes can accomplish in one second what 
would require from three years to 300,000 years if the enzyme were 
absent. Even more remarkably, they accomplish this feat at the mild 
temperature and pH found within a cell. In addition, unlike the inor- 
ganic catalysts used by chemists, enzymes are highly specific in the 
reactants they can bind and the reaction they can catalyze. The 
reactants bound by an enzyme are called substrates. If, for example, 
the enzyme hexokinase is present in solution with a hundred low- 
molecular-weight compounds in addition to its substrate, glucose, only 
the glucose molecules will be recognized by the enzyme and undergo 
reaction. For all practical purposes, the other compounds may as well 
be absent. This type of specificity, whether between enzymes and sub- 
strates or other types of proteins and the substances that they bind, is 
crucial for maintaining the order required to sustain life. 

In addition to their high level of activity and specificity, enzymes 
act as metabolic traffic directors in the sense that enzyme-catalyzed 
reactions are very orderly—the only products formed are the appro- 
priate ones. This is very important because the formation of chemi- 
cal by-products would rapidly take its toll on the life of a fragile cell. 
Finally, unlike other catalysts, the activity of enzymes can be regu- 
lated to meet the particular needs of the cell at a particular time. As 
will be evident in this chapter and the remainder of the text, the 
enzymes of a cell are truly a wondrous collection of miniature 
molecular machines. 


Overcoming the Activation 
Energy Barrier 


How are enzymes able to accomplish such effective catalysis? The 
first question to consider is why thermodynamically favorable reac- 
tions do not proceed on their own at relatively rapid rates in the 
absence of enzymes. Even ATP, whose hydrolysis is so favorable, is 
stable in a cell until broken down in a controlled enzymatic reaction. 
If this weren't the case, ATP would have little biological use. 
Chemical transformations require that certain covalent bonds 
be broken within the reactants. For this to occur, the reactants must 


TABLE 3.3 Catalytic Activity of a Variety of Enzymes 


Enzyme 


Nonenzymatic ty,2' 


Turnover number? Rate enhancement? 


OMP decarboxylase 
Staphylococcal nuclease 
Adenosine deaminase 
AMP nucleosidase 
Cytidine deaminase 
Phosphotriesterase 
Carboxypeptidase A 
Ketosteroid isomerase 
Triosephosphate isomerase 
Chorismate mutase 
Carbonic anhydrase 
Cyclophilin, human 


78,000,000 yr 
130,000 yr 
120 yr 
69,000 yr 
69 yr 

2.9 yr 

7.3 yr 

7 wk 
1.9d 

7.4 hr 

5 sec 

23 sec 


39 1.4 x 10!” 

95 5.6 x 10!4 
370 2.1x 10" 
60 6.0 x 10! 
299 1.2 x 10” 
2,100 2.8 x 10! 
578 1.9 x 10! 
66,000 3.9 x 10! 
4,300 1.0 x 10° 
50 1.9 x 10° 

1x 10° 7.7 x 10° 
13,000 4.6 x 10° 


'The time that would elapse for half the reactants to be converted to product in the absence of enzyme. 
?The number of reactions catalyzed by a single enzyme molecule per second when operating at a saturating substrate concentration. 
>The increase in reaction rate achieved by the enzyme-catalyzed reaction over the noncatalyzed reaction. 


Source: A. Radzicka and R. Wolfenden, Science 267:91, 1995. Copyright 1995. Reprinted with permission from AAAS. 


contain sufficient kinetic energy (energy of motion) that they over- 
come a barrier, called the activation energy (Ea). This is expressed 
diagrammatically in FIGURE 3.8, where the activation energy is rep- 
resented by the height of the curves. The reactants in a chemical 
reaction are often compared to an object resting on top of a cliff 
ready to plunge to the bottom. If left to itself, the object in all likeli- 
hood would remain there indefinitely. However, if someone were to 
come along and provide the object with sufficient energy to over- 
come the friction or other small barrier in its way and cause it to 
reach the edge of the cliff, it would spontaneously drop to the bottom. 


Reactants 


@ sco 
Glucose + ATP 


Free enerry ——> 


Initial 

State}, eT carci aaa cane 
AG* of reaction 
= —4 kcal/mol 

Final 

State —p pce re ee 


The object has the potential to fall to a lower energy state once the 
kinetic barriers have been removed. 

In a solution at room temperature, molecules exist in a state of 
random movement, each possessing a certain amount of kinetic 
energy at a given instant. Among a population of molecules, their 
energy is distributed in a bell-shaped curve (FIGURE 3.9), some hav- 
ing very little energy and others much more. Molecules of high 
energy (activated molecules) remain as such only for a brief time, 
losing their excess energy to other molecules by collision. Let us 
consider a reaction in which one reactant molecule is split into two 


Transition state (activated complex between glucose and ATP) 


En of uncatalyzed reaction 
in forward direction 


E, of enzyme-catalyzed reaction 
in forward direction 


Products 
2) co 
Glucose 6-phosphate + ADP 


Course of reaction 


FIGURE 3.8 Activation energy and enzymatic reactions. Even though the formation of glucose 6-phosphate is a thermodynamically favored reaction 
(AG* = —4 kcal/mol), the reactants must possess sufficient energy to achieve an activated state in which the atomic rearrangements necessary for the 
reaction can occur. The amount of energy required is called the activation energy (E,) and is represented by the height of the curve. The activation energy is 
not a fixed value, but varies with the particular reaction pathway. E, is greatly reduced when the reactants combine with an enzyme catalyst. (This diagram 
depicts a simple, one-step reaction mechanism. Many enzymatic reactions take place in two or more steps leading to the formation of intermediates (as in 
Figure 3.13). Each step in the reaction has a distinct E, and a separate transition state.) 
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__— Minimal energy 
required of molecules 
for uncatalyzed reaction 


Minimal energy ——____ ; 
required of molecules 1 
for catalyzed reaction ! 


@ Molecules capable of 
reaction in presence 
of enzyme catalyst 
at lower temperature 

© Molecules capable of 
reaction at elevated 
temperature — no catalyst 

© Molecules capable of 
reaction at lower 
temperature — no catalyst 


Number of molecules with a particular energy 


FIGURE 3.9 The effect of lowering activation energy on the rate of a 
reaction. The bell-shaped curves indicate the energy content of a 
population of molecules present in a reaction mixture at two different 
temperatures. The number of reactant molecules containing sufficient 
energy to undergo reaction is increased by either heating the mixture or 
adding an enzyme catalyst. Heat increases the rate of reaction by increas- 
ing the energy content of the molecules so that a larger fraction have 
enough energy to overcome the activation energy barrier (red versus blue 
curves), while the enzyme does so by lowering the activation energy 
required for the reaction to occur (vertical dotted lines). 


product molecules. If a given reactant molecule acquires sufficient 
energy to overcome the activation barrier, then the possibility exists 
that it will split into the two product molecules. The rate of the reac- 
tion depends on the number of reactant molecules that contain the 
necessary kinetic energy at any given time. One way to increase the 
reaction rate is to increase the energy of the reactants. This is most 
readily done in the laboratory by heating the reaction mixture 
(Figure 3.9). In contrast, applying heat to an enzyme-mediated reac- 
tion leads to the rapid inactivation of the enzyme due to its 
denaturation. 

Reactants, when they are at the crest of the energy hump and 
ready to be converted to products, are said to be at the transition 
state (Figure 3.8). At this point, the reactants have formed a fleet- 
ing, activated complex in which bonds are being formed and broken. 
We can illustrate the nature of a transition state structure by examin- 
ing the interconversion of D and L stereoisomers of proline, a reac- 
tion catalyzed by the bacterial enzyme proline racemase. 


s Sf 
OX Sen Cor BOK 
COO™ 
= i 
D+ ies Transition L-proline 
state 


This reaction proceeds in either direction through the loss of a proton 
from the a-carbon of the proline molecule. As a result, the transition 
state structure contains a negatively charged carbanion in which the 
three bonds formed by the carbon atom are all in the same plane. 


Unlike the difference in standard free energy for a reaction, the 
activation energy required to reach the transition state is not a fixed 
value but varies with the particular reaction mechanism utilized. 
Enzymes catalyze reactions by decreasing the magnitude of the acti- 
vation energy barrier. Consequently, unlike catalysis by heat, 
enzymes cause their substrates to be very reactive without having to 
be raised to particularly high energy levels. A comparison between 
the percentage of molecules able to react in an enzyme-catalyzed 
reaction versus a noncatalyzed reaction is shown in Figure 3.9. 
Enzymes are able to lower activation energies by binding more 
tightly to the transition state than to the reactants, which stabilizes 
this activated complex, thereby decreasing its energy. Asa result, dif- 
ferences in the geometry and bonding properties between the reac- 
tants and the transition state are a focus of research in enzymatic 
mechanisms. The importance of the transition state can be demon- 
strated in numerous ways. For example: 


e Compounds that resemble the transition state of a reaction tend 
to be very effective inhibitors of that reaction because they are 
able to bind so tightly to the catalytic region of the enzyme. 


e Antibodies normally do not behave like enzymes but, rather, 
simply bind to molecules with high affinity. However, antibodies 
that bind to a compound that resembles a transition state for a 
reaction are often able to act like enzymes and catalyze the 
breakdown of that compound. 


As the transition state is converted to products, the affinity of the 
enzyme for the bound molecule(s) decreases and the products are 
expelled. 


The Active Site 


As catalysts, enzymes accelerate bond-breaking and bond-forming 
processes. To accomplish this task, enzymes become intimately 
involved in the activities that are taking place among the reactants. 
Enzymes do this by forming a complex with reactants, called an 
enzyme-substrate (ES) complex. An ES complex is illustrated 
schematically in FIGURE 3.10 and the image in Figure 3.14. In most 
cases, the association between enzyme and substrate is noncovalent, 
though many examples are known in which a transient covalent 
bond is formed. 

That part of the enzyme molecule that is directly involved in bind- 
ing the substrate is termed the active site. The active site and the 
substrate(s) have complementary shapes, enabling them to bind 
together with a high degree of precision. The binding of substrate to 
enzyme is accomplished by the same types of noncovalent interactions 
(ionic bonds, hydrogen bonds, hydrophobic interactions) that deter- 
mine the structure of the protein itself. For example, the enzyme 
depicted in FIGURE 3.11a contains a number of positively charged 
residues strategically located to bind negatively charged atoms of the 
substrate. In addition to binding the substrate, the active site contains a 
particular array of amino acid side chains that influence the substrate 
and lower the activation energy of the reaction. The importance of indi- 
vidual side chains of the active site can be evaluated by site-directed 
mutagenesis (Section 18.25), a technique in which one particular amino 
acid is replaced by another amino acid having different properties. 

The active site is typically buried in a cleft or crevice that leads 
from the aqueous surroundings into the depths of the protein. When 
a substrate enters the active site cleft, it typically gives up its bound 
water molecules (desolvation) and enters a hydrophobic environ- 
ment within the enzyme. The reactivity of the active-site side chains 


Enzyme-substrate 
complex 


Pyruvate 


FIGURE 3.10 Formation of an enzyme-substrate complex. Schematic 
drawing of the reaction catalyzed by pyruvate kinase (see Figure 3.24) in 
which the two substrates, phosphoenolpyruvate (PEP) and ADP, bind to the 
enzyme to form an enzyme-substrate (ES) complex, which leads to the 
formation of the products, pyruvate and ATP. 


may be much greater in this protected environment as compared to 
the aqueous solvent of the cell. The amino acids that make up the 
active site are usually situated at distant points along the length of 
the extended polypeptide chain, but they are brought into close 
proximity to one another as the polypeptide folds into its final 


(a) 


tertiary structure (Figure 3.11b). The structure of the active site 93 


accounts not only for the catalytic activity of the enzyme, but also for 
its specificity. As noted above, most enzymes are capable of binding 
only one or a small number of closely related biological molecules. 


REVIEW | 
1. How is it possible to have a reaction characterized by a 
large AG and a small E,? a large E, and a small AG? 


2. Explain how enzymes can be so specific toward the 
substrates they bind. 


e@- 


3.6 Mechanisms of Enzyme 
Catalysis 


How is it that an enzyme can cause a reaction to occur hundreds of 
times a second when that same reaction might occur at undetectable 
rates in the enzyme’s absence? The answer lies in the formation of 
the enzyme-substrate complex, which allows the substrate(s) to be 
taken out of solution and held onto the surface of the large catalyst 
molecule. Once there, the physical and chemical properties of the 
substrate can be affected in a number of ways, several of which are 
described in the following sections. 


(b) 


FIGURE 3.11 The active site of an enzyme. (a) Diagrammatic representation of the active site of the enzyme ribulose bisphosphate carboxylase oxyge- 
nase showing the various sites of interaction between the bound substrates (RUBP and CO.) and certain amino acid side chains of the enzyme. In addition to 
determining the substrate-binding properties of the active site, these noncovalent interactions alter the properties of the substrate in ways that accelerate its 
conversion to products. (b) An electron density map of the active site of a blood clotting enzyme called Factor Xa with a chemical inhibitor that mimics the 
substrate, prothrombin, shown superimposed (red). The gray surface provides an indication of the outer reaches of the electron orbitals of the atoms that 
make up the side chains of the enzyme, thus portraying a visual representation of the space occupied by the atoms of the active site. It can be clearly seen 
that the substrate binds in a deep cleft in the protein, typical of active sites. The active site serine, Ser195, is shown in blue. Protein regions important for 
determining which molecules can bind as substrates, known as S1 and S4, are made up of multiple side chains and are indicated in yellow and green. 


Source: (a) D. A. Harris, Bioenergetics at a Glance, P. 88, Blackwell, 1995; (b) From Yeh CH, Fredenburgh JC, Weitz JI. 2012. Oral direct factor Xa inhibitors. 


Circulation Research 111, 1069-1078. 
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Substrate Orientation 


Suppose you were to place a handful of nuts and bolts into a bag and 
shake the bag for 5 minutes. It is very unlikely that any of the bolts 
would have a nut firmly attached to its end when you were finished. 
In contrast, if you were to pick up a bolt in one hand and a nut in the 
other, you could rapidly guide the bolt into the nut. By holding the 
nut and bolt in the proper orientation, you have greatly decreased 
the entropy of the system. Enzymes lower the entropy of their sub- 
strates in a similar manner. 

Substrates bound to the surface of an enzyme are brought very 
close together in precisely the correct orientation to undergo reaction 
(FIGURE 3.12a). In contrast, when reactants are present in solution, 
they are free to undergo translational and rotational movements, and 
even those possessing sufficient energy do not necessarily undergo a 
collision that results in the formation of a transition-state complex. 


Changing Substrate Reactivity 


Enzymes are composed of amino acids having a variety of different 
types of side chains, from fully charged to highly nonpolar. When a 
substrate is bound to the surface of an enzyme, the distribution of 
electrons within that substrate molecule is influenced by the neigh- 
boring side chains of the enzyme (Figure 3.12b). This influence 


Substrate 


aligned 


Substrate 
acquires 
charged 


region 4 


Substrate 


stressed 


FIGURE 3.12 Three mechanisms by which enzymes accelerate 


reactions: (a) maintaining precise substrate orientation, (b) changing 
substrate reactivity by altering its electrostatic configuration, (c) exerting 
physical stress on bonds in the substrate to be broken. 


increases the reactivity of the substrate and stabilizes the transition-state 
complex formed during the reaction. These effects are accomplished 
without the input of external energy, such as heat. 

There are several general mechanisms whereby the reactivity of 
substrates is increased by association with enzymes. Basically, these 
mechanisms are similar to those characterized by organic chemists 
studying the mechanisms of organic reactions in a test tube. For 
example, the rate of reactions can be greatly affected by changes in 
pH. Although enzymes cannot change the pH of their medium, they 
do contain numerous amino acids with acidic or basic side chains. 
These groups are capable of donating or accepting protons to and 
from the substrate, thereby altering the electrostatic character of the 
substrate, making it more reactive. 

The active sites of many enzymes contain side chains with a par- 
tial positive or negative charge. Such groups are capable of interact- 
ing with a substrate to alter its electrostatic properties (Figure 3.12b) 
and hence its reactivity. Such groups are also capable of reacting with 
a substrate to produce a temporary, covalent enzyme-substrate link- 
age. Chymotrypsin, an enzyme that digests food proteins within the 
small intestine, acts in this way. The series of reactions that take 
place as chymotrypsin hydrolyzes a peptide bond in a substrate pro- 
tein is shown in FIGURE 3.13. The reaction in Figure 3.13 is divided 
into two steps. In the first step, the electronegative oxygen atom of 
the side chain of a serine of the enzyme “attacks” a carbon atom of 
the substrate. As a result, the peptide bond of the substrate is hydro- 
lyzed, and a covalent bond is formed between the serine and the 
substrate, displacing the remainder of the substrate as one of the 
products. As discussed in the figure legend, the ability of serine to 
carry out this reaction depends on a nearby histidine residue, which 
attracts the proton of serine’s hydroxyl group, increasing the nucleo- 
philic power of the group’s oxygen atom. Enzymes are also adept at 
utilizing water molecules in the reactions they catalyze. In the sec- 
ond step shown in Figure 3.130, the covalent bond between enzyme 
and substrate is cleaved by a water molecule, returning the enzyme 
to its original unbonded state and releasing the remainder of the 
substrate as the second product. 

Although amino acid side chains can engage in a variety of reac- 
tions, they are not well suited to donate or accept electrons. As we 
will see in the following section (and more fully in Chapters 5 and 6), 
the transfer of electrons is the central event in the oxidation- 
reduction reactions that play such a vital role in cellular metabolism. 
To catalyze these reactions, enzymes contain cofactors (metal ions or 
coenzymes) that increase the reactivity of substrates by removing or 
donating electrons. In a number of cases, enzymes create their own 
electron-accepting cofactor by chemically modifying one of the 
amino acid residues situated in their active site. 


Inducing Strain in the Substrate 


Although the active site of an enzyme may be complementary to its 
substrate(s), various studies reveal a shift in the relative positions of 
certain of the atoms of the enzyme once the substrate has bound. In 
many cases, the conformation shifts so that the complementary fit 
between the enzyme and reactants is improved (an induced fit), 
and the proper reactive groups of the enzyme move into place. An 
example of induced fit is shown in FIGURE 3.14. These types of 
movements within an enzyme molecule provide a good example ofa 
protein acting as a molecular machine. As these conformational 
changes occur, mechanical work is performed, allowing the enzyme 
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(a) (b) 


FIGURE 3.13 Diagrammatic representation of the catalytic 
mechanism of chymotrypsin. The reaction is divided into two steps. 

(a) The electronegative oxygen atom of a serine residue (Ser 195) in the 
enzyme, which carries a partial negative charge, carries out a nucleophilic 
attack on the carbonyl carbon atom of the substrate, which carries a 
partial positive charge, splitting the peptide bond. The polypeptide 
substrate is shown in blue. The serine is made more reactive by a closely 
applied histidine residue (His 57) that draws the proton from the serine 
and subsequently donates the proton to the nitrogen atom of the cleaved 
peptide bond. Histidine is able to do this because its side chain is a weak 
base that is capable of gaining and losing a proton at physiologic pH. 

(A stronger base, such as lysine, would remain fully protonated at this pH.) 
Part of the substrate forms a transient covalent bond with the enzyme 

by means of the serine side chain, while the remainder of the substrate 

is released as the first product. (It can be noted that the serine and 
histidine residues are situated 138 amino acids away from each other in 
the primary sequence but are brought together within the enzyme by 
the folding of the polypeptide. An aspartic acid, residue 102, which is 
not shown, also plays a role in catalysis by influencing the ionic state of 
the histidine.) (6) In the second step, the electronegative oxygen atom of 
a water molecule displaces the covalently linked substrate from the 
enzyme, regenerating the unbound enzyme molecule. As in the 

first step, the histidine plays a role in proton transfer; in this step, 

the proton is removed from water, making it a much stronger nucleo- 
phile. The proton is subsequently donated to the serine residue of 

the enzyme. 


(a) 


FIGURE 3.14 An example of induced fit. When a glucose molecule 
binds to the enzyme hexokinase, the protein undergoes a conforma- 
tional change that encloses the substrate within the active site pocket 
and aligns the reactive groups of the enzyme and substrate. 


Source: Courtesy of Thomas A. Steitz, Yale University. 


to exert a physical force on certain bonds within a substrate molecule. 
This has the effect of destabilizing the substrate, causing it to adopt 
the transition state in which the strain is relieved (Figure 3.12c). 

To fully understand the mechanism by which an enzyme cata- 
lyzes a particular reaction, it is necessary to describe the various 
changes in atomic and electronic structure in both the enzyme and the 
substrate(s) as the reaction proceeds. We saw in the last chapter how 
X-ray crystallographic techniques can reveal details of the structure of 
a large enzyme molecule. Because 40 to 60 percent of the volume of a 
typical protein crystal consists of trapped solvent, most crystallized 
enzymes retain a high level of enzymatic activity. It should be possible, 
therefore, to use X-ray diffraction techniques to study reaction mecha- 
nisms. There is one major limitation, namely, time. In a standard crys- 
tallographic study, enzyme crystals must be subjected to an X-ray beam 
for a period of hours or days while the necessary data are collected 
(See the Experimental Walkthrough Video: X-ray crystallography for this 
chapter). The portrait that emerges from such studies captures the 
structure of the molecule averaged out over time. Recent innovations, 
however, have made it possible to use X-ray crystallographic tech- 
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niques to observe the fleeting structural changes that take place in the 
active site while an enzyme is catalyzing a single reaction cycle. This 
approach, which is called time-resolved crystallography, can include: 


Use of ultra-high-intensity X-ray beams generated by a synchro- 
tron, an instrument used by nuclear physicists to study sub- 
atomic particles. This can cut the X-ray exposure period to a 
matter of picoseconds, which is the same time scale required for 
an enzyme to catalyze a single chemical transformation. 


» Cooling the enzyme crystals to temperatures within 20 to 40 
degrees of absolute zero, which slows the reaction by a factor as 
high as 10 billion, greatly increasing the lifetime of transient 
intermediates. 


Use of techniques to simultaneously trigger a reaction through- 
out an entire crystal so that all of the enzyme molecules in the 
crystal are in the same stage of the reaction at the same time. For 
example, in the case of a reaction in which ATP is a substrate, 
the enzyme crystals can be infiltrated with ATP molecules that 
have been made nonreactive by linking them to an inert group 
(e.g., a nitrophenyl group) by a photosensitive bond. When the 
crystals are exposed to a brief pulse of light, all of the “caged” 
ATP molecules are released, initiating the reaction simultane- 
ously in active sites throughout the crystal. 


Use of enzymes bearing site-directed mutations (Section 18.25) 
that impose kinetic barriers at specific stages of the reaction, 
leading to an increased lifetime for particular intermediates. 


» Determination of the structure at ultra-high (atomic) resolution 
(e.g., 0.8 A), which allows visualization of individual hydrogens 
that may be present in hydrogen bonds or associated with acidic 
groups in the protein; the presence or absence of bound water 
molecules; the precise conformation of catalytic side chains; and 
the subtle strain that appears in parts of the substrate during 
catalysis. The remarkable detail that can be extracted from these 
high-resolution images is illustrated by the hydrogen bond in 
FIGURE 3.15. 


By combining these techniques, researchers have been able to 
determine the three-dimensional structure of an enzyme at succes- 
sive stages during a single catalyzed reaction. When these individual 
“snapshots” are put together in sequence, they produce a “movie” 
showing the various catalytic intermediates that appear and disap- 
pear as the reaction proceeds. An example of the data that can be 
collected using several of these approaches is shown in FIGURE 3.16. 

While the spotlight has been focused on the conformational 
changes that occur as an enzyme transforms its substrates into prod- 
ucts, considerable attention has also been paid in recent years to the 
conformational changes that occur within the protein itself. As shown 
in Figure 2.39, proteins are dynamic molecules that are capable of 
built-in (intrinsic) motion that can have important functional rele- 
vance. In the case depicted in Figure 2.39, this motion allows entry of 
substrate to the enzyme’ active site. Analysis of this type of intrinsic 
motion by various experimental techniques has suggested that 
proteins—even in the absence of substrate—are capable of many of the 
same movements that can be detected during the protein's catalytic 
cycle. If true, this suggests that evolution has selected for protein struc- 
tures that are capable of intrinsic motions that can be useful for poten- 
tial functions that a protein might serve. Rather than inducing a specific 
conformational change, the substrate(s) might simply be “waiting” for 
a protein to assume a conformation to which it can bind effectively. 


FIGURE 3.15 Electron density map of a single hydrogen bond 
(green-dotted line). This map shows a very small part of the proteo- 
lytic enzyme subtilisin at atomic (0.78 A) resolution. The hydrogen 
atom (yellow) is seen to be shared between a nitrogen atom on the 
ring of a histidine residue and an oxygen atom of an aspartic 

acid residue. 

Source: From Peter Kuhn et al., Biochemistry 37:13450, 1998, courtesy 
of Richard Bott; © 1998 American Chemical Society. 


REVIEW | 


1. Examine Figure 3.13 illustrating the steps in an 
enzymatically catalyzed reaction and describe what is 


occurring without reading the legend. 
® 


FIGURE 3.16 Myoglobin: the movie. In this example of time-resolved 
X-ray crystallography, the structure of myoglobin (Mb) was determined 
with a molecule of CO bound to the heme group and at various 

times after the CO molecule was released. (CO binds to the same 

site on Mb as Op but is better suited for these types of studies.) Release 
of CO was induced simultaneously throughout the crystal by 

exposure to a flash of laser light (photolysis). Each of the structures 

was determined following a single, intense X-ray pulse from a 
synchrotron. The myoglobin molecule being studied had a single 

amino acid substitution that made it a better subject for analysis. 

a) A 6.5 A-thick slice through a Mb molecule showing the changes that 
occur by 100 picoseconds (1ps = one-trillionth of a second) following the 
release of CO from its binding site. The structure of Mb prior to the 
aser flash is shown in magenta, and the structure of the protein 100 ps 
after the laser flash is shown in green. Those parts of the molecule that 
did not change in structure over this time period appear white. Three 
arge-scale displacements near the CO-binding site can be seen 
indicated by the yellow arrows). (b) An enlarged view of the boxed 
region in part a. The released CO (solid circle) is situated about 

2 A from its original binding site (dashed circle). Movement of CO 

is accommodated by the rotation of Phe29, which pushes His64 
outward, which in turn dislodges a bound water molecule. 

(c) By 3.16 nanoseconds after the laser flash, the CO molecule has 
migrated to either of the two positions shown (labeled 2 and 3), Phe29 
and His64 have relaxed toward their initial states, and the heme group 
has undergone considerable displacement as indicated by the increased 
amount of green shading in the region of the heme. 

Source: Reprinted with permission from Friedrich Schotte, et al, Science 300: 
1946. 2003; Copyright ©2003, reprinted with permission from AAAS. 


3.7 Enzyme Kinetics 


Enzymes vary greatly in their ability to catalyze reactions. The cata- 
lytic activity of an enzyme is revealed by studying its kinetics, that is, 
the rate at which it catalyzes a reaction under various experimental 
conditions. 


The Michaelis-Menten Model 
of Enzyme Kinetics 


In 1913, Leonor Michaelis and Maud Menten reported on the math- 
ematical relationship between substrate concentration and the 
velocity of enzyme reactions as measured by the rate of product for- 
mation (also known as the velocity of the reaction). This relationship 
can be expressed by an equation (given on p. 98) that generates a 
hyperbola, as shown in FIGURE 3.17. To determine the velocity of a 
reaction experimentally, an incubation mixture is set up at the 
desired temperature, containing all the ingredients required except 
one, which when added initiates the reaction. If, at the time the reac- 
tion begins, no product is present in the mixture, then the amount of 
product that appears over time provides a measure of the velocity of 
the reaction. There are complicating factors in this procedure. If the 
incubation time is too great, the substrate concentration becomes 
measurably reduced. In addition, as product appears, it can be con- 
verted back to substrate by the reverse reaction, which is also cata- 
lyzed by the enzyme. Ideally, what we want to determine is the initial 
velocity, that is, the velocity at the instant when no product has yet 
been formed. To accurately measure the initial reaction velocity, 
brief incubation times and sensitive measuring techniques are 
employed. 

To generate a curve such as that shown in Figure 3.17, the initial 
velocity is determined for a series of incubation mixtures that 
contain the same amount of enzyme but an increasing concentration 
of substrate. It is evident from this curve that the initial reaction 
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FIGURE 3.17 The relationship between the rate (velocity) of an 
enzyme-catalyzed reaction and the substrate concentration. Since 
each enzyme molecule is only able to catalyze a certain number of 
reactions in a given amount of time, the velocity of the reaction (typically 
expressed as moles of product formed per second) approaches a maximal 
rate as the substrate concentration increases. The substrate concentration at 
which the reaction is at half-maximal velocity (Vinax/2) is called the Michaelis 
constant, or Ky. 
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velocity varies markedly with substrate concentration. At low sub- 
strate concentrations, enzyme molecules are subjected to relatively 
few collisions with substrate in a given amount of time. Consequently, 
the enzyme has “idle time”; that is, substrate molecules are rate limit- 
ing. At high substrate concentrations, enzymes are colliding with 
substrate molecules more rapidly than they can be converted to 
product. Thus, at high substrate concentrations, individual enzyme 
molecules are working at their maximal capacity; that is, enzyme 
molecules are rate limiting. Thus, as a greater and greater concentra- 
tion of substrate is present in the reaction mixture, the enzyme 
approaches a state of saturation. The initial velocity at this theoreti- 
cal saturation point is termed the maximal velocity (Vinax)- 

The simplest measure of the catalytic activity of an enzyme is 
provided by its turnover number, which can be calculated from 
the Vinax- The turnover number (or catalytic constant, ka, as it is also 
called) is the maximum number of molecules of substrate that can 
be converted to product by one enzyme molecule per unit time. A 
turnover number (per second) of 1 to 10° is typical for enzymes, 
although values as great as 10° are known (see Table 3.3). It is appar- 
ent from these values that a few molecules of enzyme can rapidly 
convert a large number of substrate molecules into product. 

The value of Vinax is only one useful term obtained from a plot 
such as that in Figure 3.17; another is the Michaelis constant (Ky), 
which is equal to the substrate concentration when the reaction 
velocity is one-half of Vinax. As its name implies, the Ky is constant 
for a given enzyme and, thus, independent of substrate or enzyme 
concentration. The relationship between Vinax and Ky is best appre- 
ciated by considering the Michaelis-Menten equation, which can be 
used to generate the plot shown in Figure 3.17. 


yoy... 
me [S]+ Ky, 


According to the equation, when the substrate concentration [S] is 
set at a value equivalent to Ky, then the velocity of the reaction (V) 
becomes equal to V,,,,/2, or one-half the maximal velocity. Thus, 
K,, =[S], when V =V,,,,./2. 

To generate a hyperbolic curve such as that in Figure 3.17 and 
make an accurate determination of the values for Vmax and Km, a 
considerable number of points must be plotted. An easier and more 
accurate description is gained by plotting the reciprocals of the 
velocity and substrate concentration against one another, as formu- 
lated by Hans Lineweaver and Dean Burk. When this is done, the 
hyperbola becomes a straight line (FIGURE 3.18) whose x intercept is 
equal to -1/Ky, y intercept is equal to 1/V,,,,, and slope is equal 
to Kw/Vmax. The values of Kym and Vinax are, therefore, readily 
determined by extrapolation of the line drawn from a relatively 
few points. 

In most cases, the value for Ky provides a measure of the affinity 
of the enzyme for the substrate. The higher the Ky, the greater the 
substrate concentration that is required to reach one-half Vinax and, 
thus, the lower the affinity of the enzyme for that substrate. The Ky 
for most enzymes ranges between 10‘ and 10’, with a typical value 
about 10 *M. Other factors that strongly influence enzyme kinetics 
are the pH and temperature of the incubation medium. Each enzyme 
has an optimal pH and temperature at which it operates at maximal 
activity (FIGURE 3.19). The influence of temperature on enzyme 
activity is illustrated by the striking effect of refrigeration in slowing 
the growth of microorganisms. 
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FIGURE 3.18 A Lineweaver-Burk plot of the reciprocals of velocity 
and substrate concentration from which the values for the V,,,, and Ky are 


readily calculated. 
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FIGURE 3.19 Dependence of the rate of an enzyme-catalyzed reaction 
on (a) pH, and (b) temperature. The shape of the curves and the optimal 
pH and temperature vary with the particular reaction. (a) Changes in pH 
affect the ionic properties of the substrate and the enzyme as well as the 
enzyme’s conformation. (b) At lower temperatures, reaction rates rise with 
increasing temperature due to the increased energy of the reactants. At 
higher temperatures, this positive effect is offset by enzyme denaturation. 
Source: (a) From E. A. Moelwyn-Hughes, In The Enzymes, J. B. Sumner and 
K. Myrback, Eds., Vol. 1, Academic Press, 1950. (6) From K. Hayashi et al., 
Journal Biochem. 64:93, 1968, by Permission of Oxford University Press. 


Enzyme Inhibitors 


Enzyme inhibitors are molecules that are able to bind to an enzyme 
and decrease its activity. The cell depends on inhibitors to regulate 
the activity of many of its enzymes; biochemists use inhibitors to 
study the properties of enzymes; and many pharmaceutical compa- 
nies produce enzyme inhibitors that act as drugs. Enzyme inhibitors 
can be divided into two types: reversible or irreversible. 
Irreversible inhibitors are those that bind very tightly to an 
enzyme, often by forming a covalent bond to one of its amino acid 
residues. A number of nerve gases, such as diisopropylphosphofluori- 
date and the organophosphate pesticides, act as irreversible inhibitors 
of acetylcholinesterase, an enzyme that plays a crucial role in destroy- 
ing acetylcholine, the neurotransmitter responsible for causing muscle 
contraction. With the enzyme inhibited, the muscle is stimulated 


continuously and remains in a state of permanent contraction. As 
discussed in the accompanying Human Perspective, the antibiotic 
penicillin acts as an irreversible inhibitor of a key enzyme in the 
formation of the bacterial cell wall. 

Reversible inhibitors, on the other hand, bind only loosely to an 
enzyme, and thus are readily displaced. For the sake of simplicity, we 
will discuss the simplest cases of reversible inhibitors, namely com- 
petitive and noncompetitive types. Competitive inhibitors are 
reversible inhibitors that compete with a substrate for access to the 
active site of an enzyme. Because substrates have a complementary 
structure to the active site to which they bind, competitive inhibitors 
must resemble the substrate to compete for the same binding site, 
but differ in a way that prevents them from being transformed into 
product (FIGURE 3.20). Analysis of the types of molecules that can 
compete with the substrate for a binding site on the enzyme provides 
insight into the structure of the active site and the nature of the inter- 
action between a substrate and its enzyme. 

Competitive enzyme inhibition forms the basis of action of many 
common drugs, as illustrated in the following example. Angiotensin 
converting enzyme (ACE) is a proteolytic enzyme that acts on a 10- 
residue peptide (angiotensin I) to produce an 8-residue peptide 
(angiotensin II). Elevated levels of angiotensin II are a major risk 
factor in the development of high blood pressure (hypertension). In 
the 1960s John Vane and his colleagues at the Eli Lilly Company 
began a search for compounds that could inhibit ACE. Previous 
studies had found that the venom of a Brazilian pit viper contained 
inhibitors of proteolytic enzymes, and it was found that one of the 
components of this venom, a peptide called teprotide 
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Teprotide (peptide bonds marked in red) 


was a potent competitive inhibitor of ACE. Although teprotide was 
shown to lower blood pressure in hypertensive patients, it was not a 
very useful drug because it had a peptide structure and thus was 
rapidly degraded if taken orally. Subsequent efforts to develop non- 
peptide inhibitors of the enzyme led researchers to synthesize a com- 
pound called captopril, 
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which became the first useful antihypertensive drug that acted by 
binding to ACE. 

The effectiveness of a competitive inhibitor depends on its rela- 
tive affinity for the enzyme. Regardless, competitive inhibition can 
be overcome if the substrate/inhibitor ratio is great enough. In other 
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FIGURE 3.20 Competitive inhibition. Because of their molecular 
similarity, competitive inhibitors are able to compete with the substrate for 
a binding site on the enzyme. The effect of a competitive inhibitor depends 
on the relative concentrations of the inhibitor and substrate. 


words, if the number of collisions between the enzyme and inhibitor 
becomes insignificant relative to those between the enzyme and its 
substrate, then the effect of the inhibitor becomes minimal. Given a 
sufficient substrate concentration, it remains theoretically possible 
to achieve the enzyme’s maximal velocity even in the presence of the 
competitive inhibitor. Because many drugs used to treat human dis- 
ease act by inhibiting enzymes, it is extremely important to be able to 
develop compounds that have high enough affinity for the enzyme 
that they can out-compete the natural substrate effectively. The pro- 
cess by which such tight binding compounds are developed was 
illustrated in Chapter 2 in the context of the BCR-ABL inhibitor 
Gleevec (Figure 2.51). 

In noncompetitive inhibition, the substrate and inhibitor do 
not compete for the same binding site; generally, the inhibitor acts at 
a site other than the enzyme’s active site. The level of inhibition 
depends only on the concentration of the inhibitor, and increasing 
the concentration of the substrate cannot overcome it. Because, in 
the presence of a noncompetitive inhibitor, a certain fraction of the 
enzyme molecules are necessarily inactive at a given instant, the 
maximal velocity of the population of enzyme molecules cannot be 
reached. Tipranavir is a potent noncompetitive inhibitor of the pro- 
tease produced by HIV when it infects a white blood cell. Unlike 
other inhibitors of this enzyme, such as ritonavir, tipranavir does not 
resemble the peptide substrate of the enzyme, nor does it compete 
with the substrate. The effects on the kinetics of enzymes in the pres- 
ence of noncompetitive and competitive inhibitors are shown 
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FIGURE 3.21 The effects of inhibitors on enzyme kinetics. The effect of both competitive and noncompetitive inhibitors is shown when the kinetics of 
the reaction are plotted as velocity of reaction versus substrate concentration (a) or its reciprocal (b). The noncompetitive inhibitor reduces V,,3x without 


affecting Ky, whereas the competitive inhibitor increases Ky without affecting Vinax- 


in FIGURE 3.21. In one case, the Vax is lowered, and in the other, the 
Ky is increased. In both types, the slope (K}4/ Vmax) is increased rela- 
tive to the uninhibited reaction. As we will see on page 110, cells 
utilize a version of noncompetitive inhibition to regulate the activity 
of key enzymes of metabolic pathways. 
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3.8 THE HUMAN PERSPECTIVE 


The Growing Problem of Antibiotic Resistance 


Not too long ago, it was widely believed that human health would 
no longer be threatened by serious bacterial infections. Bacterial 
diseases such as leprosy, pneumonia, gonorrhea, and dozens 
of others were being cured by administration of any one of a 
number of antibiotics—compounds that selectively kill bacteria 
without harming the human host in which they grow. This state 
of affairs has changed dramatically in the past several decades as 
pathogenic bacteria have become increasingly resistant to these 
“wonder drugs,” leading to the deaths of many people who would 
once have been successfully treated. Even tuberculosis, which had 
virtually disappeared from developed countries, has reemerged 
around the world in a form referred to as XDR (extremely drug 
resistant) that is virtually untreatable. It is feared that XDR-TB is 
on the verge of becoming a major global health crisis. To make 
matters worse, the pharmaceutical industry has drastically cut 
resources devoted to the development of new antibiotics. This 
change of action by the drug industry is generally attributed to 
a lack of financial incentives: (1) Antibiotics are taken only for a 
short period of time (as opposed to drugs prescribed for chronic 
conditions such as diabetes or depression). (2) Unlike other types 
of drugs that retain their effectiveness (efficacy), new antibiotics 
run the risk of having a relatively short lifetime in the marketplace 
as bacteria become resistant to each successive product. (3) The 
most effective antibiotics are held back from widespread use and 
are kept instead as weapons of last resort when other drugs have 
failed. Ideas for the formation of nonprofit institutions for the 
development of new antibiotics have been widely discussed. 
Here, we will look briefly at the mechanism of action of 
antibiotics—particularly those that target enzymes, which are the 


subjects of this chapter—and the development of bacterial resis- 
tance. Most antibiotics are derived from natural products that are 
produced by microorganisms living in the soil and readily culti- 
vated in the lab. Table 1 lists the major classes of antibiotics, exam- 
ples of each class, the targets in the bacterial cells on which these 
compounds act, and the mechanism by which bacteria have devel- 
oped resistance to that class of compounds. We will first look at 
several of the main types of targets: 


1. Enzymes involved in the synthesis of the bacterial cell wall. 
Penicillin and its derivatives (e.g., methicillin) are structural ana- 
logues of the substrates of a family of transpeptidases that cat- 
alyze the final cross-linking reactions that give the cell wall its 
rigidity. If these reactions do not occur, a rigid cell wall fails to 
develop. Penicillin is an irreversible inhibitor of the transpepti- 
dases; the antibiotic fits into the active site of these enzymes, 
forming a covalently bound complex that cannot be dislodged. 
The antibiotic vancomycin inhibits transpeptidation by binding 
to the peptide substrate of the transpeptidase, rather than to the 
enzyme itself. Normally, the transpeptidase substrate terminates 
in a D-alanine-p-alanine dipeptide. To become resistant to van- 
comycin, a bacterial cell must synthesize an alternate terminus 
that doesn't bind the drug, which is a roundabout process that 
requires the acquisition of several new enzymatic activities. As 
a result, vancomycin is the antibiotic to which bacteria have 
proven least able to develop resistance and thus is usually given 
as a last resort when other antibiotics have failed. Unfortunately, 
vancomycin-resistant strains of several pathogenic bacteria, 
including Staphylococcus aureus, have emerged in recent years. 


TABLE 1 Antibiotics in Clinical Use and Modes of Resistance 


Antibiotic class 


Examples 


Target 


Mode of resistance 


B-lactams 


Penicillins (ampicillin) 
Cephalosporins (cephamycin) 
Penems (meropenem) 
Monobactams (aztreonam) 


Peptidoglycan biosynthesis 


Hydrolysis 
Efflux 
Altered target 


Aminoglycosides 


Gentamicin 
Streptomycin 
Spectinomycin 


Translation 


Phosphorylation 
Acetylation 
Nucleotidylation 
Efflux 

Altered target 


Glycopeptides 


Vancomycin 
Teicoplanin 


Peptidoglycan biosynthesis 


Reprogramming of peptidoglycan 
biosynthesis 


Tetracyclines 


Minocycline 
Tigecycline 


Translation 


Monooxygenation 
Efflux 
Altered target 


Marcrolides 


Erythromycin 
Azithromycin 


Translation 


Hydrolysis 
Glycosylation 
Phosphorylation 
Efflux 

Altered target 


Lincosamides 


Clindamycin 


Translation 


Nucleotidylation 
Efflux 
Altered target 


Streptogramins 


Oxazolidinones 


Phenicols 


Synercid 


Linezolid 


Chloramphenicol 


Translation 


Translation 


Translation 


C—O lyase (Type B streptogramins) 
Acetylation (Type A streptogramins) 
Efflux 

Altered target 


Efflux 
Altered target 


Acetylation 
Efflux 
Altered target 


Quinolones 


Ciprofloxacin 


DNA replication 


Acetylation 
Efflux 
Altered target 


Pyrimidines 


Trimethoprim 


DNA replication 


Efflux 
Altered target 


Sulfonamides 


Sulfamethoxazole 


DNA replication 


Efflux 
Altered target 


Rifamycins 


Rifampin 


Transcription 


ADP-ribosylation 
Efflux 
Altered target 


Lipopeptides 
Cationic peptides 


From Annual Review Genet 2010, 44:27, Enzymology of Antibiotic Resistance, M. Moror and G. D. Wright. Reproduced with permission of ANNUAL REVIEWS, INC. 


Daptomycin 
Colistin 


in the format Textbook via Copyright Clearance Center. 


Cell membrane 
Cell membrane 


Altered target 
Altered target 
Efflux 


S. aureus is a common inhabitant of the skin and nasal pas- 
sages. Although it is usually relatively harmless, this organism 
is the most frequent cause of life-threatening infections that 
develop in hospitalized patients who have open wounds or 
invasive tubes. For years, methicillin-resistant S. aureus (MRSA) 
strains, which are also resistant to many other antibiotics, have 
created havoc in hospital wards. Infections by this one path- 
ogen kill more than 10,000 patients every year in the United 
States alone, which is greater than the number of deaths in the 
United States from AIDS. MRSA infections have also begun to 
appear in community settings, such as high school gyms or chil- 
dren's daycare centers, where there is a lot of contact between 
individuals. In many cases, vancomycin is the only drug that can 


stop these infections. Consequently, it was alarming to discover 
that MRSA can become resistant to vancomycin by acquiring a 
cluster of vancomycin-resistance genes from another bacterium 
(E. faecium), which is also a common cause of hospital-based 
infections. So far, vancomycin-resistant MRSA cases are rare 
and these bacteria have not been able to gain a foothold in 
either a hospital or community setting, but it may only be a 
matter of time. For this reason, infectious disease specialists 
have urged hospitals to institute better hygiene programs and 
to isolate patients at the first sign of infection. These practices 
have been proven to reduce the incidence of lethal infections 
where they have been put into place and have cut the MRSA 
death rate by 50 percent in recent years. 


continued 
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2. Components of the system by which bacteria duplicate, 
transcribe, and translate their genetic information. Although 
bacterial and eukaryotic cells have a similar system for storing 
and utilizing genetic information, there are many differences 
between the two types of cells that pharmacologists can take 
advantage of. Streptomycin and the tetracyclines, for example, 
bind to bacterial ribosomes, but not eukaryotic ribosomes. 
Quinolones, such as ciprofloxacin (brand name Cipro), are a 
rare example of wholly synthetic antibiotics (i.e., they are not 
based on natural products). Quinolones inhibit the enzyme 
DNA gyrase, which is required for bacterial DNA replication. 


3. Enzymes that catalyze metabolic reactions specifically in 
bacteria. Sulfa drugs, for example, are effective antibiotics 
because they closely resemble the compound p-aminobenzoic 
acid (PABA), 


HN—(_ )—COoH HN—(_ )—S0,—NH—R 


PABA Sulfa drugs 


which bacteria convert enzymatically to the essential coen- 
zyme folic acid. Because humans lack a folic acid-synthesizing 
enzyme, they must obtain this essential coenzyme in their 
diet and, consequently, sulfa drugs have no effect on human 
metabolism. 


Nearly all of the “new” antibiotics approved for use in recent 
years are derivatives of existing compounds that have been modi- 
fied chemically in the laboratory. In other words, these compounds 
possess the same core or scaffold as the compound from which 
they are derived and thus can be considered members of the 
same chemical class. Existing drugs are chemically modified in an 
attempt to make them less vulnerable to the mechanisms of resis- 
tance employed by the bacteria they target. It was hoped that the 
sequencing of the genomes of pathogenic bacteria beginning in 
1995 would lead to the identification of a host of new drug targets, 
but this has not been the case. Remarkably, there have only been 
two new classes of antibiotics developed and approved since 
1963. One of these two new classes includes the antibiotic linezolid 
(brand name Zyvox), introduced in 2000, which acts specifically on 
bacterial ribosomes and interferes with protein synthesis. The other 
new class of antibiotics, introduced in 2003 and represented by 
daptomycin (brand name Cubicin), are cyclic lipopeptides, which 
disrupt bacterial membrane function. Many researchers are hop- 
ing that linezolid and daptomycin will be used sparingly so that 
resistance can be kept to a minimum; however, linezolid-resistant 
MRSA has already started to crop up in hospitals, and instances 
of daptomycin-resistant E. faecium are becoming more and more 
common. There is thus great concern that we are rapidly entering 
a “post antibiotic age,” in which thousands or millions of people 
will once again start to die from minor cuts and scrapes because 
there are no longer any antibiotics that are effective. 
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Bacteria become resistant to antibiotics through a number 
of distinct mechanisms, many of which can be illustrated using 
penicillin as an example. Penicillin is a B-lactam; that is, it contains 
a four-membered B-lactam ring (shown in color). 

By the 1940s, researchers had discovered that certain bacteria 
possess an enzyme called B-lactamase (or penicillinase) that can 
open the lactam ring, rendering the compound harmless to the 
bacterium. At the time when penicillin was first introduced as an 
antibiotic during World War Il, none of the major disease-causing 
bacteria possessed a gene for B-lactamase. This can be verified by 
examining the DNA of bacteria descended from laboratory cultures 
that were started in the preantibiotic era. Today, the B-lactamase 
gene is present in a wide variety of infectious bacteria, and the 
production of B-lactamase by these cells is the primary cause of 
penicillin resistance. How did these species acquire the gene? 

The widespread occurrence of the B-lactamase gene illus- 
trates how readily genes can spread from one bacterium to 
another, not only among the cells of a given species, but between 
species. There are several ways this can happen, including conju- 
gation (shown in Figure 1.13), in which DNA is passed from one 
bacterial cell to another; transduction, in which a bacterial gene 
is carried from cell to cell by a virus; and transformation, in which 
a bacterial cell is able to pick up naked DNA from its surrounding 
medium. Pharmacologists have attempted to counter the spread 
of B-lactamase by synthesizing penicillin derivatives that are more 
resistant to the hydrolytic enzyme. Methicillin, which is no longer 
in use, was first produced in 1959 as a B-lactamase-resistant anti- 
biotic. As might be expected, natural selection leads rapidly to 
the evolution of bacteria whose B-lactamase can hydrolyze new 
forms of the antibiotic. 

Notall penicillin-resistant bacteria have acquired a B-lactamase 
gene. Some are resistant because they possess modifications 
in their cell walls that block entry of the antibiotic; others are resis- 
tant because they possess an efflux pump that allows them to export 
the antibiotic once it has entered the cell; still others are resistant 
because they possess an altered target, such as a modified trans- 
peptidase that fails to bind the antibiotic. Bacterial meningitis, for 
example, is caused by the bacterium Neisseria meningitidis, which 
has yet to display evidence it has acquired B-lactamase. Yet these 
bacteria are becoming resistant to penicillin because their trans- 
peptidases have lost an affinity for the antibiotics as the result of 
mutations in the gene that encodes the enzyme. 

The problem of drug resistance is not restricted to bacterial 
diseases, but has also become a major issue in the treatment of 
AIDS. Unlike bacteria, whose DNA-replicating enzymes operate at 
a very high level of accuracy, the replicating enzyme of the AIDS 
virus (HIV), called reverse transcriptase, makes a large number 
of mistakes, which leads to a high rate of mutation. This high 
error rate (about one mistake for every 10,000 bases duplicated), 
combined with the high rate of virus production (510° virus 
particles produced in a person per day), makes it very likely that 
drug-resistant variants will emerge within an individual as the 
infection progresses. This problem is being combatted: 


e By having patients take several different drugs targeted at dif- 
ferent viral enzymes. This greatly reduces the likelihood that a 
variant will emerge that is resistant to all of the drugs. 


@ By designing drugs that interact with the most highly con- 
served portions of each targeted enzyme, that is, those por- 
tions where mutations are most likely to produce a defective 
enzyme. This point underscores the importance of knowing the 
structure and function of the target enzyme and the manner in 
which potential drugs interact with that target. 


3.9 An Overview of Metabolism 


Metabolism is the collection of biochemical reactions that occur 
within a cell, which includes a tremendous diversity of molecular 
conversions. Most of these reactions can be grouped into meta- 
bolic pathways containing a sequence of chemical reactions in 
which each reaction is catalyzed by a specific enzyme, and the prod- 
uct of one reaction is the substrate for the next. The enzymes consti- 
tuting a metabolic pathway are usually confined to a specific region 
of the cell, such as the mitochondria or the cytosol. Increasing evi- 
dence suggests that the enzymes of a metabolic pathway are often 
physically linked to one another, a feature that allows the product of 
one enzyme to be delivered directly as a substrate to the active site of 
the next enzyme in the reaction sequence. 

The compounds formed in each step along the pathway are 
metabolic intermediates (or metabolites) that lead ultimately to 
the formation of an end product. End products are molecules with a 
particular role in the cell, such as an amino acid that can be incorpo- 
rated into a polypeptide or a sugar that can be consumed for its 
energy content. The metabolic pathways of a cell are interconnected 
at various points so that a compound generated by one pathway may 
be shuttled in a number of directions depending on the requirements 
of the cell at the time. We will concentrate in this section on aspects 
of metabolism that lead to the transfer and utilization of chemical 
energy, for this topic is one we will draw on throughout the book. 

Metabolic pathways can be divided into two broad types. 
Catabolic pathways lead to the disassembly of complex molecules 
to form simpler products. Catabolic pathways serve two functions: 
They make available the raw materials from which other molecules 
can be synthesized, and they provide chemical energy required for 
the many activities of a cell. As will be discussed at length, energy 
released by catabolic pathways is stored temporarily in two forms: as 
high-energy phosphates (primarily ATP) and as high-energy elec- 
trons (primarily in NADPH). Anabolic pathways lead to the syn- 
thesis of more complex compounds from simpler starting materials. 
Anabolic pathways are energy-requiring and utilize chemical energy 
released by the exergonic catabolic pathways. 

FIGURE 3.22 shows a greatly simplified profile of the ways in 
which the major anabolic and catabolic pathways are intercon- 
nected. Macromolecules are first disassembled (hydrolyzed) into the 
building blocks of which they are made (stage I, Figure 3.22). Once 
macromolecules have been hydrolyzed into their components— 
amino acids, sugars, and fatty acids—the cell can reutilize the build- 
ing blocks directly to (1) form other macromolecules of the same 
class (stage I), (2) convert them into different compounds to make 
other products, or (3) degrade them further (stages II and III, Figure 
3.22) and extract a measure of their free-energy content. 

The pathways for degradation of the diverse building blocks of 
macromolecules vary according to the particular compound being 
catabolized. Ultimately, however, all of these molecules are con- 
verted into a small variety of compounds that can be metabolized 
similarly. Thus, even though substances begin as macromolecules 
having a very different structure, they are converted by catabolic 
pathways to the same low-molecular-weight metabolites. For this 
reason, catabolic pathways are said to be convergent. 

Remarkably, the chemical reactions and metabolic pathways 
described in this chapter are found in virtually every living cell, from 
the simplest bacterium to the most complex plant or animal. It is 
evident that these pathways appeared very early in, and have been 
retained throughout, the course of biological evolution. 


Oxidation and Reduction: 
A Matter of Electrons 


Both catabolic and anabolic pathways include key reactions in which 
electrons are transferred from one reactant to another. Reactions that 
involve a change in the electronic state of the reactants are called 
oxidation-reduction (or redox) reactions. Changes of this type 
involve the gain or loss of electrons. Consider the conversion of 
metallic iron (Fe°) to the ferrous state (Fe*), in which the iron atom 
loses a pair of electrons, thereby attaining a more positive state. When 
an atom loses one or more electrons, it is said to be oxidized. The reac- 
tion is reversible, meaning ferrous ions can be converted to metallic 
iron, a more negative state, by the acquisition of a pair of electrons. 
When an atom gains one or more electrons, it is said to be reduced. 

For metallic iron to be oxidized, there must be some substance 
to accept the electrons that are released. Conversely, for ferrous ions 
to be reduced, there must be some substance to donate the necessary 
electrons. In other words, the oxidation of one reactant must be 
accompanied by the simultaneous reduction of some other reactant, 
and vice versa. One possible reaction involving iron might be 


Fe? +Cu** = Fe”* +Cu° 


The substance that is oxidized during a redox reaction, that is, the 
one that loses electrons, is called a reducing agent, and the one 
that is reduced, that is, the one that gains electrons, is called an 
oxidizing agent. 

The oxidation or reduction of metals, such as iron or copper, 
involves the complete loss or gain of electrons. The same cannot occur 
with most organic compounds for the following reason: the oxidation 
and reduction of organic substrates during cellular metabolism involve 
carbon atoms that are covalently bonded to other atoms. As discussed 
in Chapter 2, when a pair of electrons is shared by two different atoms, 
the electrons are attracted more strongly to one of the two atoms of the 
polarized bond. In a C—H bond, the carbon atom has the strongest 
pull on the electrons; thus it can be said that the carbon atom is in a 
reduced state. In contrast, if a carbon atom is bonded to a more elec- 
tronegative atom, as in a C—O or C—N bond, the electrons are 
pulled away from the carbon atom, which is thus in an oxidized state. 
Since carbon has four outer-shell electrons it can share with other 
atoms, it can exist in a variety of oxidation states. This is illustrated by 
the carbon atom in a series of one-carbon molecules (FIGURE 3.23) 
ranging from the fully reduced state in methane (CH,) to the fully oxi- 
dized state in carbon dioxide (CO,). The relative oxidation state of an 
organic molecule can be roughly determined by counting the number 
of hydrogen versus oxygen and nitrogen atoms per carbon atom. As we 
will see shortly, the oxidation state of the carbon atoms in an organic 
molecule provides a measure of the molecule’s free-energy content. 


The Capture and Utilization of Energy 


The compounds that we use as chemical fuels to run our furnaces and 
automobiles are highly reduced organic compounds, such as natural 
gas (CH,) and petroleum derivatives. Energy is released when these 
molecules are burned in the presence of oxygen, converting the carbons 
to more oxidized states, as in carbon dioxide and carbon monoxide 
gases. The degree of reduction of a compound is also a measure of its 
ability to perform chemical work within the cell. The more hydrogen 
atoms that can be stripped from a “fuel” molecule, the more ATP that 
ultimately can be produced. Carbohydrates are rich in chemical energy 
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FIGURE 3.22 Three stages of metabolism. The catabolic pathways (green arrows rightward) converge to form common metabolites and lead to ATP 
synthesis in stage Ill. The anabolic pathways (blue arrows leftward) start from a few precursors in stage Ill and utilize ATP to synthesize a large variety of 
cellular materials. Metabolic pathways for nucleic acids are more complex and are not shown here. 


because they contain strings of (H—C—OH) units. Fats contain 
even greater energy per unit weight because they contain strings of 
more reduced (F]JH—(C—H}) units. In the following discussion we 


will concentrate on carbohydrates. 


As the sole building block of both starch and glycogen, glucose is 
a key molecule in the energy metabolism of both plants and animals. 
The free energy released by the complete oxidation of glucose is very 
large: 


C,H,,0, +60, >6CO, +6H,O 


AG* =-686 kcal / mol 


By comparison, the free energy required to convert ADP to ATP is 
relatively small: 


ADP+P,>ATP+H,O AG*=+47.3 kcal/mol 
It is evident from these numbers that the complete oxidation of a 
molecule of glucose to CO, and H,O can release enough energy to 
produce a large number of ATPs. As we will see in Chapter 5, up to 
about 36 molecules of ATP are formed per molecule of glucose oxi- 
dized under conditions that exist within most cells. For this many 
ATP molecules to be produced, the sugar molecule is disassembled 
in many small steps. Those steps in which the free-energy difference 
between the reactants and products is relatively large can be coupled 
to reactions that lead to the formation of ATP. 

There are basically two stages in the catabolism of glucose, and 
they are virtually identical in all aerobic organisms. The first stage, 


H H H HO 0) 
H—C—H ——~> H—C— OH — H—C>0 — H—C>—0 — C=—0 
H H 
Methane Methanol Formaldehyde Formic acid Carbon dioxide 
(CH,) (CH30H) (CH,0) (HCOOH) (CO,) 
Most Most 


reduced _—_——_———— |g oxidized 


state 


state 


— Covalent bond in which carbon atom has greater share of electron pair 
— Covalent bond in which oxygen atom has greater share of electron pair 


FIGURE 3.23 The oxidation state of a carbon atom depends on the other atoms to which it is bonded. Each carbon atom can form a maximum of 
four bonds with other atoms. This series of simple, one-carbon molecules illustrates the various oxidation states in which the carbon atom can exist. In its 
most reduced state, the carbon is bonded to four hydrogens (forming methane); in its most oxidized state, the carbon atom is bonded to two oxygens 


(forming carbon dioxide). 


glycolysis, occurs in the soluble phase of the cytoplasm (the cyto- 
sol) and leads to the formation of pyruvate. The second stage is the 
tricarboxylic acid (or TCA) cycle, which occurs within the mito- 
chondria of eukaryotic cells and the cytosol of prokaryotes and leads 
to the final oxidation of the carbon atoms to carbon dioxide. Most of 
the chemical energy of glucose is stored in the form of high-energy 
electrons, which are removed as substrate molecules are oxidized 
during both glycolysis and the TCA cycle. It is the energy of these 
electrons that is ultimately used to synthesize ATP. We will concen- 
trate in the following sections on the steps during glycolysis, the first 
stage in the oxidation of glucose, which occurs without the involve- 
ment of oxygen. This is presumably the pathway of energy capture 
utilized by our early anaerobic ancestors and remains the major ana- 
bolic pathway utilized by anaerobic organisms living today. We will 
complete the story of glucose oxidation in Chapter 5 when we dis- 
cuss the mitochondrion and its role in aerobic respiration. 


REVIEW | 


1. Why are catabolic pathways described as convergent, 
whereas anabolic pathways are described as 
divergent? 


o——____ 


3.10 Glycolysis and Fermentation 


The reactions of glycolysis and the enzymes that catalyze them are 
shown in FIGURE 3.24. Before discussing the specific reactions, an 
important point concerning the thermodynamics of metabolism can 
be made. In an earlier discussion, the difference between AG and 
AG*' was stressed; it is the AG for a particular reaction that deter- 
mines its direction in the cell. Actual measurements of the concen- 
trations of metabolites in the cell can reveal the value for AG of a 
reaction at any given time. FIGURE 3.25 shows the typical AG values 
measured for the reactions of glycolysis. In contrast to the AG’ values 
of Figure 3.24, all but three reactions have AG values of nearly zero; 
that is, they are near equilibrium. The three reactions that are far 
from equilibrium, which makes them essentially irreversible in the 
cell, provide the driving force that moves the metabolites through 
the glycolytic pathway in a directed manner. 

In 1905, two British chemists, Arthur Harden and William 
Young, were studying glucose breakdown by yeast cells, a process that 
generates bubbles of CO, gas. Harden and Young noted that the bub- 
bling eventually slowed and stopped, even though there was plenty of 
glucose left to metabolize. Apparently, some other essential compo- 
nent of the broth was being exhausted. After experimenting with a 
number of substances, the chemists found that adding inorganic 
phosphates started the reaction going again. They concluded that the 
reaction was exhausting phosphate, the first clue that phosphate 
groups played a role in metabolic pathways. The importance of the 
phosphate group is illustrated by the initial reactions of glycolysis. 

Glycolysis begins with the linkage of the sugar to a phosphate 
group (step 1, Figure 3.24) at the expense of one molecule of ATP. The 
use of ATP at this stage can be considered an energy investment— 
the cost of getting into the glucose oxidation business. Phosphorylation 
activates the sugar, making it capable of participating in subsequent 
reactions whereby phosphate groups are moved around and trans- 
ferred to other acceptors. Phosphorylation also traps the glucose 
inside the cell, because the phosphorylated glucose cannot cross the 


cell membrane. This allows the cell to take up more glucose from the 
surrounding environment. Glucose 6-phosphate is converted to fruc- 
tose 6-phosphate and then to fructose 1, 6-bisphosphate at the expense 
of a second molecule of ATP (steps 2, 3). The six-carbon bisphosphate 
is split into two three-carbon monophosphates (step 4), which sets the 
stage for the first exergonic reaction to which the formation of ATP 

can be coupled. Let us turn now to the formation of ATP. 


ATP Production in Glycolysis 


ATP is formed in two basically different ways, both illustrated by one 
chemical reaction of glycolysis: the conversion of glyceraldehyde 
3-phosphate to 3-phosphoglycerate (steps 6, 7). The overall reaction 
is an oxidation of an aldehyde to a carboxylic acid (as in Figure 3.23), 
and it occurs in two steps catalyzed by two different enzymes 
(Figure 3.24). The first of these enzymes requires a nonprotein 
cofactor (a coenzyme), called nicotinamide adenine dinucleotide 
(NAD), to catalyze the reaction. As will be evident in this and 
following chapters, NAD plays a key role in energy metabolism by 
accepting and donating electrons. The first reaction (FIGURE 3.26a, b) 
is an oxidation-reduction in which two electrons and a proton 
(equivalent to a hydride ion, : H” ) are transferred from glyceraldehyde 
3-phosphate (which becomes oxidized) to NAD* (which becomes 
reduced). The reduced form ofthe coenzymeis NADH (FIGURE 3.27). 
An enzyme that catalyzes this type of reaction is termed a dehydro- 
genase; the enzyme catalyzing the above reaction is glyceraldehyde 
3-phosphate dehydrogenase. The NAD* molecule, which is derived 
from the vitamin niacin, acts as a loosely associated coenzyme to the 
dehydrogenase in position to accept the hydride ion (i.e., both the 
electrons and the proton). The NADH formed in the reaction is then 
released from the enzyme in exchange for a fresh molecule of NAD”*. 

We will return to this reaction in a moment, but first we will 
continue with the consequences of the formation of NADH. 
NADH is considered a high-energy compound because of the ease 
with which it is able to transfer electrons to other electron-attracting 
molecules. (NADH is said to have a high electron-transfer potential 
relative to other electron acceptors in the cell; see Table 5.1). 
Electrons are usually transferred from NADH through a series of 
membrane-embedded electron carriers that constitute an electron- 
transport chain. As the electrons move along this chain, they move to 
a lower and lower free-energy state and are ultimately passed to 
molecular oxygen, reducing it to water. The energy released during 
electron transport is utilized to form ATP bya process called oxida- 
tive phosphorylation. Electron transport and oxidative phosphoryla- 
tion will be explored in detail in Chapter 5. 

In addition to the indirect route for ATP formation involving 
NADH and an electron-transport chain, the conversion of glyceral- 
dehyde 3-phosphate to 3-phosphoglycerate includes a direct route 
for ATP formation. In the second step of this overall reaction (step 7, 
Figures 3.24 and 3.26c), a phosphate group is transferred from 1, 3- 
bisphosphoglycerate to ADP to form a molecule of ATP. The reaction 
is catalyzed by the enzyme phosphoglycerate kinase. This direct route of 
ATP. formation is referred to as substrate-level phosphorylation 
because it occurs by transfer of a phosphate group from one of the 
substrates (in this case, 1,3-bisphosphoglycerate) to ADP . The remain- 
ing reactions of glycolysis (steps 8-10), including a second substrate- 
level phosphorylation of ADP (step 10), are indicated in Figure 3.24. 

The substrate-level phosphorylation of ADP illustrates an 
important point about ATP. Its formation is not that endergonic. In 
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FIGURE 3.24 The steps of glycolysis. 


Glucose 


Free-energy change (AG) in the cell 
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FIGURE 3.25 Free-energy profile of glycolysis in cardiac muscle tissue. 
The step numbers correspond to the reactions of glycolysis shown in Figure 
3.24. All reactions are at or near equilibrium except those catalyzed by 
hexokinase, phosphofructokinase, and pyruvate kinase (reactions 1, 3, and 
10), which exhibit large differences in free energy. 

Source: From Voet, Voet, Pratt, Fund of Biochemistry 4E. This material is 
reproduced with permission of John Wiley & Sons, Inc. 


other words, ATP can be readily formed by metabolic reactions. 
There are numerous phosphorylated molecules whose hydrolysis 
has a more negative AG” than that of ATP. FIGURE 3.28 illustrates 
the relative AG°’s for the hydrolysis of several phosphorylated 
compounds. Any donor higher on the scale can be used to 
phosphorylate any molecule lower on the scale, and the AG” of this 
reaction will be equal to the difference between the two values given 
in the figure. For example, the AG®’ for the transfer of a phosphate 


for use in fermentation 
of pyruvate or in 
oxidative phosphorylation 
(see Figure 5.5) 


group from 1, 3-bisphosphoglycerate to ADP to form ATP is equal 
to —4.5kcal/mol(—11.8kcal/mol+7.3kcal/mol). This concept of 
transfer potential is useful for comparing any series of donors 
and acceptors regardless of the group being transferred, whether 
protons, electrons, oxygen, or phosphate groups. Those molecules 
higher on the scale, that is, ones with greater free energy (larger 
—AG*'s), are molecules with less affinity for the group being trans- 
ferred than are ones lower on the scale. The less the affinity, the bet- 
ter the donor; the greater the affinity, the better the acceptor. 

An important feature of glycolysis is that it can generate a lim- 
ited number of ATP molecules in the absence of oxygen. Neither the 
substrate-level phosphorylation of ADP by 1,3-bisphosphoglycerate 
nor a later one by phosphoenolpyruvate (step 10, Figure 3.24) 
requires molecular oxygen. Thus, glycolysis can be considered an 
anaerobic pathway to ATP production, indicating that it can pro- 
ceed in the absence of molecular oxygen to continue to provide ATP. 
Two molecules of ATP are produced by substrate-level phospho- 
rylation during glycolysis from each molecule of glyceraldehyde 
3-phosphate oxidized to pyruvate. Because each molecule of glucose 
produces two molecules of glyceraldehyde 3-phosphate, four ATP 
molecules are generated per molecule of glucose oxidized to pyru- 
vate. On the other hand, two molecules of ATP must be hydrolyzed 
to initiate glycolysis, leaving a net profit for the cell of two molecules 
of ATP per glucose oxidized. The net equation for the reactions of 
glycolysis can be written 


Glucose +2ADP + 2P, +2NAD* > 
2 pyruvate + 2ATP +2NADH +2H*+2H,O 


H 
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FIGURE 3.27 The structure of NAD* and its reduction to NADH. When 
the 2’ OH of the ribose moiety (indicated by the purple colored screen) is 
covalently bonded to a phosphate group, the molecule is NADP*/NADPH, 
whose function is discussed later in the chapter. 


phosphorylation in which a phosphate group is transferred 
from a substrate molecule, in this case 1,3-bisphosphoglycer- 
ate, to ADP to form ATP. 
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FIGURE 3.28 Ranking compounds by phosphate transfer potential. 
Those phosphorylated compounds higher on the scale (ones with a more 
negative AG°’ of hydrolysis) have a lower affinity for their phosphate 
group than those compounds lower on the scale. As a result, compounds 
higher on the scale readily transfer their phosphate group to form 
compounds that are lower on the scale. Thus, phosphate groups can be 
transferred from 1,3-bisphosphate or phosphoenolpyruvate to ADP during 


glycolysis. 
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Pyruvate, the end product of glycolysis, is a key compound because 
it stands at the junction between anaerobic (oxygen-independent) 
and aerobic (oxygen-dependent) pathways. In the absence of molecular 
oxygen, pyruvate is subjected to fermentation, as discussed in the 
following section. When oxygen is available, pyruvate is further 
catabolized by aerobic respiration, as discussed in Chapter 5. 


Anaerobic Oxidation of Pyruvate: 
The Process of Fermentation 


We have seen that glycolysis is able to provide a cell with a small net 
amount of ATP for each molecule of glucose converted to pyruvate. 
However, glycolytic reactions occur at rapid rates so that a cell is able 
to produce a significant amount of ATP using this pathway. In fact, 
a number of cells, including yeast cells, tumor cells, and muscle cells, 
rely heavily on glycolysis as a means of ATP formation. There is, 
however, a problem that must be confronted by such cells. One of the 
products of the oxidation of glyceraldehyde 3-phosphate is NADH. 
The formation of NADH occurs at the expense of one of the reac- 
tants, NAD", which is in short supply in cells. Because NAD* isa 
required reactant in this important step of glycolysis, it must be 
regenerated from NADH. If not, the oxidation of glyceraldehyde 
3-phosphate can no longer take place, nor can any of the succeeding 
reactions of glycolysis. However, without oxygen, NADH cannot be 
oxidized to NAD* by means of the electron-transport chain because 
oxygen is the final electron acceptor of the chain. Cells are able, how- 
ever, to regenerate NAD" by fermentation, the transfer of elec- 
trons from NADH to pyruvate, the end product of glycolysis, or to 
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FIGURE 3.29 Fermentation. Most cells carry out aerobic respiration, which 
depends on molecular oxygen. If oxygen supplies should diminish, as occurs 
in a skeletal muscle cell undergoing strenuous contraction or a yeast cell 
living under anaerobic conditions, these cells are able to regenerate NAD* 
by fermentation. Muscle cells accomplish fermentation by formation of 
lactate, whereas yeast cells do so by formation of ethanol. Aerobic oxidation 
of pyruvate by means of the TCA cycle is discussed at length in Chapter 5. 


a compound derived from pyruvate (FIGURE 3.29). Like glycolysis, 
fermentation takes place in the cytosol. For most organisms, which 
depend on O,, fermentation is a stopgap measure to regenerate 
NAD* when O, levels are low, so that glycolysis can continue and 
ATP production can be maintained. 

The product of fermentation varies from one type of cell or organ- 
ism to another. When muscle cells are required to contract repeatedly, 
the oxygen level becomes too low to keep pace with the cell’s metabolic 
demands. Under these conditions, skeletal muscle cells regenerate 
NAD” by converting pyruvate to lactate. When oxygen once again 
becomes available in sufficient amounts, the lactate can be converted 
back to pyruvate for continued oxidation. Yeast cells have faced the 
challenges of anaerobic life with a different metabolic solution: They 
convert pyruvate to ethanol, as illustrated in Figure 3.29. 

Although fermentation is a necessary adjunct to metabolism in 
many organisms, and the sole source of metabolic energy in some 
anaerobes, the energy gained by glycolysis alone is meager when 
compared with the complete oxidation of glucose to carbon dioxide 
and water. When a mole of glucose is completely oxidized, 686 kilo- 
calories are released. In contrast, only 57 kilocalories are released 
when this same amount of glucose is converted to ethanol under 
standard conditions, and only 47 kilocalories are released when it is 
converted to lactate. In any case, only two molecules of ATP are 
formed per glucose oxidized by glycolysis and fermentation; over 
90 percent of the energy is simply discarded in the fermentation 
product (as evidenced by the flammability of ethyl alcohol). 

During the early stages of life on Earth, when oxygen had not 
yet appeared, glycolysis and fermentation were probably the primary 
metabolic pathways by which energy was extracted from sugars by 
primitive prokaryotic cells. Following the appearance of cyanobacte- 
ria, the oxygen of the atmosphere rose dramatically, and the evolu- 
tion of an aerobic metabolic strategy became possible. As a result, 
the products of glycolysis could be completely oxidized, and much 
more ATP could be harvested. We will see how this is accomplished 
in Chapter 5 when we discuss the structure and function of the 
mitochondrion. 


REVIEW | 


1. Compare the energy obtained by a cell that oxidizes 
glucose anaerobically and aerobically. What is the 
difference in the end products of these two types of 
metabolism? 


2. Explain what is meant by phosphate transfer potential. 
How does the phosphate transfer potential of 
phosphoenolpyruvate compare with that of ATP? 
What does this mean thermodynamically (i.e., in terms 
of relative AG*s of hydrolysis)? What does it mean in 
terms of affinity for phosphate groups? 


3. In the reaction in which glyceraldehyde 3-phosphate is 
oxidized to 3-phosphoglycerate, which compound is 
acting as the reducing agent? Which as the oxidizing 
agent? Which compound is being oxidized? Which is 
being reduced? 


4. Which reactions of glycolysis are coupled to ATP 
hydrolysis? Which reactions involve substrate-level 
phosphorylation? Which reactions depend on either 
fermentation or aerobic respiration to continue? 

i 


3.11 Reducing Power 


The energy required to synthesize complex biological molecules, 
such as proteins, fats, and nucleic acids, is derived largely from the 
ATP generated by glycolysis and electron transport. But many of 
these materials, particularly fats and other lipids, are more reduced 
than the metabolites from which they are built. Synthesis of fats 
requires the reduction of metabolites, which is accomplished by the 
transfer of high-energy electrons from NADPH, a compound simi- 
lar in structure to NADH, but containing an additional phosphate 
group (described in the legend to Figure 3.27). A cell’s reservoir of 
NADPH represents its reducing power, which is an important 
measure of the cell’s usable energy content. The use of NADPH can 
be illustrated by one of the key reactions of photosynthesis: 


O O 


I 
oer | 
HE—0H + NADPH > HC—0OH + NADP* + P, 


CH,OPO3- CH,OPO3- 


1,3-Bisphosphoglycerate Glyceraldehyde 3-phosphate 
In this reaction, a pair of electrons (together with a proton) are 
transferred from NADPH to the substrate 1,3-bisphosphoglycerate, 
reducing a carbon atom (indicated in red). 

The oxidized form of NADPH, NADP”, is formed from NAD* 
in the following reaction: 


NAD* + ATP = NADP* + ADP 


NADPH can then be formed by the reduction of NADP”. Like 
NADH, NADPH is a high-energy compound because of its high 
electron-transfer potential. The transfer of free energy in the form of 
these electrons raises the acceptor to a more reduced, more energetic 
state. 

The separation of reducing power into two distinct but related 
molecules, NADH and NADPH, reflects a separation of their pri- 
mary metabolic roles. NADH and NADPH are recognized as coen- 
zymes by different enzymes. Enzymes having a reductive role in 
anabolic pathways utilize NADPH as their coenzyme, whereas 
enzymes acting as dehydrogenases in catabolic pathways utilize 
NAD‘. Even though they are used differently, one coenzyme can 
reduce the other in the following reaction, which is catalyzed by the 
enzyme transhydrogenase: 


NADH+NADP* = NAD* + NADPH 


When energy is abundant, the production of NADPH is favored, 
providing a supply of electrons needed for biosynthesis of new mac- 
romolecules that are essential for growth. When energy resources 
are scarce, however, most of the high-energy electrons of NADH are 
“cashed in” for ATP, and only enough NADPH is generated to meet 
the cell’s minimal biosynthetic requirements. 


REVIEW | 


1. Why is reducing power considered a form of energy? 
| 


3.12 Metabolic Regulation 


The amount of ATP present in a cell at a given moment is surpris- 
ingly small. A bacterial cell, for example, contains approximately 
one million molecules of ATP, whose half-life is very brief (on the 
order of a second or two). With so limited a supply, it is evident 
that ATP is not a molecule in which a large total amount of free 
energy is stored. The energy reserves of a cell are stored instead as 
polysaccharides and fats. When the levels of ATP start to fall, 
reactions are set in motion to increase ATP formation at the 
expense of the energy-rich storage forms. Similarly, when ATP lev- 
els are high, reactions that would normally lead to ATP production 
are inhibited. Cells regulate these important energy-releasing reac- 
tions by controlling certain key enzymes in a number of metabolic 
pathways. 

The function of a protein is closely related to its structure (i.e., 
conformation). It is not surprising, therefore, that cells regulate pro- 
tein activity by modifying the conformation of key protein mole- 
cules. In the case of enzymes, regulation of catalytic activity is 
centered on the modification of the structure of the active site. Two 
common mechanisms for altering the shape of an enzyme’s active 
site are covalent modification and allosteric modulation, both of 
which play key roles in regulating glucose oxidation.* 


Altering Enzyme Activity 
by Covalent Modification 


During the mid-1950s, Edmond Fischer and Edwin Krebs of the 
University of Washington were studying glycogen phosphorylase, an 
enzyme found in muscle cells that disassembles glycogen into its glu- 
cose subunits. The enzyme could exist in either an inactive or an 
active state. Fischer and Krebs prepared a crude extract of muscle 
cells and found that inactive enzyme molecules in the extract could 
be converted to active ones simply by adding ATP to the test tube. 
Further analysis revealed a second enzyme in the extract—a “con- 
verting enzyme,’ as they called it—that transferred a phosphate 
group from ATP to one of the 841 amino acids that make up the 
glycogen phosphorylase molecule. The presence of the phosphate 
group altered the shape of the active site of the enzyme molecule and 
increased its catalytic activity. 

Subsequent research has shown that covalent modification of 
enzymes, as illustrated by the addition (or removal) of phosphates, is 
a general mechanism for changing the activity of enzymes. Enzymes 
that transfer phosphate groups to other proteins are called protein 
kinases and regulate such diverse activities as hormone action, cell 
division, and gene expression. The “converting enzyme” discovered 
by Krebs and Fischer was later named phosphorylase kinase, and its 
regulation is discussed in Section 15.8. There are two basically dif- 
ferent types of protein kinases: One type adds phosphate groups to 
specific tyrosine residues in a substrate protein; the other type adds 
phosphates to specific serine or threonine residues in the substrate. 
The importance of protein kinases is reflected in the fact that 
approximately 2 percent of all the genes of a yeast cell (113 out of 
about 6200) encode members of this class of enzymes. 


‘Metabolism is also regulated by controlling the concentrations of enzymes. The 
relative rates at which enzymes are synthesized and degraded are considered in 
subsequent chapters. 
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FIGURE 3.30 Feedback inhibition. The flow of metabolites through a 
metabolic pathway stops when the first enzyme of the pathway (enzyme 
BC) is inhibited by the end product of that pathway (compound E), which 
binds to an allosteric site on the enzyme. Feedback inhibition prevents a 
cell from wasting resources by continuing to produce compounds that are 
no longer required. 


Altering Enzyme Activity 
by Allosteric Modulation 


Allosteric modulation is a mechanism whereby the activity of an 
enzyme is either inhibited or stimulated by a compound that binds 
to a site, called the allosteric site, that is spatially distinct from the 
enzyme’s active site. According to one view, the binding of a 
compound to the allosteric site sends a “ripple” through the protein, 
producing a defined change in the shape of the active site, which 
may be located on the opposite side of the enzyme or even on a dif- 
ferent polypeptide within the protein molecule. Depending on the 
particular enzyme and the particular allosteric modulator, the 
change in shape of the active site may either stimulate or inhibit its 
ability to catalyze the reaction. Allosteric modulation illustrates the 
intimate relationship between molecular structure and function. 
Very small changes in the structure of the enzyme induced by the 
allosteric modulator can cause marked changes in enzyme activity. 
Cells are highly efficient manufacturing plants that do not waste 
energy and materials producing compounds that are not utilized. One 
of the primary mechanisms cells use to shut down anabolic assembly 
lines is a type of allosteric modulation called feedback inhibition, in 
which the enzyme catalyzing the first committed step in a metabolic 


pathway is temporarily inactivated when the concentration of the end 
product of that pathway—an amino acid, for example—reaches a 
certain level. This is illustrated by the simple pathway depicted in 
FIGURE 3.30 in which two substrates, A and B, are converted to the end 
product E. As the concentration of the product E rises, it binds to the 
allosteric site of enzyme BC, causing a conformational change in the 
active site that decreases the enzyme’s activity. Feedback inhibition 
exerts immediate, sensitive control over a cell’s anabolic activity. 


REVIEW | 


1. How are the levels of AMP and ATP related in a cell? Is 
glycolysis inhibited by high ATP or high AMP? 
oe____——_ 


3.13 Separating Catabolic 
and Anabolic Pathways 


A brief consideration of the anabolic pathway leading to the synthe- 
sis of glucose (gluconeogenesis) will illustrate a few important 
aspects about synthetic pathways. Most cells are able to synthesize 
glucose from pyruvate at the same time they are oxidizing glucose 
as their major source of chemical energy. How are cells able to utilize 
these two opposing pathways? The first important point is that, 
even though enzymes can catalyze a reaction in both directions, the 
reactions of gluconeogenesis cannot proceed simply by the reversal 
of the reactions of glycolysis. The glycolytic pathway contains three 
thermodynamically irreversible reactions (Figure 3.25), and these 
steps must somehow be bypassed. Even if all the reactions of glycoly- 
sis could be reversed, it would be a very undesirable way for the cell 
to handle its metabolic activities, because the two pathways could 
not be controlled independently of one another. Thus, a cell could 
not shut down glucose synthesis and crank up glucose breakdown 
because the same enzymes would be active in both directions. 

If one compares the pathway by which glucose is degraded to 
the pathway by which glucose is synthesized, it is apparent that some 
of the reactions are identical, although they are running in opposite 
directions, while others are very different (steps 1 to 3, FIGURE 3.31). 
By using different enzymes to catalyze different key reactions in the 
two opposing pathways, a cell can solve both the thermodynamic 
and the regulatory problems inherent in being able to manufacture 
and degrade the same molecules. 

We can see this better by looking more closely at key enzymes of 
both glycolysis and gluconeogenesis. As indicated in step 2 of Figure 3.31, 
phosphofructokinase, an enzyme of glycolysis, catalyzes the reaction 


Fructose 6-phosphate + ATP = fructose 1,6-bisphosphate + ADP 


which has a AG” of —3.4kcal/ mol, making it essentially irreversible. 
The reaction has such a large —AG°’ because it is coupled to the 
hydrolysis of ATP. In gluconeogenesis, the formation of fructose 
6-phosphate is catalyzed by the enzyme fructose 1,6-bisphosphatase 
by a simple hydrolysis: 


fructose 1, 6-bisphosphate + H,O = 


fructose 6-phosphate+P, AG =-3.9kcal/mol 


The particular enzymes of glycolysis and gluconeogenesis 
described above are key enzymes in their respective pathways. These 
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FIGURE 3.31 Glycolysis versus gluconeogenesis. Whereas most of the 
reactions are the same in the two pathways, even though they run in 
opposite directions, the three irreversible reactions of glycolysis (steps 1 
to 3 here) are replaced in the gluconeogenic pathway by different, 
thermodynamically favored reactions. 


enzymes are regulated, in part, by AMP and ATP. Keep in mind 
that the concentration of AMP within cells is inversely related to the 
concentration of ATP; when ATP levels are low, AMP levels are 
high, and vice versa. Consequently, elevated AMP concentrations 
signal to a cell that its ATP fuel reserves are becoming depleted. 
Even though ATP happens to be a substrate of phosphofructoki- 
nase, ATP is also an allosteric inhibitor, while AMP is an allosteric 
activator. When ATP levels are high, the activity of the enzyme is 


decreased so that no additional ATP is formed by glycolysis. 111 


Conversely, when ADP and AMP levels are high relative to ATP, 
the activity of the enzyme is increased, which promotes the synthesis 
of ATP . In contrast, the activity of fructose 1,6-bisphosphatase, a key 
enzyme of gluconeogenesis, is inhibited by elevated levels of AMP. 

The importance of allosteric modulation in metabolic control is 
illustrated by recent studies on the enzyme AMP-activated protein 
kinase (AMPK). Like phosphorylase kinase discussed above, AMPK 
controls the activity of other enzymes by adding phosphate groups to 
specific serine or threonine residues within their structure. AMPK 
activity is itself controlled by posttranslational modifications; the 
enzyme is activated by addition of a phosphate group and inhibited 
by removal of that phosphate group. More importantly for the present 
discussion, AMPK is also activated by both AMP and ADP, the two 
alternate products of ATP hydrolysis. AMP activates AMPK by 
binding to an allosteric site on the enzyme, and ADP activates AMPK 
by blocking removal of the above-mentioned phosphate group. As 
AMP or ADP concentrations rise, AMPK is activated, causing it to 
phosphorylate and inhibit key enzymes involved in anabolic path- 
ways while at the same time phosphorylating and activating key 
enzymes in catabolic pathways. The end result of AMPK activation is 
a decrease in the activity of pathways that consume ATP and an 
increase in the activity of pathways that produce ATP, leading to an 
elevation in the concentration of ATP within the cell. 

AMPK is involved not only in cellular energy regulation but, at 
least in mammals, in regulation of energy balance within the whole 
body. In mammals, appetite is controlled by centers within the hypo- 
thalamus of the brain that respond to the levels of certain nutrients 
and hormones within the blood. A drop in blood glucose levels, for 
example, acts on the hypothalamus to stimulate appetite, which 
appears to be mediated through activation of AMPK in hypotha- 
lamic nerve cells. Conversely, the feeling of being “full” that we expe- 
rience after eating a meal is thought to be triggered by certain 
hormones (e.g., leptin secreted by fat cells) that inhibit the activity of 
AMPK in these same brain cells. Because of its role as an “energy 
thermostat”? AMPK has become a prime target in the development 
of drugs to treat both obesity and diabetes. The diabetes drug met- 
formin activates AMPK, leading to the suppression of gluconeogen- 
esis in the liver and a consequent decrease in blood glucose levels. 


REVIEW | 


1. What are two common mechanisms for regulating the 
activity of an enzymy by changing its active site 
conformation? 


oo 


3.14 THE HUMAN PERSPECTIVE 


Caloric Restriction and Longevity 


In Chapter 2, we discussed the theory that aging results from accu- 
mulated damage caused by free radicals. Here we will consider 
another aspect of aging, namely the observation that reduced 
caloric intake in many model animals can increase their life span 
and apparently slow the aging process itself. 


The metabolic web illustrated in Figure 3.22 touches every 
biochemical pathway in the cell and every aspect of human 
physiology, thus it does not seem surprising that alterations in 
overall metabolic activity could have large effects on our health 
and well-being. One important example is the potential link 
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FIGURE 1 Eating less food helps nematode worms to live longer. 
This graph, known as a Kaplan-Meier curve, shows the percent of 
animals that are still alive at a given number of days after birth. 
Eventually the curves drop to zero since there is an upper limit to how 
long any animal can live. Improved longevity is indicated by curves that 
are shifted to the right, because that means that a larger percentage of 
individuals are still alive at a given age than in normal animals. This plot 
shows four types of mutant worms that eat less because their pharynx, 
their feeding organ, does not work properly. For comparison, normal 
worms are plotted in the curve indicated with the white squares. 
Whereas normal worms live on average for around 20 days, the lifespan 
of the mutants woms that eat less is increased to 30 days. 

Source: B. Lakowski and S. Hekimi. 1998. The genetics of caloric 
restriction in Caenorhabditis elegans. Proc. Natl. Acad. Sci. U.S.A. 
95:13091-13096. 


between metabolic activity and lifespan. The initial idea that 
reduced caloric intake could lead to longer life came from studies 
in rats and mice. Feeding the animals 10-30 percent less food 
allowed them to have longer lives, with an increase in both the 
average lifespan as well as in the maximum lifespan. The effect 
was dramatic, but these studies left two questions unanswered. 
First, what is the molecular and cellular mechanism of the effect, 
and second, would the same effect be seen in humans? 

Studies of the mechanism of caloric restriction have exploited 
powerful model organisms like yeast (Saccharomyces cerevi- 
siae), fruit flies (Drosophila melanogaster), and nematode worms 
(Caenorhabditis elegans). These organisms have short life spans, 
so experiments can be done more quickly than in rats or mice and 
for far less money. In addition both models have powerful and 
convenient genetics that allows genes to be identified far more 
easily than in rodents. Careful experiments confirmed that caloric 
restriction increases life span in flies and worms, just as had been 
observed in rodents (FIGURE 1). Genetic experiments in these 
model systems have identified a variety of genes that mediate 
the caloric restriction effect on aging. One interesting outcome 


of these experiments has been that different genes are involved 
in the effect depending on when in life the caloric restriction is 
applied, suggesting for example that lifelong caloric restriction 
versus caloric restriction applied later in life might act through 
different genetic pathways. Model organism studies have also 
indicated that caloric restriction may increase the production of 
reactive oxygen. As discussed in Section 2.2, oxygen radicals 
have been long assumed to speed up aging but experiments 
have shown that the opposite can sometimes be true, and that 
oxygen radicals may actually provide protection against aging. 
The increased longevity achieved during caloric restriction may 
involve this latter type of protective effect. 

Regardless of the molecular mechanism, the possibility that 
aging could be delayed in humans by something as simple as a 
20 percent reduction in caloric intake is tremendously exciting. 
Although eating substantially fewer calories would be challeng- 
ing, as anyone who has been on a diet can attest, the promise 
of being able to delay or prevent many of the complications of 
aging would be a powerful lure. To ask whether caloric restriction 
was effective in animals more similar to humans than the rodents 
that had previously been used, two large studies were begun in 
the 1980s using rhesus monkeys, one at the Wisconsin National 
Primate Research Center (WNPRC) and one at the National 
Institute of Aging (NIA), part of the NIH. Because primates have 
much longer lifespans than mice or rats (not to mention worms 
or flies) these experiments had to be run for decades. The two 
studies gave different results: The WNPRC study showed an 
increased life span in the calorie-restricted monkeys while the 
NIA study showed no effect. One important difference between 
the two studies is that the NIA study fed the monkeys food based 
on natural ingredients while the WNPRC used a synthetic food 
made from purified components such as lactalbumin and sucrose. 
Assuming that more natural ingredients make for healthier food, 
perhaps cutting back on unhealthy food in the WNPRC study 
may have had a positive impact on life span by reducing deaths 
caused by heart disease and other ailments. 

What both the WNPRC and NIA studies agreed on was that 
indicators of overall health were improved in the calorie restricted 
monkeys (for example blood glucose levels). Thus it remains pos- 
sible that caloric restriction in humans could help us lead healthier 
and more active lives in old age, even if we don't actually live any 
longer. It is also important to remember that most people don't 
simply die of old age—usually something else gets us first like 
cancer or a heart attack. Given that obesity contributes to the 
development of many diseases such as diabetes, heart disease, 
and cancer, it might be a good idea for many of us to restrict 
our caloric intake anyway, regardless of whether it affects aging 
per se. 

Regardless of whether caloric restriction will prove effec- 
tive as a lifespan-increasing strategy, the studies relating caloric 
intake to aging teach us that metabolism can directly affect our 
lives and can be changed by the foods that we eat. 


Analytic Questions RA 


2. Feedback inhibition typically alters the activity of the first enzyme 
of a metabolic pathway rather than one of the latter enzymes of 
the pathway. Why is this adaptive? 


1. How would you expect a drop in pH to affect a reaction cata- 
lyzed by chymotrypsin? How might an increase in pH affect this 
reaction? 


10. 


11, 


12. 


. After reviewing the reactions of glutamine formation on page 88, 


explain why each of the following statements concerning the 
third (or overall) reaction is either true or false. 


a. If the reaction were written in reverse, its AG°’ would be 
+3.9 kcal/mol. 


b. If all reactants and products were at standard conditions at 
the beginning of an experiment, after a period of time, the 
[NH,]/[ADP] ratio would decrease. 


c. As the reaction proceeds, the AG moves closer to zero. 


d. At equilibrium, the forward and reverse reactions are equal 
and the [ATP]/[ADP] ratio becomes one. 


e. In the cell, it is possible for glutamine to be formed when the 
[glutamine]/[glutamic acid] ratio is greater than one. 


. You have just isolated a new enzyme and have determined the 


velocity of reaction at three different substrate concentrations. 
You find that the slope of the product versus time curve is the 
same for all three concentrations. What can you conclude about 
the conditions in the reaction mixture? Do you think the velocity 
would change if you changed the concentration of the enzyme 
instead of changing the concentration of the substrate? 


. Lysozyme is a slow-acting enzyme, requiring approximately two 


seconds to catalyze a single reaction. What is the turnover num- 
ber of lysozyme? 


. In the reaction S =P, if one mole of product (P) has the same 


free energy as one mole of substrate (S) , what is the value for the 
Ki‘, of this reaction? What is the value of the AG’? If you started 
with a1M solution of S, would it spontaneously react to give any P? 
What would the concentrations of S and P be when the reaction 
had reached equilibrium? (See Quantitative Tutorial Video). 


. What is meant in terms of concentration ratios when it is said 


that the AG of ATP hydrolysis in the cell is approximately 
—12 kcal/mol, whereas the AG” is —7.3 kcal/mol? 


. The enzymes under cellular regulation are those whose reactions 


typically proceed under nonequilibrium conditions. What would 
be the effect of allosteric inhibition of an enzyme that operated 
close to equilibrium? 


. Inthe reaction A =B, if the Keg is 10°, what is the AG°’? What is 


the AG” if the Reg had been determined to be 10 2? What is the 
K',, of the hexokinase reaction indicated in Figure 3.24 (step 1)? 
(See Quantitative Tutorial Video). 


Consider two reactions where AG®% for the first reaction is 
—1kcal/mol and AG® for the second reaction is —4 kcal/mol. 
Under standard conditions, both reactions proceed spontane- 
ously toward the formation of products indicating their AG 
must be negative. Is the K,, for the second reaction greater or 
less than the Ki, for the first reaction? By how much? 


If the reaction XA+Y—=XY+A has a AG%of+7.3kcal/mol, 
could this reaction be driven in the cell by coupling it to ATP 
hydrolysis? Why or why not? 

Suppose that two proteins, A and B, can associate with each 
other to form a complex. You measure the dissociation constant 
at several different temperatures and find that as you heat up the 
sample, the Ky decreases. Does this mean that binding of A and 


13. 


14. 


15. 


16. 


Ty. 


18. 


19: 


20. 


B becomes more favorable or less favorable as the temperature 
increases? 


The reaction of compound X with compound Y to produce com- 
pound Z is an unfavorable reaction (AG =+5kcal/mol). Draw 
the chemical reactions that would occur if ATP was utilized to 
drive the reaction. 


ATP has evolved as the central molecule in energy metabolism. 
Could 1,3-bisphosphoglycerate serve the same function? Why 
or why not? 


Calculate the AG for ATP hydrolysis in a cell in which the 
[ATP]/[ADP] ratio had climbed to 100:1, while the PR. concentration 
remained at 10MM. How does this compare to the ratio of 
[ATP]/[ADP] when the reaction is at equilibrium and the P, 
concentration remains at 10 mM? What would be the value for 
AG when the reactants and products were all at standard condi- 
tions (1M)? 


Consider the reaction: 


Glucose +P, = glucose 6-phosphate +H,O 
AG" = +3 kcal/mol 


What is the equilibrium constant, Se for this reaction? (Note: 


The concentration of water is ignored.) Does the positive AG™ in 
the above reaction mean that the reaction can never go sponta- 
neously from left to right? 


Under physiologic conditions, [Glucose]=5mM, [Glucose 6- 
phosphate] = 83 mM, and [P]=1mM. Under these conditions, 
will the reaction of problem 16 go spontaneously from left to 
right? If not, what would the concentration of glucose need to 
be for the reaction to go from left to right, if the concentrations 
of the other reactants and products are as stated above? 


Consider the reaction: 


Glutamate + ammonia = glutamine + H,O 
AG” = +3.4 kcal/mol 


If the concentration of ammonia (NH,) is 10 mM, what is the ratio 
of glutamate/glutamine required for the reaction to proceed 
spontaneously from left to right at 25°C? (See Quantitative Tutorial 
Video). 


It should be clear that the synthesis of glutamine cannot occur 
in a cell by the reaction described in problem 18. The actual 
reaction couples glutamine synthesis to ATP hydrolysis: 


Glutamate + ammonia+ ATP = glutamine + ADP +P 


What is the AG” for this reaction? Suppose that all reactants and 
products, except ammonia, are present at 10 mM. What concen- 
tration of ammonia would be required to drive the reaction to 
the right, that is, to drive the net synthesis of glutamine? 


A noncompetitive inhibitor does not prevent the enzyme from 
binding its substrate. What will be the effect of increasing the 
substrate concentration in the presence of a noncompetitive 
inhibitor? Do you expect a noncompetitive inhibitor to affect the 
enzyme V._..? K,,? Explain briefly. 
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The Structure and Function 
of the Plasma Membrane 


Source: © Peter Turnley / Corbis Images. 


a CHAPTER OUTLINE 


NERVE GAS 


O, March 20, 1995, during the morning rush hour in the 
Tokyo Subway system, one of the busiest in the world, a 
religious cult released a chemical called Sarin into the air. 
The attack was coordinated between multiple subway 
stations, and before long thousands of people were finding 
it difficult to breathe. Although only eight people died in the 
attack, hundreds were seriously injured. Sarin is a type of 
nerve gas—a chemical weapon produced by the Nazis that 
is designed to interfere with the nervous system. After World 
War II, both the United States and the Soviet Union pro- 
duced massive stockpiles of sarin and other nerve gases, 
and while these weapons were not used on a global scale, 
nerve gas was used by Iraqi armed forces in an attack on the 
Kurdish city of Halabja in 1988, as well as in the lran—lraq war. 
The sarin molecule is small but deadly, and it works by 
poisoning an enzyme that degrades chemical signals used 
by nerves to activate muscle cells. Sarin victims die because 
the muscles used for breathing become paralyzed. In this 
chapter we will learn how proteins in the cell membrane 
convert chemical signals to electrical signals that are 
required for many physiological activities. 
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Studying the Structure and Properties 
of Integral Membrane Proteins 
Membrane Lipids and Membrane 
Fluidity 

The Dynamic Nature of the Plasma 
Membrane 


4.8 
4.9 


4.10 
4.11 


4.12 


4.13 


The Red Blood Cell: An Example of 4.14 Active Transport 


Plasma Membrane Structure 4.15 THE HUMAN PERSPECTIVE: 
Solute Movement across Cell Defects in lon Channels and 
Membranes Transporters as a Cause of 
Diffusion through the Lipid Bilayer Inherited Disease 

The Diffusion of lons through 4.16 Membrane Potentials 

Membranes 4.17 Propagation of Action Potentials as 
EXPERIMENTAL PATHWAYS: an Impulse 

The Acetylcholine Receptor 4.18 Neurotransmission: Jumping the 
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4.1 Introduction to the molecular composition of the various layers of a membrane, an 115 


argument that went to the very heart of the subject of membrane 
Plasma Membrane structure and function. As we will see shortly, cell membranes con- 
tain a lipid bilayer, and the two dark-staining layers in the electron 
: : . : micrographs of Figure 4.1 correspond primarily to the inner and 
that protects its human inhabitants from an unpredictable and harsh outer polar surfaces of the bilayer. We will return to the structure of 
external world. You might expect the outer boundary ofa living cells embranes bel ow, but first we will survey some of the major func- 


to be constructed of an equally tough and impenetrable barrier tn of membranes in the life of a cell (FIGURE 4.2). 
because it must also protect its delicate internal contents from a non- 


living, and often inhospitable, environment. Yet cells are separated 

from the external world by a thin, fragile structure called the plaama An Overview of Membrane Functions 
membrane that is only 5 to 10 nm wide. It would require about 

5000 plasma membranes stacked one on top of the other to equalthe | 1. Compartmentalization. Membranes are continuous, unbroken 


The outer walls of a house or car provide a strong, inflexible barrier 
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thickness of a single page of this book. sheets and, as such, inevitably enclose compartments. The plasma 
Because it is so thin, no hint of the plasma membrane is detected membrane encloses the contents of the entire cell, whereas the 
when a section of a cell is examined under a light microscope. In nuclear and cytoplasmic membranes enclose diverse intracellular 
fact, it wasn’t until the late 1950s that techniques for preparing and spaces. The various membrane-bounded compartments of a cell 
staining tissue had progressed to the point where the plasma mem- possess markedly different contents. Membrane compartmen- 
brane could be resolved in the electron microscope. These early elec- talization allows specialized activities to proceed without exter- 
tron micrographs, such as those taken by J. D. Robertson of Duke nal interference and enables cellular activities to be regulated 
University (FIGURE 4.1a), portrayed the plasma membrane as a independently of one another. 
three-layered structure, consisting of darkly staining inner and outer — 2. Scaffold for biochemical activities. Membranes not only enclose 
layers and lightly staining middle layer. All membranes that were compartments but are also a distinct compartment themselves. 
examined closely—whether they were plasma, nuclear, or cytoplas- For reactants floating around in solution, their interactions are 
mic membranes (Figure 4.1b), or taken from plants, animals, or dependent on random collisions. In contrast, components that 
microorganisms—showed this same ultrastructure. In addition to are embedded in a membrane are no longer floating free and can 
providing a visual image of this critically important cellular structure, be ordered for effective interaction. 


these electron micrographs touched off a vigorous debate as to the = 3. Providing a selectively permeable barrier. Membranes prevent 


the unrestricted exchange of molecules from one side to the 
other. At the same time, membranes provide the means of com- 
munication between the compartments they separate. The 
plasma membrane, which encircles a cell, can be compared to a 
moat around a castle: Both serve as a general barrier, yet both 
have gated “bridges” that promote the movement of select 
elements into and out of the enclosed living space. 


eo Ss ht aS 


FIGURE 4.1 The trilaminar appearance of membranes. (a) Electron micrograph 
showing the three-layered (trilaminar) structure of the plasma membrane of an 
erythrocyte after staining the tissue with the heavy metal osmium. Osmium binds 
preferentially to the polar head groups of the lipid bilayer, producing the trilaminar 
pattern. The arrows denote the inner and outer edges of the membrane. (b) The 
outer edge of a differentiated muscle cell grown in culture showing the similar 
trilaminar structure of both the plasma membrane (PM) and the membrane of the 
sarcoplasmic reticulum (SR), a calcium-storing compartment of the cytoplasm. 
Source: (a) Courtesy of J. D. Robertson; (b) From Andrew R. Marks, et al., 

J. Cell Biol., 114, 305, 1991. Reproduced with permission of the Rockefeller 

(a) 50 nm. University Press. 
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FIGURE 4.2 A summary of membrane functions in a plant cell. (1) An 
example of membrane compartmentalization in which hydrolytic enzymes 
(acid hydrolases) are sequestered within the membrane-bounded vacuole. 
(2) An example of the role of cytoplasmic membranes as a site of enzyme 
localization. The fixation of CO, by the plant cell is catalyzed by an enzyme 
that is associated with the outer surface of the thylakoid membranes of the 
chloroplasts. (3) An example of the role of membranes as a selectively 
permeable barrier. Water molecules are able to penetrate rapidly through 
the plasma membrane, causing the plant cell to fill out the available space 
and exert pressure against its cell wall. (4) An example of solute transport. 
Hydrogen ions, which are produced by various metabolic processes in the 
cytoplasm, are pumped out of plant cells into the extracellular space by a 
transport protein located in the plasma membrane. (5) An example of the 
involvement of a membrane in the transfer of information from one side to 
another (signal transduction). In this case, a hormone (e.g., abscisic acid) 
binds to the outer surface of the plasma membrane and triggers the release 
of a chemical message (such as IP3) into the cytoplasm. In this case, IP3 
causes release of Ca** ions from a cytoplasmic warehouse. (6) An example 
of the role of membranes in cell-cell communication. Openings between 
adjoining plant cells, called plasmodesmata, allow materials to move directly 
from the cytoplasm of one cell into its neighbors. (7) An example of the role 
of membranes in energy transduction. The conversion of ADP to ATP occurs 
in close association with the inner membrane of the mitochondrion. 


4. Transporting solutes. The plasma membrane contains the machin- 
ery for physically transporting substances from one side of the 
membrane to another, often from a region where the solute is pre- 
sent at low concentration into a region where that solute is present 
at much higher concentration. The membrane’s transport machin- 
ery allows a cell to accumulate substances, such as sugars and 
amino acids, that are necessary to fuel its metabolism and build its 


macromolecules. The plasma membrane is also able to transport 
specific ions, thereby establishing ionic gradients across itself. This 
capability is especially critical for nerve and muscle cells. 


5. Responding to external stimuli. The plasma membrane plays a 
critical role in the response of a cell to external stimuli, a process 
known as signal transduction. Membranes possess receptors 
that combine with specific molecules (ligands) or respond to 
other types of stimuli such as light or mechanical tension. 
Different types of cells have membranes with different receptors 
and are, therefore, capable of recognizing and responding to dif- 
ferent environmental stimuli. The interaction of a plasma mem- 
brane receptor with an external stimulus may cause the membrane 
to generate a signal that stimulates or inhibits internal activities. 
For example, signals generated at the plasma membrane may tell 
a cell to manufacture more glycogen, to prepare for cell division, 
to move toward a higher concentration of a particular com- 
pound, to release calcium from internal stores, or possibly to 
commit suicide. 


6. Intercellular interaction. Situated at the outer edge of every liv- 
ing cell, the plasma membrane of multicellular organisms medi- 
ates the interactions between a cell and its neighbors. The plasma 
membrane allows cells to recognize and signal one another, to 
adhere when appropriate, and to exchange materials and infor- 
mation. Proteins within the plaama membrane may also facilitate 
the interaction between extracellular materials and the intracel- 
lular cytoskeleton. 


7. Energy transduction. Membranes are intimately involved in the 
processes by which one type of energy is converted to another 
type (energy transduction). The most fundamental energy trans- 
duction occurs during photosynthesis when energy in sunlight is 
absorbed by membrane-bound pigments, converted into chemi- 
cal energy, and stored in carbohydrates. Membranes are also 
involved in the transfer of chemical energy from carbohydrates 
and fats to ATP. In eukaryotes, the machinery for these energy 
conversions is contained within membranes of chloroplasts and 
mitochondria. 


We will concentrate in this chapter on the structure and functions of 
the plasma membrane, but remember that the principles discussed 
here are common to all cell membranes. Specialized aspects of the 
structure and functions of mitochondrial, chloroplast, cytoplasmic, 
and nuclear membranes will be discussed in Chapters 5, 6, 8, and 12, 
respectively. 


A Brief History of Studies on Plasma 
Membrane Structure 


The first insights into the chemical nature of the outer boundary layer 
of a cell were obtained by Ernst Overton of the University of Ziirich 
during the 1890s. Overton knew that nonpolar solutes dissolved more 
readily in nonpolar solvents than in polar solvents, and that polar sol- 
utes had the opposite solubility. Overton reasoned that a substance 
entering a cell from the medium would first have to dissolve in the 
outer boundary layer of that cell. To test the permeability of the outer 
boundary layer, Overton placed plant root hairs into hundreds of dif- 
ferent solutions containing a diverse array of solutes. He discovered 
that the more lipid-soluble the solute, the more rapidly it would enter 
the root hair cells. He concluded that the dissolving power of the outer 
boundary layer of the cell matched that of a fatty oil. 


Movable 
Lipids pane 


e 7 
Stationary ,” \ 
barrier « 


| 
BZ 


(a) 


FIGURE 4.3 The plasma membrane contains a lipid bilayer. (a) Calculating the surface area of a lipid preparation. When a sample of phospholipids is 
dissolved in an organic solvent, such as hexane, and spread over an aqueous surface, the phospholipid molecules form a layer over the water that is a single 
molecule thick: a monomolecular layer. The molecules in the layer are oriented with their hydrophilic groups bonded to the surface of the water and their 
hydrophobic chains directed into the air. To estimate the surface area the lipids would cover if they were part of a membrane, the lipid molecules can be 
compressed into the smallest possible area by means of movable barriers. Using this type of apparatus, which is called a Langmuir trough after its inventor, 
Gorter and Grendel concluded that red blood cells contained enough lipid to form a layer over their surface that was two molecules thick: a bilayer. (b) As 
Gorter and Grendel first proposed, the core of a membrane contains a bimolecular layer of phospholipids oriented with their water-soluble head groups 
facing the outer surfaces and their hydrophobic fatty acid tails facing the interior. The structures of the head groups are given in Figure 4.6a. (c) Simulation of 
a fully hydrated lipid bilayer composed of the phospholipid phosphatidylcholine. Phospholipid head groups are purple, water molecules are blue, fatty acid 


chains are green. 


Source: (c) Simulation of a fully hydrated lipid bilayer. Phospholipid head groups are shown in blue and orange, water molecules in red and white, and fatty 


acid chains are green. 


The first proposal that cellular membranes might contain a lipid 
bilayer was made in 1925 by two Dutch scientists, E. Gorter and 
E Grendel. These researchers extracted the lipid from human red blood 
cells and measured the amount of surface area the lipid would cover 
when spread over the surface of water (FIGURE 4.3a). Because mature 
mammalian red blood cells lack both nuclei and cytoplasmic orga- 
nelles, the plasma membrane is the only lipid-containing structure in 
the cell. Consequently, all of the lipids extracted from the cells can be 
assumed to have resided in the cells’ plasma membranes. The ratio of 
the surface area of water covered by the extracted lipid to the surface 
area calculated for the red blood cells from which the lipid was extracted 
varied between 1.8 to 1 and 2.2 to 1. Gorter and Grendel speculated 
that the actual ratio was 2:1 and concluded that the plasma membrane 
contained a bimolecular layer of lipids, that is, a lipid bilayer 
(Figure 4.3b). They also suggested that the polar groups of each 
molecular layer (or leaflet) were directed outward toward the aqueous 
environment, as shown in Figure 4.3b,c. This would be the thermody- 
namically favored arrangement, because the polar head groups of 
the lipids could interact with surrounding water molecules, just as the 
hydrophobic fatty acyl chains would be protected from contact with the 
aqueous environment (Figure 4.3c). Thus, the polar head groups would 
face the cytoplasm on one edge and the blood plasma on the other. 
Even though Gorter and Grendel made several experimental errors 
(which fortuitously canceled one another out), they still arrived at the 
correct conclusion that membranes contain a lipid bilayer. 

In the 1920s and 1930s, cell physiologists obtained evidence that 
there must be more to the structure of membranes than simply a 


lipid bilayer. It was found, for example, that lipid solubility was not 
the sole determining factor as to whether a substance could pene- 
trate the plasma membrane. Similarly, the surface tensions of mem- 
branes were calculated to be much lower than those of pure lipid 
structures. This decrease in surface tension could be explained by 
the presence of protein in the membrane. 

This protein is present in the form of individual protein mole- 
cules and protein complexes that penetrate a fluid lipid bilayer 
(FIGURE 4.4a) and extend out into the surrounding aqueous envi- 
ronment (Figure 4.4b). Because of the fluidity of the lipid bilayer, 
cellular membranes are dynamic structures in which the compo- 
nents are mobile and capable of coming together to engage in various 
types of transient or semipermanent interactions. 


REVIEW | 


1. Describe some of the important roles of membranes 
in the life of a eukaryotic cell. What do you think might 
be the effect of a membrane that was incapable of 
performing one or another of these roles? 


2. Summarize some of the major steps leading to the 
current model of membrane structure. How does 
each new model retain certain basic principles of 
earlier models? 
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FIGURE 4.4 Plasma mebranes as lipid plus proteins. (a) A representation of the plasma membrane showing the organization of proteins embedded in 
the lipid bilayer. The external surface of most membrane proteins, as well as a small percentage of the phospholipids, contain short chains of sugars, 
making them glycoproteins and glycolipids. Those portions of the polypeptide chains that extend through the lipid bilayer typically occur as a helices 
composed of hydrophobic amino acids. The two leaflets of the bilayer contain different types of lipids as indicated by the differently colored head groups. 
(b) Molecular model of the membrane of a synaptic vesicle constructed using known structures of the various proteins along with information on their 
relative numbers obtained from the analysis of purified synaptic vesicles. The high protein density of the membrane is apparent. Most of the proteins in 
this membrane are required for the interaction of the vesicle with the plasma membrane. The large blue protein at the lower right pumps H* ions into 


the vesicle. 


Source: (b) From Shigeo Takamori, et al. Courtesy of Reinhard Jahn, Cell, 127:841, 2006, reprinted with permission from Elsevier. 


4.2 The Lipid Composition 
of Membranes 


Membranes are lipid-protein assemblies in which the components 
are held together in a thin sheet by noncovalent bonds. As noted on 
page 117, the core of the membrane consists of a sheet of lipids 
arranged in a bimolecular layer (Figure 4.3b,c). The lipid bilayer 
serves primarily as a structural backbone of the membrane and pro- 
vides the barrier that prevents random movements of water-soluble 
materials into and out of the cell. The proteins of the membrane, on 
the other hand, carry out most of the specific functions summarized in 
Figure 4.2. Each type of differentiated cell contains a unique comple- 
ment of membrane proteins that contributes to the specialized activ- 
ities of that cell type (see Figure 4.32d for an example). 

The ratio of lipid to protein in a membrane varies, depending 
on the type of cellular membrane (plasma vs. endoplasmic reticulum 
vs. Golgi), the type of organism (bacterium vs. plant vs. animal), and 
the type of cell (cartilage vs. muscle vs. liver). For example, the inner 
mitochondrial membrane has a very high ratio of protein/lipid in 
comparison to the red blood cell plasma membrane, which is high 
in comparison to the membranes of the myelin sheath that form a 
multilayered wrapping around a nerve cell (FIGURE 4.5). To a large 
degree, these differences can be correlated with the basic functions 
of these membranes. The inner mitochondrial membrane contains 
the protein carriers of the electron-transport chain, and relative to 
other membranes, lipid is diminished. In contrast, the myelin sheath 
acts primarily as electrical insulation for the nerve cell it encloses, a 
function that is best carried out by a thick lipid layer of high electrical 
resistance with a minimal content of protein. Membranes also 
contain carbohydrates, which are attached to the lipids and proteins 
as indicated in Figure 4.4a. 


lum 


FIGURE 4.5 The myelin sheath. Electron micrograph of a nerve cell axon 
surrounded by a myelin sheath consisting of concentric layers of plasma 
membrane that have an extremely low protein/lipid ratio. The myelin 
sheath insulates the nerve cell from the surrounding environment, which 
increases the velocity at which impulses can travel along the axon 
(discussed on page 162). The perfect spacing between the layers is 
maintained by interlocking protein molecules (called Po) that project 
from each membrane. 

Source: From Leonard Napolitano, Francis LeBaron, and Jospeh Scaletti, 
J. Cell Biol. 34:820, 1967. Reproduced with permission of the Rockefeller 
University Press. 


Membrane Lipids 


Membranes contain a wide diversity of lipids, all of which are 
amphipathic; that is, they contain both hydrophilic and hydropho- 
bic regions. There are three main types of membrane lipids: phos- 
phoglycerides, sphingolipids, and cholesterol. 


PHOSPHOGLYCERIDES Most membrane lipids contain a 
phosphate group, which makes them phospholipids. Because most 
membrane phospholipids are built on a glycerol backbone, they are 
called phosphoglycerides (FIGURE 4.6a). Unlike triglycerides, 
which have three fatty acids (page 46) and are not amphipathic, 
membrane glycerides are diglycerides—only two of the hydroxyl 
groups of the glycerol are esterified to fatty acids; the third is esteri- 
fied to a hydrophilic phosphate group. Without any additional sub- 
stitutions beyond the phosphate and the two fatty acyl chains, the 
molecule is called phosphatidic acid, which is virtually absent in 
most membranes. Instead, membrane phosphoglycerides have an 
additional group linked to the phosphate, most commonly either 
choline (forming phosphatidylcholine, PC), ethanolamine (forming 
phosphatidylethanolamine, PE), serine (forming phosphatidylserine, 
PS), or inositol (forming phosphatidylinositol, PI). Each of these 
groups is small and hydrophilic and, together with the negatively 
charged phosphate to which it is attached, forms a highly water- 
soluble domain at one end of the molecule, called the head group. 
At physiologic pH, the head groups of PS and PI have an overall 
negative charge, whereas those of PC and PE are neutral. In contrast, 
the fatty acyl chains are hydrophobic, unbranched hydrocarbons 
approximately 16 to 22 carbons in length (Figure 4.6a). A membrane 
fatty acid may be fully saturated (i-e., lack double bonds), monoun- 
saturated (i.e., possess one double bond), or polyunsaturated (i.e., 
possess more than one double bond). Phosphoglycerides often 
contain one unsaturated and one saturated fatty acyl chain. Recent 
interest has focused on the apparent health benefits of two highly 
unsaturated fatty acids (EPA and DHA) found at high concentration 
in fish oil. EPA and DHA contain five and six double bonds, respec- 
tively, and are incorporated primarily into PE and PC molecules of 
certain membranes, most notably in the brain and retina. EPA and 
DHA are described as omega-3 fatty acids because their last double 
bond is situated three carbons from the omega (CH;) end of the fatty 
acyl chain. With fatty acid chains at one end of the molecule and a 
polar head group at the other end, all of the phosphoglycerides 
exhibit a distinct amphipathic character. 


SPHINGOLIPIDS A less abundant class of membrane lipids, 
called sphingolipids, are derivatives of sphingosine, an amino alcohol 
that contains a long hydrocarbon chain (Figure 4.6b). Sphingolipids 
consist of sphingosine linked to a fatty acid (R of Figure 4.6b) by its 
amino group. This molecule is a ceramide. The various sphingosine- 
based lipids have additional groups esterified to the terminal alcohol 
of the sphingosine moiety. If the substitution is phosphorylcholine, 
the molecule is sphingomyelin, which is the only phospholipid of the 
membrane that is not built with a glycerol backbone. If the substitu- 
tion is a carbohydrate, the molecule is a glycolipid. If the carbohy- 
drate is a simple sugar, the glycolipid is called a cerebroside; if it is a 
small cluster of sugars that includes sialic acid, the glycolipid is called 
a ganglioside. Hundreds of different gangliosides have been identified 
by differences in their carbohydrate chains. Because all sphingolipids 
have two long, hydrophobic hydrocarbon chains at one end and 
a hydrophilic region at the other, they are also amphipathic and 
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FIGURE 4.6 The chemical structures of membrane lipids. (a) The 
structures of phosphoglycerides (see also Figure 2.22). (b) The structures of 
sphingolipids. Sohingomyelin is a phospholipid; cerebrosides and 
gangliosides are glycolipids. A third membrane lipid is cholesterol, which is 
shown in the next figure. (R = fatty acyl chain). [The green portion of each 
lipid, which represents the hydrophobic tail(s) of the molecule, is actually 
much longer than the hydrophilic head group (see Figure 4.23).] 
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fatty acyl chains of sphingolipids, however, tend to be longer and 
more highly saturated than those of phosphoglycerides. 

Glycolipids are interesting membrane components. Relatively 
little is known about them, yet tantalizing hints have emerged to 
suggest they play crucial roles in cell function. The nervous system is 
particularly rich in glycolipids. The myelin sheath pictured in Figure 
4.5 contains a high content of a particular glycolipid, called galac- 
tocerebroside (shown in Figure 4.6b), which is formed when a galac- 
tose is added to ceramide. Mice lacking the enzyme that carries out 
this reaction exhibit severe muscular tremors and eventual paralysis. 
Similarly, humans who are unable to synthesize a particular ganglio- 
side (Gyy3) suffer from a serious neurological disease characterized 
by severe seizures and blindness. Glycolipids also play a role in cer- 
tain infectious diseases; the toxins that cause cholera and botulism 
both enter their target cell by first binding to cell-surface ganglio- 
sides, as does the influenza virus. 


CHOLESTEROL Another lipid component of certain mem- 
branes is the sterol cholesterol (see Figure 2.21), which in certain 
animal cells may constitute up to 50 percent of the lipid molecules in 
the plasma membrane. Plant cells contain cholesterol-like sterols, 
but biologists disagree as to whether they completely lack cholesterol. 
Cholesterol molecules are oriented with their small hydrophilic 
hydroxyl group toward the membrane surface and the remainder 
of the molecule embedded in the lipid bilayer (FIGURE 4.7). The 
hydrophobic rings of a cholesterol molecule are flat and rigid, 
and they interfere with the movements of the fatty acid tails of the 
phospholipids (page 131). 


4 oF? 4% 
> - 
Xf 748 as 


ey 


Li} 


td 
446 
rE 


». ( 
}* 
4 
e 


a 


et 

‘> 

7 
4 


oo FAy, 
2 ow 4 
Pe 
"dee 
¢: 


A 


FIGURE 4.7 The cholesterol molecules (shown in green) of a lipid bilayer 
are oriented with their small hydrophilic end facing the external surface of 
the bilayer and the bulk of their structure packed in among the fatty acid 
tails of the phospholipids. The placement of cholesterol molecules 
interferes with the flexibility of the lipid hydrocarbon chains, which tends to 
stiffen the bilayer while maintaining its overall fluidity. Unlike other lipids of 
the membrane, cholesterol is often rather evenly distributed between the 
two layers (leaflets). 

Source: From H. L. Scott, Curr. Opin. Struct. Biol. 12:499, 2002, Figure 3. 
© 2002, with permission from Elsevier. 


The Nature and Importance 
of the Lipid Bilayer 


Each type of cellular membrane has its own characteristic lipid com- 
position, differing from one another in the types of lipids, the nature 
of the head groups, and the particular species of fatty acyl chain(s). 
Because of this structural variability, it is estimated that some bio- 
logical membranes contain hundreds of chemically distinct species 
of phospholipids, which can be catalogued by mass spectrometry. 
The biological significance of this remarkable diversity of lipid spe- 
cies remains the subject of interest and speculation. 

The percentages of some of the major types of lipids of a variety 
of membranes are given in Table 4.1. The lipids of a membrane are 
more than simple structural elements; they can have important 
effects on the biological properties of a membrane. Lipid composi- 
tion can determine the physical state of the membrane (page 131) 
and influence the activity of particular membrane proteins. 
Membrane lipids also provide the precursors for highly active chem- 
ical messengers that regulate cellular function (Section 15.6). 

Various types of measurements indicate that the combined fatty 
acyl chains of both leaflets of the lipid bilayer span a width of about 
30 A and that each row of head groups (with its adjacent shell of 
water molecules) adds another 15 A. Thus, the entire lipid bilayer is 
only about 60 A (6 nm) thick. The presence in membranes of this 
thin film of amphipathic lipid molecules has remarkable conse- 
quences for cell structure and function. Because of thermodynamic 
considerations, the hydrocarbon chains of the lipid bilayer are never 
exposed to the surrounding aqueous solution. Consequently, mem- 
branes are never seen to have a free edge; they are always continuous, 
unbroken structures. As a result, membranes form extensive inter- 
connected networks within the cell. Because of the flexibility of the 
lipid bilayer, membranes are deformable and their overall shape can 
change, as occurs during locomotion (FIGURE 4.8a) or cell division 
(Figure 4.8b). The lipid bilayer is thought to facilitate the regulated 
fusion or budding of membranes. For example, the events of secre- 
tion, in which cytoplasmic vesicles fuse to the plasma membrane 
(Figure 4.8c), or of fertilization, where two cells fuse to form a single 
cell, involve processes in which two separate membranes come 
together to become one continuous sheet (see Figure 8.32). The 
importance of the lipid bilayer in maintaining the proper internal 
composition of a cell, in separating electric charges across the plasma 
membrane, and in many other cellular activities will be apparent 
throughout this and subsequent chapters. 

Another important feature of the lipid bilayer is its ability to 
self-assemble, which can be demonstrated more easily within a test 
tube than a living cell. If, for example, a small amount of phosphati- 
dylcholine is dispersed in an aqueous solution, the phospholipid 
molecules assemble spontaneously to form the walls of fluid-filled 
spherical vesicles, called liposomes. The walls of these liposomes 
consist of a continuous lipid bilayer that is organized in the same 
manner as that of the lipid bilayer of a natural membrane. Liposomes 
have proven invaluable in membrane research. Membrane proteins 
can be inserted into liposomes and their function studied in a much 
simpler environment than that of a natural membrane. Liposomes 
have also been developed as vehicles to deliver drugs or DNA mole- 
cules within the body. The drugs or DNA can be linked to the wall of 
the liposome or contained at high concentration within its lumen 
(Figure 4.9). In these studies, the walls of the liposomes are con- 
structed to contain specific proteins (such as antibodies or hor- 
mones) that allow the liposomes to bind selectively to the surfaces of 


TABLE 4.1 Lipid Compositions of Some Biological Membranes* 


Lipid Human erythrocyte 


Human myelin 


Beef heart mitochondria 


Phosphatidic acid 15 
Phosphatidylcholine 19 
Phosphatidyl-ethanolamine 18 
Phosphatidylgycerol 0 


Phosphatidylserine 8.5 
Cardiolipin 

Sphingomyelin 

Glycolipids 

Cholesterol 


*The values given are weight percent of total lipid. 


0.5 


Source: C. Tanford, The Hydrophobic Effect, p. 109, copyright 1980, John Wiley & Sons, Inc. Reprinted by permission of John Wiley & Sons, Inc. 


(a) (b) 


FIGURE 4.8 The dynamic properties of the plasma membrane. (a) The leading edge of a moving cell often contains sites where the plasma membrane 
displays undulating ruffles. (b) Division of a cell is accompanied by the deformation of the plasma membrane as it is pulled toward the center of the cell. 
Unlike most dividing cells, the cleavage furrow of this dividing ctenophore egg begins at one pole and moves unidirectionally through the egg. (c) 
Membranes are capable of fusing with other membranes. This electron micrograph shows a secretory granule discharging its contents after fusion with the 


overlying plasma membrane (arrows). 


Source: (a) Courtesy of Jean Paul Revel; (6) Courtesy of Gary Freeman; (c) Courtesy of Susan Jo Burwen. 


particular target cells where the drug or DNA is intended to go. Most 
of the early clinical studies with liposomes met with failure because 
the injected vesicles were rapidly removed by phagocytic cells of the 
immune system. This obstacle has been overcome with the develop- 
ment of so-called stealth liposomes that contain an outer coating of 
a synthetic polymer that protects the liposomes from immune 
destruction (FIGURE 4.9). One important example is Caelyx, a stealth 
liposome containing the chemotherapy drug doxorubicin. Caelyx is 
now an accepted therapy for treatment of metastatic breast cancer. 


The Asymmetry of Membrane Lipids 


The lipid bilayer consists of two distinct leaflets that have a distinctly 
different lipid composition. One line of experiments that has led to 
this conclusion takes advantage of the fact that lipid-digesting enzymes 
cannot penetrate the plasma membrane and, consequently, are only 
able to digest lipids that reside in the outer leaflet of the bilayer. If 
intact human red blood cells are treated with a lipid-digesting phos- 
pholipase, approximately 80 percent of the phosphatidylcholine 
(PC) of the membrane is hydrolyzed, but only about 20 percent of 
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FIGURE 4.9 Liposomes. A schematic diagram of a stealth liposome 
containing a hydrophilic polymer (such as polyethylene glycol) to protect it 
from destruction by immune cells, antibody molecules that target it to 
specific body tissues, a water-soluble drug enclosed in the fluid-filled 
interior chamber, and a lipid-soluble drug in the bilayer. 
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FIGURE 4.10 The asymmetric distribution of phospholipids (and 
cholesterol) in the plasma membrane of human erythrocytes. (SM, 
sphingomyelin; PC, phosphatidylcholine; PS, phosphatidylserine; PE, 
phosphatidylethanolamine; Pl, phosphatidylinositol; Cl, cholesterol.) 


the membranes phosphatidylethanolamine (PE) and less than 
10 percent of its phosphatidylserine (PS) are attacked. These data 
indicate that, compared to the inner leaflet, the outer leaflet has a 
relatively high concentration of PC (and sphingomyelin) and a low 
concentration of PE and PS (FIGURE 4.10). It follows that the lipid 
bilayer can be thought of as composed of two more-or-less stable, 
independent monolayers having different physical and chemical 
properties. 

The different classes of lipids in Figure 4.10 exhibit different 
properties. All the glycolipids of the plaama membrane are in the outer 
leaflet where they often serve as receptors for extracellular ligands. 
Phosphatidylethanolamine, which is concentrated in the inner leaflet, 
tends to promote the curvature of the membrane, which is important 
in membrane budding and fusion. Phosphatidylserine, which is con- 
centrated in the inner leaflet, has a net negative charge at physiologic 
pH, which makes it a candidate for binding positively charged lysine 
and arginine residues, such as those adjacent to the membrane- 
spanning o helix of glycophorin A in Figure 4.18. The appearance of PS 
on the outer surface of aging lymphocytes marks the cells for destruc- 
tion by macrophages, whereas its appearance on the outer surface of 
platelets leads to blood coagulation. Phosphatidylinositol (PI), which 
is concentrated in the inner leaflet, can be phosphorylated at different 
sites on the inositol ring, which converts the lipid into a phosphoi- 
nositide. As discussed in Chapter 15, phosphoinositides play a key role 
in the transfer of stimuli from the plasma membrane to the cytoplasm 
(Section 15.6) and the recruitment of proteins to the cytosolic face of 
the plasma membrane. 


REVIEW | 


1. Draw the basic structure of the major types of lipids 
found in cellular membranes. How do sphingolipids 
differ from glycerolipids? Which lipids are phospholipids? 
Which are glycolipids? How are these lipids organized 
into a bilayer? How is the bilayer important for 
membrane activities? 


2. What is a liposome? How are liposomes used in 
medical therapies? 


——— 


4.3 Membrane Carbohydrates 


The plasma membranes of eukaryotic cells also contain carbohydrate. 
Depending on the species and cell type, the carbohydrate content of 
the plasma membrane ranges between 2 and 10 percent by weight. 
More than 90 percent of the membrane’s carbohydrate is covalently 
linked to proteins to form glycoproteins; the remaining carbohydrate 
is covalently linked to lipids to form glycolipids, which were discussed 
on page 119. As indicated in Figure 4.4c, all of the carbohydrate of the 
plasma membrane faces outward into the extracellular space.' The 
carbohydrate of internal cellular membranes also faces away from the 
cytosol (the basis for this orientation is illustrated in Figure 8.14). 
The modification of proteins was discussed briefly on page 50. 
The addition of carbohydrate, or glycosylation, is the most com- 
plex of these modifications. The carbohydrate of glycoproteins is pre- 
sent as short, branched hydrophilic oligosaccharides, typically 
having fewer than about 15 sugars per chain. In contrast to most 
high-molecular-weight carbohydrates (such as glycogen, starch, or 
cellulose), which are polymers of a single sugar, the oligosaccharides 
attached to membrane proteins and lipids can display extensive 
variability in composition and structure. Even the same protein can 
display different chains of sugars in different cells and tissues. 
Oligosaccharides may be attached to several different amino acids by 
two major types of linkages (FIGURE 4.11). These carbohydrate pro- 
jections play an important role in mediating the interactions of a cell 
with its environment (Chapter 7) and sorting of membrane proteins 
to different cellular compartments (Chapter 8). The carbohydrates 
of the glycolipids of the red blood cell plasma membrane determine 
whether a person’s blood type is A, B, AB, or O (FIGURE 4.12). 
A person having blood type A has an enzyme that adds an 
N-acetylgalactosamine to the end of the chain, whereas a person with 


‘It can be noted that even though phosphatidylinositol contains a sugar group (Figure 4.6), 
it is not considered to be part of the carbohydrate portion of the membrane in this 
discussion. 
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FIGURE 4.11 Two types of linkages that join sugars to a polypeptide 
chain. The N-glycosidic linkage between asparagine and N-acetylglucosamine 
is more common than the O-glycosidic linkage between serine or threonine 
and N-acetylgalactosamine. 


B antigen 


FIGURE 4.12 Blood-group antigens. Whether a person has type A, B, AB, or 
O blood is determined by a short chain of sugars covalently attached to 
membrane lipids and proteins of the red blood cell membrane. The oligosaccha- 
rides attached to membrane lipids (forming a ganglioside) that produce the A, B, 
and O blood types are shown here. A person with type AB blood has ganglio- 
sides with both the A and B structure. (Gal, galactose; GIcNAc, N-acetylglucosa- 
mine; Glu, glucose; Fuc, fucose; GalNAc, N-acetylgalactosamine.) 


type B blood has an enzyme that adds galactose to the chain terminus. 
These two enzymes are encoded by alternate versions of the same 
gene, yet they recognize different substrates. People with AB blood 
type possess both enzymes, whereas people with O blood type lack 
enzymes capable of attaching either terminal sugar. The ABO carbo- 
hydrate modifications are found in many other tissues besides blood, 
and genetic variation in the AB gene is a strong predictor of risk for 
pancreatic cancer, heart disease, and viral infection, but the biochem- 
ical function of the ABO blood-group antigens remains a mystery. 
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1. What is an oligosaccharide? How are oligosaccharides 
linked to membrane proteins? How are they related to 
human blood types? 


4.4 Membrane Proteins 


Depending on the cell type and the particular organelle within that 
cell, a membrane may contain hundreds of different proteins. Each 
membrane protein has a defined orientation relative to the cytoplasm, 
so that the properties of one surface of a membrane are very different 
from those of the other surface (as in Figure 4.4a). This asymmetry is 
referred to as membrane “sidedness.’ In the plasma membrane, for 
example, those parts of membrane proteins that interact with other 
cells or with extracellular substances are exposed to the extracellular 
space, whereas those parts of membrane proteins that interact with 
cytoplasmic molecules are exposed to the cytosol. Membrane proteins 
can be grouped into three distinct classes distinguished by the inti- 
macy of their relationship to the lipid bilayer (FIGURE 4.13). These are 


1. Integral proteins (Figure 4.132) that penetrate the lipid bilayer. 
Integral proteins are transmembrane proteins; that is, they 
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FIGURE 4.13 Three classes of membrane protein. (a) Integral pro- 
teins typically contain one or more transmembrane helices (see Figure 5.4 
for an exception). (b) Peripheral proteins are noncovalently bonded to 
the polar head groups of the lipid bilayer and/or to an integral mem- 
brane protein. (c) Lipid-anchored proteins are covalently bonded to a 
lipid group that resides within the membrane. The lipid can be phosphati- 
dylinositol, a fatty acid, or a prenyl group (a long-chain hydrocarbon 

built from five-carbon isoprenoid units). |, inositol; GIcNAc, 
N-acetylglucosamine; Man, mannose; Etn, ethanolamine; GPI, 
glycosylphosphatidylinositol. 
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pass entirely through the lipid bilayer and thus have domains that 
protrude from both the extracellular and cytoplasmic sides of the 
membrane. Some integral proteins have only one membrane- 
spanning segment, whereas others are multispanning. Genome- 
sequencing studies suggest that integral proteins constitute 25-30 
percent of all encoded proteins and roughly 60 percent of all cur- 
rent drug targets. 


2. Peripheral proteins (Figure 4.13b) that are located entirely 
outside of the lipid bilayer, on either the cytoplasmic or extracel- 
lular side, yet are associated with the surface of the membrane by 
noncovalent bonds. 


3. Lipid-anchored proteins (Figure 4.13c) that are located out- 
side the lipid bilayer, on either the extracellular or cytoplasmic 
surface, but are covalently linked to a lipid molecule that is situ- 
ated within the bilayer. 


Integral Membrane Proteins 


Most integral membrane proteins function in the following capacities: 
as receptors that bind specific substances at the membrane surface, as 
channels or transporters involved in the movement of ions and solutes 
across the membrane, or as agents that transfer electrons during the 
processes of photosynthesis and respiration. Like the phospholipids of 
the bilayer, integral membrane proteins are also amphipathic, having 
both hydrophilic and hydrophobic portions. As discussed below, 
those portions of an integral membrane protein that reside within the 
lipid bilayer—the transmembrane domains—tend to have a hydro- 
phobic character. Amino acid residues in transmembrane domains 
form van der Waals interactions with the fatty acyl chains of the 
bilayer, which seals the protein into the lipid “wall” of the membrane. 
As a result, the permeability barrier of the membrane is preserved, the 
protein is anchored within the bilayer, and the protein is brought into 
direct contact with surrounding lipid molecules (FIGURE 4.14a). 
According to current consensus, most of the lipid molecules that 
make contact with a transmembrane domain, such as those shown in 
Figure 4.14a, are simply passive bystanders and are rapidly exchanged 
with other lipid molecules in the bilayer. There is increasing evidence, 
however, that certain sites on the surface of many membrane proteins 
do form important functional interactions with specific lipid mole- 
cules. An example of this is shown in Figure 14.4), where anionic lipid 
molecules are seen to bind in the crevice at the interfaces between the 
subunits of a tetrameric KcsA K* channel. The channel does not open 
normally in a bilayer that lacks these specific lipid molecules. 

Those portions of an integral membrane protein that project into 
either the cytoplasm or extracellular space tend to be more like the 
globular proteins discussed in Section 2.10. These nonembedded. 
domains tend to have hydrophilic surfaces that interact with water- 
soluble substances (low-molecular-weight substrates, hormones, and 
other proteins) at the edge of the membrane. In some membrane pro- 
teins, the transmembrane domains are essentially devoid of water 
molecules, whereas others allow the aqueous solvent to penetrate 
deep into the proteins membrane-embedded regions. Several large 
families of membrane proteins contain an interior channel that pro- 
vides an aqueous passageway through the lipid bilayer. The linings of 
these channels typically contain key hydrophilic residues at strategic 
locations. As will be discussed later, integral proteins need not be fixed 
structures but may be able to move laterally within the membrane. 

The concept that proteins penetrate through membranes, rather 
than simply remaining external to the bilayer, was derived primarily 
from the results of a technique called freeze-fracture replication 


Lipid 
bilayer 


(b) 


FIGURE 4.14 The interactions between membrane proteins and lipids. 
(a) Aquaporin is a membrane protein containing four subunits (colored 
differently in the illustration) with each subunit containing an aqueous 
channel. Analysis of the protein's structure revealed the presence of a 
surrounding layer of bound lipid molecules. In this case, these lipid 
molecules are not likely to play a role in the function of aquaporin because 
the protein retains its function as a water channel in bilayers containing 
nonnative lipids. (b) Two views of another tetrameric membrane protein, in 
this case the bacterial K* channel, KcsA. Anionic phosphatidylglycerol 
molecules (red/gray) are seen in each crevice between the subunits and are 
thought to be required for normal channel function. A K* ion (purple 
sphere) is seen in transit through the pore. 

Source: (a) From Carola Hunte and Sebastian Richers, Curr. Opin. Struct. 
Biol. 18:407, © 2008, with permission from Elsevier Science; (b) From 
Anthony G. Lee, Trends Biochem. Sci. 36:497, © 2011, with permission 
from Elsevier Science. 


(see Section 18.5). In this procedure, tissue is frozen solid and then 
struck with a knife blade, which fractures the block into two pieces. 
As this occurs, the fracture plane often takes a path between the two 
leaflets of the lipid bilayer (FIGURE 4.15a). Once the membranes are 
split in this manner, metals are deposited on their exposed surfaces to 
form a shadowed replica, which is viewed in the electron microscope 
(see Figure 18.20). As shown in Figure 4.15), the replica resembles a 
road strewn with pebbles, which are called membrane-associated 
particles. Because the fracture plane passes through the center of the 
bilayer, most of these particles correspond to integral membrane 
proteins that extend at least halfway through the lipid core. When 
the fracture plane reaches a given particle, it goes around it rather 
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FIGURE 4.15 Freeze fracture: a technique for investigating cell membrane 
structure. (a) When a block of frozen tissue is struck by a knife blade, a fracture 
plane runs through the tissue, often following a path that leads it through the middle 
of the lipid bilayer. The fracture plane goes around the proteins rather than cracking 


SUIDJOIY SULIQUIA/\| e P'7 


Cytoplasm 


(a) them in half, and they segregate with one of the two halves of the bilayer. The 
exposed faces within the center of the bilayer can then be covered with a metal 
deposit to form a metallic replica. These exposed faces are referred to as the E, or 
ectoplasmic face, and the P, or protoplasmic face. (b) Replica of a freeze-fractured 


than cracking it in half. Consequently, each protein (particle) sepa- 
rates with one half of the plasma membrane (Figure 4.15c), leaving a 
corresponding pit in the other half (see Figure 7.27c). One of the 
great values of the freeze-fracturing technique is that it allows an 
investigation of the microheterogeneity of the membrane. Localized 
differences in parts of the membrane stand out in these replicas and 
can be identified (as illustrated by the replica of a gap junction shown 
in Figure 7.30d). Biochemical analyses, in contrast, average out such 
differences. 


Peripheral Membrane Proteins 


Peripheral proteins are associated with the membrane by weak 
electrostatic bonds (refer to Figure 4.13b). Peripheral proteins can 
usually be solubilized by extraction with high-concentration salt 
solutions that weaken the electrostatic bonds holding peripheral 
proteins to a membrane. In actual fact, the distinction between inte- 
gral and peripheral proteins is blurred because many integral mem- 
brane proteins consist of several polypeptides, some that penetrate 
the lipid bilayer and others that remain on the periphery. 

The best studied peripheral proteins are located on the internal 
(cytosolic) surface of the plasma membrane, where they form a 
fibrillar network that acts as a membrane “skeleton” (see Figure 4.32d). 
These proteins provide mechanical support for the membrane and 
function as an anchor for integral membrane proteins. Other periph- 
eral proteins on the internal plasma membrane surface function as 
enzymes, specialized coats (see Figure 8.24), or factors that transmit 
transmembrane signals (see Figure 15.19). Peripheral proteins 


human erythrocyte. The P fracture face is seen to be covered with particles approxi- 
mately 8 nm in diameter. A small ridge (arrow) marks the junction of the particulate 
face with the surrounding ice. (c) This micrograph shows the surface of an erythro- 
cyte that was frozen and then fractured, but rather than preparing a replica, the cell 
was thawed, fixed, and labeled with a marker for the carbohydrate groups that 
project from the external surface of the integral protein glycophorin (Figure 4.18). 
Thin sections of the labeled, fractured cell reveal that glycophorin molecules (black 
particles) have preferentially segregated with the outer half of the membrane. The 
red line shows the path of the fracture plane. 

Source: (b) From Thomas W. Tillack and Vincent T. Marchesi, J. Cell Biol. 45:649, 
1970, Reproduced with permission of The Rockefeller University Press; (c) From 
Pedro Pinto Da Silva and Maria R. Torrisi, J. Cell Biol. 93:467, 1982, Reproduced 
with permission of The Rockefeller University Press. 


typically have a dynamic relationship with the membrane, being 
recruited to the membrane or released from the membrane depend- 
ing on prevailing conditions. 


Lipid-Anchored Membrane Proteins 


Several types of lipid-anchored membrane proteins can be distin- 
guished. Numerous proteins present on the external face of the 
plasma membrane are bound to the membrane by a small, complex 
oligosaccharide linked to a molecule of phosphatidylinositol that is 
embedded in the outer leaflet of the lipid bilayer (refer to Figure 4.13c). 
Peripheral membrane proteins containing this type of glycosyl- 
phosphatidylinositol linkage are called GPl-anchored proteins. 
They were discovered when it was shown that certain membrane 
proteins could be released by a phospholipase that specifically recog- 
nized and cleaved inositol-containing phospholipids. The normal 
cellular prion protein PrP© (page 63) is a GPI-linked molecule, as are 
various receptors, enzymes, and cell-adhesion proteins. A rare type 
of anemia, paroxysmal nocturnal hemoglobinuria, results from a 
deficiency in GPI synthesis that makes red blood cells susceptible 
to lysis. 

Another group of proteins present on the cytoplasmic side of the 
plasma membrane is anchored to the membrane by one or more 
long hydrocarbon chains embedded in the inner leaflet of the lipid 
bilayer (refer to Figure 4.13c and accompanying legend). At least two 
proteins associated with the plasma membrane in this way (Src and 
Ras) have been implicated in the transformation of a normal cell to 
a malignant state. 
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1. What is meant by the statement that the proteins of a 
membrane are distributed asymmetrically? Is this also 
true of the membrane’s carbohydrate? 


2. Describe the properties of the three classes of 
membrane proteins (integral, peripheral, and lipid- 
anchored), how they differ from one another, and how 
they vary among themselves. 


e—____! 


4.5 Studying the Structure 
and Properties of Integral 
Membrane Proteins 


Because of their hydrophobic transmembrane domains, integral 
membrane proteins are difficult to isolate in a soluble form. Removal 
of these proteins from the membrane normally requires the use of a 
detergent, such as the ionic (charged) detergent SDS (which dena- 
tures proteins) or the nonionic (uncharged) detergent Triton X-100 
(which generally does not alter a protein’s tertiary structure) 


CH3—(CHp2)13—OSO3-Na* 
Sodium dodecyl sulfate (SDS) 


CH, CH; 
CH . CH e CY (OCH —CHs) p00 
3 2 2 2/10 

CH; CH; 


Triton X-100 


Like membrane lipids, detergents are amphipathic, being composed 
of a polar end and a nonpolar hydrocarbon chain (see Figure 2.20). 
As a consequence of their structure, detergents can substitute for 
phospholipids in stabilizing integral proteins while rendering them 
soluble in aqueous solution (FIGURE 4.16). Once the proteins have 
been solubilized by the detergent, various analyses can be carried out 
to determine the protein’s amino acid composition, molecular mass, 
amino acid sequence, and so forth. 
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FIGURE 4.16 Solubilization of membrane proteins with detergents. 
The nonpolar ends of the detergent molecules associate with the nonpolar 
residues of the protein that had previously been in contact with the fatty 
acyl chains of the lipid bilayer. In contrast, the polar ends of the detergent 
molecules interact with the surrounding water molecules, keeping the 
protein in solution. Nonionic detergents, as shown here, solubilize 
membrane proteins without disrupting their structure. 


Researchers have had great difficulty obtaining crystals of most 
integral membrane proteins for use in X-ray crystallography. In fact, 
fewer than 1 percent of the known high-resolution protein struc- 
tures represent integral membrane proteins.” Furthermore, most of 
these structures represent prokaryotic versions of a particular pro- 
tein, which are often smaller than their eukaryotic homologues and 
easier to obtain in large quantity. Once the structure of one member 
of a membrane protein family is determined, researchers can usually 
apply a strategy called homology modeling to learn about the struc- 
ture and activity of other members of the family. For example, solu- 
tion of the structure of the bacterial K* channel KcsA (shown in 
Figure 4.39) provided a wealth of data that could be applied to the 
structure and mechanism of action of the much more complex 
eukaryotic K* channels (Figure 4.42). 

One of the first membrane proteins whose entire three- 
dimensional structure was determined by X-ray crystallography is 
shown in FIGURE 4.17a. This protein—the bacterial photosynthetic 
reaction center—consists of three subunits containing 11 membrane- 
spanning a helices. Most membrane proteins have not been as 
amenable to crystallization as the photosynthetic reaction center. 


Many integral membrane proteins have a substantial portion that is present in the 
cytoplasm or extracellular space. In many cases, this soluble portion has been cleaved 
from its transmembrane domain, crystallized, and its tertiary structure determined. 
While this approach provides valuable data about the protein, it fails to provide infor- 
mation about the proteins orientation within the membrane. 


(a) 


(b) (c) 


FIGURE 4.17 Structure of integral proteins. (a) Tertiary structure of the 
photosynthetic reaction center of a bacterium as determined by X-ray 
crystallography. The protein contains three different membrane-spanning 
polypeptides, shown in yellow, light blue, and dark blue. The helical nature 
of each of the transmembrane segments is evident. (b) High resolution 
electron microscopy images of the TRPV ion channel reconstructed by 
averaging many individual electron micrographs of proteins frozen in ice at 
low temperatures (Cryo-electron microscopy). (c) Structure of the TRPV 
channel determined by electron microscopy. 

Source: (a) From G. Feher, J. P. Allen, M. Y. Okamura, and D. C. Rees, 
Nature 339:113, © 1989; Reprinted by permission from Macmillan 
Publishers Ltd.; (b, c) From Maofu Liao et al., Nature 504:107. 2013. 


Among the obstacles, many membrane proteins (1) are present at 
low numbers per cell, (2) are unstable in the detergent-containing 
solutions required for their extraction, (3) are prone to aggregation, 
and (4) are heavily modified by glycosylation and cannot be 
expressed as recombinant proteins in other types of cells. Some of 
the technical difficulties in preparing membrane protein crystals 
have been overcome using new methodologies and laborious efforts. 
In one study, for example researchers were able to obtain high-quality 
crystals of a bacterial transporter after testing and refining more 
than 95,000 different conditions for crystallization. In some cases, 
mutant versions of a membrane protein are found to be better suited 
to forming the ordered arrays of molecules that make up a crystal 
lattice. In many other cases, crystallization has been achieved by 
covalently linking the membrane protein to other molecules, often 
small soluble proteins. Recent developments in electron microscopy 
technology have allowed membrane protein structure to be deter- 
mined at high resolution by imaging large numbers of identical pro- 
teins at very low temperatures to reduce imaging noise, then averaging 
their images together (Figure 4.17b). Because the microscopy is done 
at very low temperatures it is known as cyro-electron microscopy 
(See the Experimental Walkthrough Video: Solving protein structures using 
cryo-EM for this chapter). Using this approach, it is now possible to see 
individual amino acid side chains and reconstruct a molecular struc- 
ture entirely from electron microscopy information (Figure 4.17¢c). 

Despite increasing success in protein crystallization and rapid 
advances in cryo-electron microscopy, researchers still rely largely 
on indirect approaches for determining the three-dimensional 
organization of most membrane proteins. We will examine some of 
these approaches in the following paragraphs. 


Identifying Transmembrane Domains 


A great deal can be learned about the structure of a membrane protein 
and its orientation within the lipid bilayer from a computer-based 
(computational) analysis of its amino acid sequence, which is readily 
deduced from the nucleotide sequence of an isolated gene. The first 
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question one might ask is: Which segments of the polypeptide chain 127 


are actually embedded in the lipid bilayer? Those segments of a protein 
embedded within the membrane, which are described as the trans- 
membrane domains, have a simple structure; they consist of a 
string of about 20 predominantly nonpolar amino acids that span the 
core of the lipid bilayer as an a helix.’ The chemical structure of a single 
transmembrane helix is shown in FIGURE 4.18, which depicts the two- 
dimensional structure of glycophorin A, the major integral protein of 
the erythrocyte plasma membrane. Of the 20 amino acids that make 
up the lone « helix of a glycophorin monomer (amino acids 73 to 92 of 
Figure 4.18), all but three have hydrophobic side chains (or an H atom 
in the case of the glycine residues). The exceptions are serine and thre- 
onine, which arenoncharged, polar residues (Figure 2.26). FIGURE 4.19a 
shows a portion of a transmembrane helix with a threonine residue, 
not unlike those of glycophorin A. The hydroxyl group of the residue’s 
side chain can form a hydrogen bond with one of the oxygen atoms of 
the peptide backbone. Fully charged residues may also appear in trans- 
membrane helices, but they tend to be near one of the ends of the helix 
and to be accommodated in ways that allow them to fit into their 
hydrophobic environment. This is illustrated in the model transmem- 
brane helices shown in Figures 4.19b and c. Each of the helices in these 
figures contains a pair of charged residues whose side chains are able to 
reach out and interact with the innermost polar regions of the mem- 
brane, even if it requires distorting the helix to do so. Figure 4.19d 
shows two tyrosine residues with their hydrophobic aromatic side 
chains; each aromatic ring is oriented parallel with the hydrocarbon 
chains of the bilayer with which it has become integrated. 

Knowing the amino acid sequence of an integral membrane 
protein, we can usually identify the transmembrane segments using 


3It was noted on page 54 that the « helix is a favored conformation because it allows for a 
maximum number of hydrogen bonds to be formed between atoms of the polypeptide 
backbone, thereby creating a highly stable (low-energy) configuration. This is particu- 
larly important for a membrane-spanning polypeptide that is surrounded by fatty acyl 
chains and, thus, cannot form hydrogen bonds with an aqueous solvent. Transmembrane 
helices are at least 20 amino acids in length, because this is the minimum stretch of 
polypeptide capable of spanning the hydrocarbon core of a lipid bilayer of 30 A width. 
A few integral membrane proteins have been found to contain loops or helices that 
penetrate but do not span the bilayer. An example is the P helix in Figure 4.39. 


FIGURE 4.18 Glycophorin A, an integral protein with a single 
transmembrane domain. The single a helix that passes through the 
membrane consists predominantly of hydrophobic residues (brown- 
colored circles). The four positively charged amino acid residues of 
the cytoplasmic domain of the membrane protein form ionic bonds 
with negatively charged lipid head groups. Carbohydrates are 
attached to a number of amino acid residues on the outer surface of 
the protein (shown in the inset). All but one of the 16 oligosaccha- 
rides are small O-linked chains (the exception is a larger oligosac- 
charide linked to the asparagine residue at position 26). 
Glycophorin molecules are present as homodimers within the 
erythrocyte membrane (Figure 4.32d). The two helices of the dimer 
cross over one another in the region between residues 79 and 83. 
This Gly-X-X-X-Gly sequence is commonly found where transmem- 
brane helices come into close proximity. 
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FIGURE 4.19 Accommodating various amino acid residues within transmembrane helices. (a) In this portrait of a small portion of a transmembrane 
helix, the hydroxyl group of the threonine side chain (arrow) is able to form a (shared) hydrogen bond with a backbone oxygen within the lipid bilayer. 
Hydrogen bonds are indicated by the dashed lines and their distances are shown in angstroms. (b) The side chains of the two lysine residues of this 
transmembrane helix are sufficiently long and flexible to form bonds with the head groups and water molecules of the polar face of the lipid bilayer. (c) The 
side chains of the two aspartic acid residues of this transmembrane helix can also reach the polar face of the bilayer but introduce distortion in the helix to 
do so. (d) The aromatic side chains of the two tyrosine residues of this transmembrane helix are oriented perpendicular to the axis of the membrane and 


parallel to the fatty acyl chains with which they interact. 


Source: (a-d) From Anna C. V. Johansson and Erik Lindahl, Biophys. J. 91:4459, 4453, 2006, © 2006, with permission from Elsevier. 


a hydropathy plot, in which each site along a polypeptide is assigned a 
value that provides a measure of the hydrophobicity of the amino 
acid at that site as well as that of its neighbors. This approach provides 
a “running average” of the hydrophobicity of short sections of the 
polypeptide and guarantees that one or a few polar amino acids ina 
sequence do notalter the profile of the entire stretch. Hydrophobicity 
of amino acids can be determined using various criteria, such as 
their lipid solubility or the energy that would be required to transfer 
them from a nonpolar medium into an aqueous medium. A hydrop- 
athy plot for glycophorin A is shown in FIGURE 4.20. Transmembrane 
segments are usually identified as a jagged peak that extends well 
into the hydrophobic side of the spectrum. A reliable prediction 
concerning the orientation of the transmembrane segment within 
the bilayer can usually be made by examining the flanking amino 
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FIGURE 4.20 Hydropathy plot for glycophorin A, a single membrane- 
spanning protein. Hydrophobicity is measured by the free energy required 
to transfer each segment of the polypeptide from a nonpolar solvent to an 
aqueous medium. Values above the 0 line are energy-requiring (+AGs), 
indicating predominantly nonpolar side chains. Peaks that project above 
the red-colored line indicate continuous stretches of mostly nonpolar amino 
acids, and are interpreted as transmembrane domains. 


acid residues. In most cases, as illustrated by glycophorin in Figure 
4.18, those parts of the polypeptide at the cytoplasmic flank of a 
transmembrane segment tend to be more positively charged than 
those at the extracellular flank. 

Not all integral membrane proteins contain transmembrane o 
helices. A number of membrane proteins contain a relatively large 
channel positioned within a circle of membrane-spanning { strands 
organized into a barrel as illustrated in Figure 5.4. To date, aqueous 
channels constructed of f barrels have only been found in the outer 
membranes of bacteria, mitochondria, and chloroplasts. 


Experimental Approaches to Identifying 
Conformational Changes within an 
Integral Membrane Protein 


Suppose you have isolated a gene for an integral membrane protein 
and, based on its nucleotide sequence, determined that it contains 12 
apparent membrane-spanning « helices. You might want to know 
which sites within these transmembrane domains are accessible to 
the aqueous environment and how this changes as the protein car- 
ries out its function. This type of information is particularly impor- 
tant when working with a membrane channel or transporter that 
functions in the movement of hydrophilic substances across the 
membrane. Although these determinations are difficult to make 
without detailed structural models, considerable insight can be 
gained by site-directed mutagenesis, that is, by introducing specific 
changes into the gene that encodes the protein (Section 18.25). An 
experiment of this type, which was conducted on lactose permease, 
a sugar-transporting protein in bacterial cell membranes, is shown 
in FIGURE 4.21. In this experiment, researchers prepared different 
versions of the permease in which nearly all of the individual amino 
acids of the protein were replaced, one at a time, by a cysteine resi- 
due. Each mutated membrane protein was then incubated with a 
water-soluble reagent (NEM) that is capable of adding an alkyl group 
to the cysteine side chain—as long as it has access to that side chain. 
The incubations with NEM were carried out under two different 
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FIGURE 4.21 An experiment employing site-directed mutagenesis to learn about dynamic changes in the conformation of a membrane protein as 
it carries out its activity. The experimental strategy of the experiment and its results are discussed in the text. The cytoplasmic surface of the protein 
(lactose permease) is at the top. (a) The red spheres indicate the residues of the membrane protein that reacted with the alkylating agent NEM in the 
absence of a sugar to be transported. (b) The gold spheres denote the residues that are much more accessible to the alkylating agent when the protein is 
incubated with the sugar. The gold spheres in b are seen to cluster in a portion of the protein near the external medium (the periplasm in bacteria). The 
authors concluded that the gold spheres correspond to residues that line an outward-facing cavity, i.e., one that is open to the external medium. The results 
support a model of alternating access to the medium as depicted in part c. (Note the structure that is depicted in parts a—b is that of the inward-facing 


conformation as determined by X-ray crystallography.) 


Source: (a, b) From H. Ronald Kaback, et al., PNAS 104:492, 2007, © 2007 National Academy of Sciences, U.S.A.; (c) Reprinted from Current Opinion 
in Structural Biology 14:414, 2004 Fig. 1 by H. R. Kaback, with permission from Elsevier. 


conditions: either in the presence of the sugar to be transported or in 
its absence. The image in Figure 4.21a shows those residues (depicted 
as red spheres) of the protein that become alkylated by NEM in the 
absence of the sugar. Many of these accessible residues are thought to 
line an inward-facing hydrophilic cavity in the permease that is open 
to the cytoplasm. Those cysteine residues that are not labeled, most 
notably on helices VI and XII, are presumed to either face the fatty 
acyl chains of the phospholipids or to be located in tightly packed 
regions of the molecule, thus making them inaccessible to NEM. 
The image in Figure 4.21b shows those residues (depicted as 
gold spheres) that show a significant increase in reactivity to NEM 
when the permease is incubated with the sugar to be transported. 
The difference in NEM reactivity of the residues between the protein 
in the absence of the sugar (a) and the presence of the sugar (b) indi- 
cates that different parts of the protein are accessible to the aqueous 
medium under these two conditions. The results, in other words, 
suggest that addition of the sugar induces a conformational change 
in the permease. Careful analysis of the positions of the residues that 
become labeled in these two conformations supports the model 
shown in Figure 4.21c. According to this model of alternating access, 
the sugar-binding site is open to the cytoplasm of the bacterial cell in 
one conformation and is open to the extracellular space in the other 
conformation. Alternation between the two conformations brings 
about transport of the sugar across the membrane. This type of ion- 
driven transport system is discussed further on page 155. 
Determining spatial relationships between specific amino acids 
in amembrane protein is another approach to learning about dynamic 
events that occur as a protein carries out its function. One way to learn 
about the distance between selected residues in a protein is to intro- 
duce chemical groups whose properties are sensitive to the distance 
that separates them. Nitroxides are chemical groups that contain an 


unpaired electron, which produces a characteristic spectrum when 
monitored by a technique called electron paramagnetic resonance 
(EPR) spectroscopy. A nitroxide group can be introduced at any site in 
a protein by first mutating that site to a cysteine and then attaching the 
nitroxide to the —SH group of the cysteine residue. FIGURE 4.22 
shows how this technique was used to discover the conformational 
changes that occur in a membrane protein as its channel is opened in 
response to changes in the pH of the medium. The protein in ques- 
tion, a bacterial K* channel, is a tetramer composed of four identical 
subunits. The cytoplasmic opening to the channel is bounded by four 
transmembrane helices, one from each subunit of the protein. Figure 
4.22. shows the EPR spectra that were obtained when a nitroxide was 
introduced near the cytoplasmic end of each transmembrane helix. 
The red line shows the spectrum obtained at pH 6.5 when the channel 
is in the closed state, and the blue line shows the spectrum at pH 3.5 
when the channel is open. The shape of each line depends on the 
proximity of the nitroxides to one another. The spectrum is broader at 
pH 6.5 because the nitroxide groups on the four subunits are closer 
together at this pH, which decreases the intensity of their EPR signals. 
These results indicate that the activation of the channel is accompa- 
nied by an increased separation between the labeled residues of the 
four subunits (Figure 4.22b). An increase in diameter of the channel 
opening allows ions in the cytoplasm to reach the actual permeation 
pathway (shown in red) within the channel, which allows only the 
passage of K* ions (discussed on page 144). An alternate technique, 
called FRET, that can also be used to measure changes in the distance 
between labeled groups within a protein, is illustrated in Figure 18.8. 
As discussed in Chapter 2, NMR spectroscopy is another technique 
that provides information about distances between atoms in a protein 
structure, and recently NMR has become increasingly the method of 
choice for studying dynamics of membrane proteins. 
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FIGURE 4.22 Use of EPR spectroscopy to monitor changes in conformation of a bacterial K+ ion channel as it opens and closes. (a) EPR spectra from 
nitroxides that have been attached to cysteine residues near the cytoplasmic end of the four transmembrane helices that line the channel. The cysteine 
residue in each helix replaces a glycine residue that is normally at that position. The shapes of the spectra depend on the distances between unpaired 
electrons in the nitroxides on different subunits. (Nitroxides are described as “spin-labels,” and this technique is known as site-directed spin labeling.) 

(b) A highly schematic model of the ion channel in the open and closed states based on the data from part a. Opening of the channel is accompanied by 
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the movement of the four nitroxide groups apart from one another. 


Source: (a) Reprinted by permission from Macmillan Publishers Ltd: From E. Perozo et al., Nature Struct. Biol. 5:468, 1998. 


REVIEW | 4.6 Membrane Lipids 
1. Why are detergents necessary to solubilize membrane and Membrane Fluidity 


proteins? How might one determine the diversity of - ; er 
integral proteins that reside in a purified membrane The physical state of the lipid of a membrane is described by its fluidity 


fraction? (or viscosity).* Consider a simple artificial bilayer composed of phos- 
phatidylcholine and phosphatidylethanolamine, whose fatty acids are 
largely unsaturated. If the temperature of the bilayer is kept relatively 
warm (e.g., 37°C), the lipid exists in a relatively fluid state (FIGURE 4.23a). 


2. How can one determine: (1) the location of 
transmembrane segments in the amino acid sequence 


or (2) the relative locations of transmembrane helices 
with access to the external medium? “Bluidity and viscosity are inversely related; fluidity is a measure of the ease of flow, 
e—_______ and viscosity is a measure of the resistance to flow. 


(a) 


FIGURE 4.23 The structure of the lipid bilayer depends on the temperature. The bilayer shown here is composed of two phospholipids: phosphatidyl- 
choline and phosphatidylethanolamine. (a) Above the transition temperature, the lipid molecules and their hydrophobic tails are free to move in certain 
directions, even though they retain a considerable degree of order. (b) Below the transition temperature, the movement of the molecules is greatly 
restricted, and the entire bilayer can be described as a crystalline gel. 

Source: (a, b) R. N. Robertson, The Lively Membranes, Cambridge Univ. Press, 1983, reprinted with permission of Cambridge Univ. Press. 


At this temperature, the lipid bilayer is best described as a two- 
dimensional liquid crystal. As in a crystal, the molecules still retain a 
specified orientation; in this case, the long axes of the molecules tend 
toward a parallel arrangement, yet individual phospholipids can rotate 
around their axis or move laterally within the plane of the bilayer. If the 
temperature is slowly lowered, a point is reached where the bilayer dis- 
tinctly changes (Figure 4.23b). The lipid is converted from a liquid 
crystalline phase to a frozen crystalline gel in which the movement of 
the phospholipid fatty acid chains is greatly restricted. The temperature 
at which this change occurs is called the transition temperature. 

The transition temperature of a particular bilayer depends on the 
ability of the lipid molecules to be packed together, which depends in 
turn on the particular lipids of which it is constructed. Saturated fatty 
acids have the shape of a straight, flexible rod. Cis-unsaturated fatty 
acids, on the other hand, have crooks in the chain at the sites of a dou- 
ble bond (Figures 2.19 and 4.23). Consequently, phospholipids with 
saturated chains pack together more tightly than those containing 
unsaturated chains. The greater the degree of unsaturation of the fatty 
acids of the bilayer, the Jower the temperature before the bilayer gels. 
The introduction of one double bond in a molecule of stearic acid low- 
ers the melting temperature almost 60°C (Table 4.2).° Another factor 
that influences bilayer fluidity is fatty acid chain length. The shorter 
the fatty acyl chains of a phospholipid, the lower its melting tempera- 
ture. The physical state of the membrane is also affected by cholesterol. 
Because of their orientation within the bilayer (Figure 4.7), cholesterol 
molecules disrupt the close packing of fatty acyl chains and interfere 
with their mobility. The presence of cholesterol tends to abolish sharp 
transition temperatures and creates a condition of intermediate fluid- 
ity. In physiologic terms, cholesterol tends to increase the durability 
while decreasing the permeability of a membrane. 


The Importance of Membrane Fluidity 


What effect does the physical state of the lipid bilayer have on the 
biological properties of the membrane? Membrane fluidity provides 
a perfect compromise between a rigid, ordered structure in which 
mobility would be absent and a completely fluid, nonviscous liquid 
in which the components of the membrane could not be oriented 
and structural organization and mechanical support would be lack- 
ing. In addition, fluidity allows for interactions to take place within 
the membrane. For example, membrane fluidity makes it possible 
for clusters of membrane proteins to assemble at particular sites 
within the membrane and form specialized structures, such as inter- 
cellular junctions, light-capturing photosynthetic complexes, and 
synapses. Because of membrane fluidity, molecules that interact can 
come together, carry out the necessary reaction, and move apart. 


*The effect of fatty acid saturation on melting temperature is illustrated by familiar 
food products. Vegetable oils remain a liquid in the refrigerator, whereas margarine is 
a solid. Vegetable oils contain polyunsaturated fatty acids, whereas the fatty acids of 
margarine have been saturated by a chemical process that hydrogenates the double 
bonds of the vegetable oils used as the starting material. 


TABLE 4.2 Melting Points of the Common 18-Carbon Fatty Acids 


cis Double bonds 


Fatty acid 


M.p. (°C) 


Stearic acid 0 70 
Oleic acid 1 13 
Linoleic acid =—7 
Linolenic acid -17 
Eicosapentanoic acid (EPA)* —54 


*EPA has 20 carbons. 


Fluidity also plays a key role in membrane assembly, a subject dis- 131 


cussed in Chapter 8. Membranes arise only from preexisting mem- 
branes, and their growth is accomplished by the insertion of lipids and 
proteins into the fluid matrix of the membranous sheet. Many of the 
most basic cellular processes, including cell movement, cell growth, cell 
division, formation of intercellular junctions, secretion, and endocyto- 
sis, depend on the movement of membrane components and would 
probably not be possible if membranes were rigid, nonfluid structures. 


Maintaining Membrane Fluidity 


The internal temperature of most organisms (other than birds and 
mammals) fluctuates with the temperature of the external environ- 
ment. Because it is essential for many activities that the membranes of 
a cell remain in a fluid state, cells respond to changing conditions 
by altering the types of phospholipids of which they are made. 
Maintenance of membrane fluidity is an example of homeostasis at 
the cellular level and can be demonstrated in various ways. For example, 
if the temperature of a culture of cells is lowered, the cells respond met- 
abolically. The initial “emergency” response is mediated by enzymes that 
remodel membranes, making the cell more cold resistant. Remodeling 
is accomplished by (1) desaturating single bonds in fatty acyl chains to 
form double bonds, and (2) reshuffling the chains between different 
phospholipid molecules to produce ones that contain two unsaturated 
fatty acids, which greatly lowers the melting temperature of the bilayer. 
Desaturation of single bonds to form double bonds is catalyzed by 
enzymes called desaturases. Reshuffling is accomplished by phospholi- 
pases, which split the fatty acid from the glycerol backbone, and acyl- 
transferases, which transfer fatty acids between phospholipids. In 
addition, the cell changes the types of phospholipids being synthesized 
in favor of ones containing more unsaturated fatty acids. As a result of 
the activities of these various enzymes, the physical properties of a cell's 
membranes are matched to the prevailing environmental conditions. 
Maintenance of fluid membranes by adjustments in fatty acyl composi- 
tion has been demonstrated in a variety of organisms, including hiber- 
nating mammals, pond-dwelling fish whose body temperature changes 
markedly from day to night, cold-resistant plants, and bacteria living in 
hot springs. Prokaryotic cells that live at very high temperatures have 
plasma membranes that contain highly unusual lipids. 


Lipid Rafts 


Every so often an issue emerges that splits the community of cell biol- 
ogists into believers and nonbelievers. The issue of lipid rafts falls into 
this category. When membrane lipids are extracted from cells and 
used to prepare artificial lipid bilayers, cholesterol and sphingolipids 
tend to self-assemble into microdomains that are more gelated and 
highly ordered than surrounding regions consisting primarily of 
phosphoglycerides. Because of their distinctive physical properties, 
such microdomains tend to float within the more fluid and disor- 
dered environment of the artificial bilayer (FIGURE 4.24a). Asa result, 
these patches of cholesterol and sphingolipid are referred to as lipid 
rafts. When added to these artificial bilayers, certain proteins tend to 
become concentrated in the lipid rafts, whereas others tend to remain 
outside their boundaries. GPI-anchored proteins show a particular 
fondness for the ordered regions of the bilayer (Figure 4.24a). 

The controversy arises over whether similar types of sphingolipid- 
and cholesterol-rich lipid rafts, such as that illustrated in Figure 4.24b, 
exist within living cells. Most of the evidence in favor of lipid rafts 
is derived from studies that employ unnatural treatments, such as 
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FIGURE 4.24 Lipid rafts. (a) Image of the upper surface of an artificial lipid bilayer containing phosphatidylcholine, which appears as the black 
background, and sphingomyelin molecules, which organize themselves spontaneously into the orange-colored rafts. The yellow peaks show the 

positions of a GPl-anchored protein, which is almost exclusively raft-associated. This image is provided by an atomic force microscope, which measures 

the height of various parts of the specimen at the molecular level. (6) Schematic model of a lipid raft within a cell. The outer leaflet of the raft consists 
primarily of cholesterol (yellow) and sphingolipids (red head groups). Phosphatidylcholine molecules (blue head groups) with long saturated fatty acids 

also tend to concentrate in this region. GPl-anchored proteins are thought to become concentrated in lipid rafts. The lipids in the outer leaflet of 

the raft have an organizing effect on the lipids of the inner leaflet. As a result, the inner-leaflet raft lipids consist primarily of cholesterol and 
glycerophospholipids with long saturated fatty acyl tails. The inner leaflet tends to concentrate lipid-anchored proteins, such as Src kinase, that are involved 
in cell signaling. (The controversy over the existence of lipid rafts is discussed in Nature Revs. Mol. Cell Biol. 11:688, 2010 and Science 334:1046, 2011.) 


Source: (a) From D. E. Saslowsky, et al., J. Biol. Chem. 277, cover of #30, July 26, 2002; (b) Courtesy of J. Michael Edwardson © 2002 The American Society 


for Biochemistry and Molecular Biology. 


detergent extraction, cholesterol depletion, or crosslinking of mem- 
brane proteins. The artificial nature of these treatments makes the 
results difficult to interpret. Attempts to demonstrate the presence 
of lipid rafts in living cells have generally been unsuccessful, which 
can either mean that such rafts do not exist or they are so small (5 to 
25 nm diameter) and short-lived as to be difficult to detect with cur- 
rent techniques. The concept of lipid rafts is very appealing because it 
provides a means to introduce order into a seemingly random sea of 
lipid molecules. Lipid rafts are postulated to serve as floating plat- 
forms that concentrate particular proteins, thereby organizing the 
membrane into functional compartments (Figure 4.24b). For exam- 
ple, lipid rafts are thought to provide a favorable local environment 
for cell-surface receptors to interact with other membrane proteins 
that transmit signals from the extracellular space to the cell interior. 


REVIEW | 


1. What is the importance of fatty acid unsaturation for 
membrane fluidity? Of enzymes that are able to 
desaturate fatty acids? 


2. What is meant by a membrane'’s transition 
temperature, and how is it affected by the degree of 
saturation or length of fatty acyl chains? How are these 
properties important in the formation of lipid rafts? 


3. Why is membrane fluidity important to a cell? 


4. How can the two sides of a lipid bilayer have different 
ionic charges? 


eo 


4.7 The Dynamic Nature 
of the Plasma Membrane 


It is apparent from the previous discussion that the lipid bilayer can 
exist in a relatively fluid state. As a result, a phospholipid can move 
laterally within the same leaflet with considerable ease. The mobility 
of individual lipid molecules within the bilayer of the plasma mem- 
brane can be directly observed under the microscope by linking the 
polar heads of the lipids to gold particles or fluorescent compounds 
(see Figure 4.29). It is estimated that a phospholipid can diffuse from 
one end of a bacterium to the other end in a second or two. In con- 
trast, it takes a phospholipid molecule a matter of hours to days to 
move across to the other leaflet. Thus, of all the possible motions 
that a phospholipid can make, its flip-flop to the other side of the 
membrane is the most restricted (FIGURE 4.25). This finding is not 
surprising. For flip-flop to occur, the hydrophilic head group of the 
lipid must pass through the internal hydrophobic sheet of the mem- 
brane, which is thermodynamically unfavorable. However, cells con- 
tain enzymes that actively move certain phospholipids from one 
leaflet to the other. These enzymes, known as flippases, play a role in 
establishing lipid asymmetry. 

Because lipids provide the matrix in which integral proteins of a 
membrane are embedded, the physical state of the lipid is an impor- 
tant determinant of the mobility of integral proteins. The demon- 
stration that integral proteins can move within the plane of the 
membrane was a cornerstone in the formulation of the fluid-mosaic 
model. The dynamic properties of membrane proteins have been 
revealed in several ways. 
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FIGURE 4.25 The possible movements of phospholipids in a 
membrane. The types of movements in which membrane phospholipids 
can engage and the approximate time scales over which they occur. 
Whereas phospholipids move from one leaflet to another at a very slow 
rate, they diffuse laterally within a leaflet rapidly. Lipids lacking polar 
groups, such as cholesterol, can move across the bilayer quite rapidly. 


The Diffusion of Membrane 
Proteins after Cell Fusion 


Cell fusion is a technique whereby two different types of cells, or 
cells from two different species, can be fused to produce one cell with 
a common cytoplasm and a single, continuous plasma membrane. 
Cells are induced to fuse with one another by making the outer sur- 
face of the cells “sticky” so that their plasma membranes adhere to one 
another. Cells can be induced to fuse by addition of certain inacti- 
vated viruses that attach to the surface membrane, by adding the com- 
pound polyethylene glycol, or by a mild electric shock. Cell fusion has 
played an important role in cell biology and is currently used in an 
invaluable technique to prepare specific antibodies (Section 18.26). 
The first experiments to demonstrate that membrane proteins 
could move within the plane of the membrane utilized cell fusion, 
and they were reported in 1970 by Larry Frye and Michael Edidin of 
Johns Hopkins University. In their experiments, mouse and human 
cells were fused, and the locations of specific proteins of the plasma 
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membrane were followed once the two membranes had become 133 


continuous. To follow the distribution of either the mouse mem- 
brane proteins or the human membrane proteins at various times 
after fusion, antibodies against one or the other type of protein were 
prepared and covalently linked to fluorescent dyes. The antibodies 
against the mouse proteins were complexed with a dye that fluo- 
resces green and the antibodies against human proteins with one 
that fluoresces red. When the antibodies were added to fused cells, 
they bound to the human or mouse proteins and could be located 
under a fluorescence light microscope (FIGURE 4.26a). At the time of 
fusion, the plasma membrane appeared half human and half mouse; 
that is, the two protein types remained segregated in their own hem- 
isphere (step 3, Figure 4.26a,b). As the time after fusion increased, 
the membrane proteins were seen to move laterally within the mem- 
brane into the opposite hemisphere. By about 40 minutes, each spe- 
cies’ proteins were uniformly distributed around the entire hybrid 
cell membrane (step 4, Figure 4.26a). If the same experiment was 
performed at lower temperature, the viscosity of the lipid bilayer 
increased, and the mobility of the membrane proteins decreased. 
These early cell fusion experiments gave the impression that integral 
membrane proteins were capable of virtually unrestricted move- 
ments. As we will see shortly, subsequent studies made it apparent 
that membrane dynamics was a much more complex subject than 
first envisioned. 


Restrictions on Protein 
and Lipid Mobility 


Several techniques allow researchers to follow the movements of 
molecules in the membranes of living cells using the light micro- 
scope. In a technique called fluorescence recovery after pho- 
tobleaching (FRAP), which is illustrated in FIGURE 4.27a, integral 
membrane components in cultured cells are first labeled by linkage 
to a fluorescent dye. A particular membrane protein can be labeled 
using a specific probe, such as a fluorescent antibody. Once labeled, 
cells are placed under the microscope and irradiated by a sharply 
focused laser beam that bleaches the fluorescent molecules in its 
path, leaving a circular spot (typically about 1 jum diameter) on the 
surface of the cell that is largely devoid of fluorescence. If the labeled 


(b) 5 um 


FIGURE 4.26 The use of cell fusion to reveal mobility of membrane proteins. (a) Outline of the experiment in which human and mouse cells were fused 
(steps 1-2) and the distribution of the proteins on the surface of each cell were followed in the hybrids over time (steps 3-4). Mouse membrane proteins are 


indicated by solid circles, human membrane proteins by open circles. Locations of human and mouse proteins in the plasma membranes of the hybrid cells 


were monitored by interaction with fluorescent red and fluorescent green antibodies, respectively. (b) Micrograph showing a fused cell in which mouse and 


human proteins are still in their respective hemispheres (equivalent to the hybrid in step 3 of part a). 


Source: (b) From L. D. Frye and Michael Edidin, J. Cell Science 7:328, 334, 1970, By Permission of The Company of Biologists, Ltd. Courtesy of Michael 
Edidin, Johns Hopkins University.http://jcs.biologists.org/content/7/2/319 full.pdf+html?sid=d93ae648-abca-4f5d-90a6- a6d9726d7d30. 
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FIGURE 4.27 Measuring the diffusion rates of membrane proteins by 
fluorescence recovery after photobleaching (FRAP). (a) In this technique, 
a particular component of the membrane is first labeled with a fluorescent 
dye (step 1). A small region of the surface is then irradiated to bleach the 
dye molecules (step 2), and the recovery of fluorescence in the bleached 
region is followed over time (step 3). (b) The rate of fluorescence recovery 
within the illuminated spot can vary depending on the protein(s) being 
followed. The rate of recovery is related to the diffusion coefficient of the 
fluorescently labeled protein. 


proteins in the membrane are mobile, then the random movements 
of these molecules should produce a gradual reappearance of fluo- 
rescence in the irradiated circle. The rate of fluorescence recovery 
(Figure 4.27b) provides a direct measure of the rate of diffusion 


(expressed as a diffusion coefficient, D) of the mobile molecules. 
The extent of fluorescence recovery (expressed as a percentage of the 
original intensity) provides a measure of the percentage of the labeled 
molecules that are free to diffuse. 

Early studies utilizing FRAP suggested that (1) membrane pro- 
teins moved much more slowly in the plasma membrane of living 
cells than they would in a pure lipid bilayer and (2) a significant frac- 
tion of membrane proteins (30 to 70 percent) were not free to diffuse 
back into the irradiated circle. But the FRAP technique has its draw- 
backs. FRAP can only follow the average movement of a relatively 
large number of labeled molecules (hundreds to thousands) as they 
diffuse over a relatively large distance (e.g., 1 um). Asa result, research- 
ers using FRAP cannot distinguish between proteins that are truly 
immobile and ones that can only diffuse over a limited distance in 
the time allowed. To get around these limitations, alternate tech- 
niques have been developed that allow researchers to observe the 
movements of individual protein molecules over very short dis- 
tances and to determine how they might be restrained. 

In single-particle tracking (SPT), individual membrane pro- 
tein molecules are labeled, usually with fluorescent molecular tags 
that emit light under a microscope. The movements of the labeled 
molecules are then followed by a type of microscopy known as 
TIRE (Total Internal Reflection Fluorescence) that is specialized for 
imaging fluorescent molecules at the surface of cells. The results of 
these studies often depend on the particular protein being investigated. 
For example, 


e Some membrane proteins move randomly throughout the 
membrane (Figure 4.28, protein A), although generally at rates 
considerably less than would be measured in an artificial lipid 
bilayer. (If protein mobility were based strictly on physical 
parameters such as lipid viscosity and protein size, one would 
expect proteins to migrate with diffusion coefficients of approxi- 
mately 10-8 to 10-? cm/sec rather than 107'° to 10-!* cm?/sec, as 
is observed for molecules of this group.) The reasons for the 
reduced diffusion coefficient have been debated. 


e Some membrane proteins fail to move and are considered to be 
immobilized (Figure 4.28, protein B). 


e Insome cases, a protein is found to move in a highly directed 
(i.e., nonrandom) manner toward one part of the cell or another 
(Figure 4.28, protein C). For example, a particular membrane 
protein may tend to move toward the leading or the trailing edge 
of a moving cell. 


e In most studies, the largest fraction of protein species exhibit 
random (Brownian) movement within the membrane at rates 
consistent with free diffusion (diffusion coefficients about 
5 x 10°° cm*/sec), but the molecules are unable to migrate freely 
more than a few tenths of a micrometer. Long-range diffusion 
occurs but at slower rates, apparently because of the presence 
of a system of barriers. These barriers are discussed in the 
following sections. 


CONTROL OF MEMBRANE PROTEIN MOBILITY It 
is apparent that plasma membrane proteins are not totally free to drift 
around randomly on the lipid “sea.” Instead, this class of proteins is 
subjected to various influences that affect their mobility and, in doing 
so, promote intramembrane organization. Some membranes are 
crowded with proteins, so that the random movements of one mole- 
cule can be impeded by its neighbors (FIGURE 4.28, protein D). 


FIGURE 4.28 Patterns of movement of integral membrane proteins. 
Depending on the cell type and the conditions, integral membrane proteins 
can exhibit several different types of mobility. Protein A is capable of 
diffusing randomly throughout the membrane, though its rate of movement 
may be limited; protein B is immobilized as the result of its interaction with 
the underlying membrane skeleton; protein C is being moved in a particular 
direction as the result of its interaction with a motor protein at the 
cytoplasmic surface of the membrane; movement of protein D is restricted 
by other integral proteins of the membrane; movement of protein E is 
restricted by fences formed by proteins of the membrane skeleton, but it 
can hop into adjacent compartments through transient openings in a fence; 
movement of protein F is restrained by extracellular materials. 


The strongest influences on an integral membrane protein are thought 
to be exerted from just beneath the membrane on its cytoplasmic face. 
The plasma membranes of many cells possess a fibrillar network, or 
“membrane skeleton,’ consisting of peripheral proteins situated on the 
cytoplasmic surface of the membrane. A certain proportion of a mem- 
brane’s integral protein molecules are either tethered to the membrane 
skeleton (Figure 4.28, protein B) or otherwise restricted by it. 
Information concerning the presence of membrane barriers has 
been obtained using an innovative technique that allows investiga- 
tors to trap integral proteins and drag them through the plasma 
membrane with a known force. This technique, which uses an appa- 
ratus referred to as optical tweezers, takes advantage of the tiny opti- 
cal forces that are generated by a focused laser beam. The integral 
proteins to be studied are tagged with antibody-coated beads, which 
serve as handles that can be gripped by the laser beam. It is generally 
found that optical tweezers can drag an integral protein for a limited 
distance before the protein encounters a barrier that causes it to be 
released from the laser’s grip. As it is released, the protein typically 
springs backward, suggesting that the barriers are elastic structures. 
One approach to studying factors that affect membrane protein 
mobility is to genetically modify cells so that they produce altered 
membrane proteins. Integral proteins whose cytoplasmic portions 
have been genetically deleted often move much greater distances 
than their intact counterparts, indicating that barriers reside on the 
cytoplasmic side of the membrane. These findings suggest that the 
membrane’s underlying skeleton forms a network of “fences” around 
portions of the membrane, creating compartments that restrict the 
distance an integral protein can travel (Figure 4.28, protein E). 
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another through breaks in the fences. Such openings are thought to 
appear and disappear along with the dynamic disassembly and reas- 
sembly of parts of the meshwork. Membrane compartments may | 
keep specific combinations of proteins in close enough proximity to | 
facilitate their interaction. 

Integral proteins lacking that portion that would normally pro- | 
ject into the extracellular space typically move at a much faster rate | 
than the wild-type version of the protein. This finding suggests that | 
the movement of a transmembrane protein through the bilayer is | 
slowed by extracellular materials that can entangle the external por- | 
tion of the protein molecule (Figure 4.28, protein F). 


MEMBRANE LIPID MOBILITY Proteins are huge mole- 
cules, so it isn’t surprising that their movement within the lipid 
bilayer might be restricted. Phospholipids, by comparison, are small 
molecules that make up the very fabric of the lipid bilayer. One 
might expect their movements to be completely unfettered, yet a 
number of studies have suggested that phospholipid diffusion is also | 
restricted. When individual phospholipid molecules of a plasma | 
membrane are tagged and followed under the microscope using | 
ultra-high-speed cameras, they are seen to be confined for very brief 
periods and then hop from one confined area to another. FIGURE 4.29a 
shows the path taken by an individual phospholipid within the 


Finish 


Finish 
(a) (b) 


FIGURE 4.29 Experimental demonstration that diffusion of phospho- 
lipids within the plasma membrane is confined. (a) The track of a single 
labeled unsaturated phospholipid is followed for 56 ms as it diffuses within 
the plasma membrane of a rat fibroblast. Phospholipids diffuse freely within 
a confined compartment before hopping into a neighboring compartment. 
The rate of diffusion within a compartment is as rapid as that expected by 
unhindered Brownian movement. However, the overall rate of diffusion of 
the phospholipid appears slowed because the molecule must hop a barrier 
to continue its movement. The movement of the phospholipid within each 
compartment is represented by a single color. (6) The same experiment 
shown in a is carried out for 33 ms in an artificial bilayer, which lacks the 
“picket fences” present in a cellular membrane. The much more open, 
extended trajectory of the phospholipid can now be explained by simple, 
unconfined Brownian movement. For the sake of comparison, fake 
compartments were assigned in b and indicated by different colors. 
Source: From Takahiro Fujiwara et al., Courtesy of Akihiro Kusumi, Nagoya, 
J. Cell Biol. 157:1073, 2002; Reproduced with permission of The 
Rockefeller University Press. 
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136 plasma membrane over a period of 56 milliseconds. Computer analy- 
sis indicates that the phospholipid diffuses freely within one compart- 
ment (shaded in purple) before it jumps the “fence” into a neighboring 
compartment (shaded in blue) and then over another fence into an 
adjacent compartment (shaded in green), and so forth. Treatment of 
the membrane with agents that disrupt the underlying membrane 
skeleton removes some of the fences that restrict phospholipid diffu- 
sion. But if the membrane skeleton lies beneath the lipid bilayer, how 
can it interfere with phospholipid movement? The authors of these 
studies speculate that the fences are constructed of rows of integral 
membrane proteins whose cytoplasmic domains are attached to 
the membrane skeleton. This is not unlike the confinement of 
horses or cows by a picket fence whose posts are embedded in the 
underlying soil. 


MEMBRANE DOMAINS AND CELL POLARITY For 
the most part, studies of membrane dynamics, such as those discussed 
above, are carried out on the relatively homogeneous plasma mem- 
brane situated at the upper or lower surface of a cell residing on a cul- 
ture dish. Most membranes, however, exhibit distinct variations in 
protein composition and mobility, especially in cells whose various 
surfaces display distinct functions. For example, the epithelial cells that 
line the intestinal wall or make up the microscopic tubules of the kid- 
ney are highly polarized cells whose different surfaces carry out differ- 
ent functions (FIGURE 4.30). Studies indicate that the apical plasma 
membrane, which selectively absorbs substances from the lumen, pos- 
sesses different enzymes than the lateral plasma membrane, which 
interacts with neighboring epithelial cells, or the basal membrane, 
which adheres to an underlying extracellular substrate (a basement 
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FIGURE 4.30 Differentiated functions of the plasma membrane of an 
epithelial cell. The apical surface of this intestinal epithelial cell contains 
integral proteins that function in ion transport and hydrolysis of disaccha- 
rides, such as sucrose and lactose; the lateral surface contains integral 
proteins that function in intercellular interaction; and the basal surface 
contains integral proteins that function in the association of the cell with the 
underlying basement membrane. 


membrane). In other examples, the receptors for neurotransmitter 
substances are concentrated into regions of the plasma membrane 
located within synapses (see Figure 4.57), and receptors for low-density 
lipoproteins are concentrated into patches of the plasma membrane 
specialized to facilitate their internalization (see Figure 8.38). 

Of all the various types of mammalian cells, sperm may have the 
most highly differentiated structure. A mature sperm can be divided 
into head, midpiece, and tail, each having its own specialized func- 
tions. Although divided into a number of distinct parts, a sperm is 
covered by a continuous plasma membrane which, as revealed by 
numerous techniques, consists of a mosaic of different types of local- 
ized domains. For example, when sperm are treated with a variety of 
antibodies, each antibody combines with the surface of the cell in a 
unique topographic pattern that reflects the distribution within the 
plasma membrane of the particular protein antigen recognized by 
that antibody (FIGURE 4.31). 


REVIEW | 


1. Describe two techniques to measure the rates of 
diffusion of a specific membrane protein. 


2. Compare and contrast the types of protein mobility 
depicted in Figure 4.28. 


3. Compare the rate of lateral diffusion of a lipid 
with that of flip-flop. What is the reason for the 


difference? 


eo 


& 


ry 
(a) (b) 
| Anterior head 
Posterior head 
~~ 
| Posterior tail 
(c) 30um = (d) 


FIGURE 4.31 Differentiation of the mammalian sperm plasma 
membrane as revealed by fluorescent antibodies. (a—c) Three pairs of 
micrographs, each showing the distribution of a particular protein at the 
cell surface as revealed by a bound fluorescent antibody. The three proteins 
are localized in different parts of the continuous sperm membrane. Each 
pair of photographs shows the fluoresence pattern of the bound antibody 
and a phase contrast micrograph of the same cell. (d) Diagram summarizing 
the distribution of the proteins. 

Source: (ac) From Diana G. Myles, Paul Primakoff, and Anthony R. Bellvé, 
Cell 23:434,1981 © with permission from Elsevier. 


4.8 The Red Blood Cell: 
An Example of Plasma 
Membrane Structure 


Of all the diverse types of membranes, the plasma membrane of the 
human erythrocyte (red blood cell) is the most studied and best 
understood (FIGURE 4.32a). There are several reasons for the popu- 
larity of this membrane. The cells are inexpensive to obtain and 
readily available in huge numbers from whole blood. They are 
already present as single cells and need not be dissociated from a 
complex tissue. The cells are simple by comparison with other cell 
types, lacking nuclear and cytoplasmic membranes that inevitably 
contaminate plasma membrane preparations from other cells. In 
addition, purified, intact erythrocyte plasma membranes can be 
obtained simply by placing the cells in a dilute (hypotonic) salt solu- 
tion. The cells respond to this osmotic shock by taking up water and 
swelling, a phenomenon termed hemolysis. As the surface area of 
each cell increases, the cell becomes leaky, and the contents, com- 
posed almost totally of dissolved hemoglobin, flow out of the cell 
leaving behind a plasma membrane “ghost” (Figure 4.320). 

Once erythrocyte plasma membranes are isolated, the proteins 
can be solubilized and separated from one another (fractionated), 
providing a better idea of the diversity of proteins within the mem- 
brane. Fractionation of membrane proteins can be accomplished 
using polyacrylamide gel electrophoresis (PAGE) in the presence of 
the ionic detergent sodium dodecyl sulfate (SDS). (The technique of 
SDS-PAGE is discussed in Section 18.13.) The SDS keeps the integral 
proteins soluble and, in addition, adds a large number of negative 
charges to the proteins with which it associates. Because the number 
of charged SDS molecules per unit weight of protein tends to be rela- 
tively constant, the molecules separate from one another according 
to their molecular weight. The largest proteins move most slowly 
through the molecular sieve of the gel. The major proteins of the 
erythrocyte membrane are separated into about a dozen conspicu- 
ous bands by SDS-PAGE (Figure 4.32c). Among the proteins are a 
variety of enzymes (including glyceraldehyde 3-phosphate dehydro- 
genase, one of the enzymes of glycolysis), transport proteins (for 
ions and sugars), and skeletal proteins (e.g., spectrin). 


Integral Proteins of the 
Erythrocyte Membrane 


A model of the erythrocyte plasma membrane showing its major 
proteins is seen in Figure 4.32d. The most abundant integral proteins 
of this membrane are a pair of carbohydrate-containing, membrane- 
spanning proteins called band 3 and glycophorin A. The high density 
of these proteins within the membrane is evident in the freeze-frac- 
ture micrographs of Figure 4.15. Band 3, which gets its name from its 
position in an electrophoretic gel (Figure 4.32c), is present as a dimer 
composed of two identical subunits (a homodimer). Each subunit 
spans the membrane at least a dozen times and contains a relatively 
small amount of carbohydrate (6-8 percent of the molecule’s weight). 
Band 3 protein serves as a channel for the passive exchange of anions 
across the membrane. As blood circulates through the tissues, car- 
bon dioxide becomes dissolved in the fluid of the bloodstream (the 
plasma) and undergoes the following reaction: 


H,0 +CO2z => H2,CO3 => HCO; +Ht 


The bicarbonate ions (HCO3) enter the erythrocyte in exchange for 137 


chloride ions, which leave the cell. In the lungs, where carbon diox- 
ide is released, the reaction is reversed and bicarbonate ions leave the 
erythrocyte in exchange for chloride ions. The reciprocal movement 
of HCO; and Cl occurs through a channel in the center of each 
band 3 dimer. 

Glycophorin A was the first membrane protein to have its amino 
acid sequence determined. The arrangement of the polypeptide 
chain of glycophorin A within the plasma membrane is shown in 
Figure 4.18. (Other related glycophorins, B, C, D, and E, are also 
present in the membrane at much lower concentrations.) Like band 
3, glycophorin A is also present in the membrane as a dimer. Unlike 
band 3, each glycophorin A subunit spans the membrane only once, 
and it contains a bushy carbohydrate cover consisting of 16 oligosac- 
charide chains that together make up about 60 percent of the mole- 
cule’s weight. It is thought that the primary function of glycophorin 
derives from the large number of negative charges borne on sialic 
acid, the sugar residue at the end of each carbohydrate chain. Because 
of these charges, red blood cells repel each other, which prevents the 
cells from clumping as they circulate through the body’s tiny vessels. 
It is noteworthy that persons who lack both glycophorin A and B in 
their red blood cells show no ill effects from their absence. At the 
same time, the band 3 proteins in these individuals are more heavily 
glycosylated, which apparently compensates for the otherwise missing 
negative charges required to prevent cell-cell interaction. Glycophorin 
also happens to be the receptor utilized by the protozoan that causes 
malaria, providing a path for entry into the blood cell. Consequently, 
individuals whose erythrocytes lack glycophorin A and B are thought 
to be protected from acquiring malaria. 


The Erythrocyte Membrane Skeleton 


The erythrocyte plasma membrane is supported by a fibrillar skeleton 
composed of peripheral membrane proteins (Figure 4.32d,e) that 
plays a major role in determining the biconcave shape of the erythro- 
cyte. As discussed on page 135, the membrane skeleton can establish 
domains within the membrane that enclose particular groups of 
membrane proteins and may greatly restrict the movement of these 
proteins. The major component of the skeleton is an elongated fibrous 
protein, called spectrin. Spectrin is a heterodimer approximately 100 nm 
long, consisting of an a and b subunit that curl around one another. 
Two such dimeric molecules are linked at their head ends to form a 
200-nm-long filament that is both flexible and elastic. Spectrin is 
attached to the internal surface of the membrane by means of nonco- 
valent bonds to another peripheral protein, ankyrin (the green spheres 
of Figure 4.32), which in turn is linked noncovalently to the cytoplas- 
mic domain of a band 3 molecule. As is evident in Figures 4.32d and 
e, spectrin filaments are organized into hexagonal or pentagonal 
arrays. This two-dimensional network is constructed by linking both 
ends of each spectrin filament to a cluster of proteins that include a 
short filament of actin and tropomyosin, proteins typically involved in 
contractile activities (see Chapter 9). A number of genetic diseases 
(hemolytic anemias) characterized by fragile, abnormally shaped 
erythrocytes have been traced to mutations in ankyrin or spectrin. 

If the peripheral proteins are removed from erythrocyte ghosts, 
the membrane becomes fragmented into small vesicles, indicating 
that the inner protein network is required to maintain the integrity 
of the membrane. Erythrocytes are circulating cells that are squeezed. 
under pressure through microscopic capillaries whose diameter is 
considerably less than that of the erythrocytes themselves. To 
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FIGURE 4.32 The plasma membrane of the human erythrocyte. (a) Scanning electron micrograph 
of human erythrocytes. (b) Micrograph showing plasma membrane ghosts, which were isolated by 
allowing erythrocytes to swell and hemolyze as described in the text. (c) The results of SDS— 
polyacrylamide gel electrophoresis (SDS-PAGE) used to fractionate the proteins of the erythrocyte 
membrane, which are identified at the sides of the gel. (d) A model of the erythrocyte plasma 
membrane as viewed from the internal surface, showing the integral proteins embedded in the lipid 
bilayer and the arrangement of peripheral proteins that make up the membrane’s internal skeleton. 
The band 3 dimer shown here is simplified. The band 4.1 protein stabilizes actin-spectrin complexes. 
(e) Electron micrograph showing the arrangement of the proteins of the inner membrane skeleton. 
Source: (a) Courtesy Francois M. M. Morel, Richard F. Baker, and Harold Wayland, Reproduced with 
permission of The Rockefeller University Press; (b) Courtesy of Joseph F. Hoffman; (c) From V. T. 
Marchesi, H. Furthmayr, and M. Tomita, Annu. Rev. Biochem. vol. 45; © 1976 by Annual Reviews Inc.; 
(d) After D. Voet and J. G. Voet, Biochemistry, 2d ed.; Copyright © 1995, John Wiley & Sons, Inc.; 
(e) From Shih-Chun Liu, Laura H. Derick, and Jiri Palek, J. Cell Biol. 104:527, 1987, Reproduced with 
permission of The Rockefeller University Press. 


traverse these narrow passageways, and to do so day after day, the 
red blood cell must be highly deformable, durable, and capable of 
withstanding shearing forces that tend to pull it apart. The spectrin- 
actin network gives the cell the strength, elasticity, and pliability nec- 
essary to carry out its demanding function. 

When first discovered, the membrane skeleton of the erythrocyte 
was thought to be a unique structure suited to the unique shape and 
mechanical needs of this cell type. However, as other cells were exam- 
ined, similar types of membrane skeletons containing members of the 
spectrin and ankyrin families have been revealed, indicating that 
inner membrane skeletons are widespread. Dystrophin, for example, 
is a member of the spectrin family that is found in the membrane 
skeleton of muscle cells. Mutations in dystrophin are responsible for 
causing muscular dystrophy, a devastating disease that cripples and 
kills children. The most debilitating mutations are ones that lead to a 
complete absence of the protein in the cell. The plasma membranes of 
muscle cells lacking dystrophin are apparently destroyed as a conse- 
quence of the mechanical stress exerted on them as the muscle con- 
tracts. As a result, the muscle cells die and eventually are no longer 
replaced. 


REVIEW | 


1. Discuss two major functions of the integral and 


peripheral proteins of the erythrocyte membrane. 
—— 


4.9 Solute Movement 
across Cell Membranes 


Because the contents of a cell are completely surrounded by its 
plasma membrane, all communication between the cell and the 
extracellular medium must be mediated by this structure. The plasma 
membrane is a barrier that retains the dissolved materials of the cell 
so that they do not simply leak out into the environment, yet it must 
allow the necessary exchange of materials into and out of the cell. 
The lipid bilayer of the membrane is ideally suited to prevent the loss 
of charged and polar solutes from a cell. Consequently, some special 
provision must be made to allow the movement of nutrients, ions, 
waste products, and other compounds in and out of the cell. There 
are basically two means for the movement of substances through a 
membrane: passively by diffusion or actively by an energy-coupled 
transport process. Both types of movements lead to the net flux of a 
particular ion or compound. The term net flux indicates that the 
movement of the substance into the cell (influx) and out of the cell 
(efflux) is not balanced, but that one exceeds the other. 

Four different processes are known by which substances move 
across membranes: simple diffusion through the lipid bilayer; simple 
diffusion through an aqueous, protein-lined channel; diffusion that is 
facilitated by a protein transporter; and active transport, which requires 
an energy-driven protein “pump” capable of moving substances against 
a concentration gradient (FIGURE 4.33). We will consider each in turn, 
but first we will discuss the energetics of solute movement. 


The Energetics of Solute Movement 


Diffusion is a spontaneous process in which a substance moves 
from a region of high concentration to a region of low concentration, 
eventually eliminating the concentration difference between the two 
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FIGURE 4.33 Four basic mechanisms by which solute molecules move 
across membranes.The relative sizes of the letters indicate the directions 
of the concentration gradients. (a) Simple diffusion through the bilayer, 
which always proceeds from high to low concentration. (b) Simple diffusion 
through an aqueous channel formed within an integral membrane protein 
or a cluster of such proteins. As in a, movement is always down a concen- 
tration gradient. (c) Facilitated diffusion in which solute molecules bind 
specifially to a membrane protein carrier (a facilitative transporter). As in 

a and b, movement is always from high to low concentration. (d) Active 
transport by means of a protein transporter with a specific binding site that 
undergoes change in affiniy driven with energy released by an exergonic 
process, such as ATP hydrolysis. Movement occurs against a concentration 
gradient. (e) Examples of each type of mechanism as it occurs in the 


membrane of an erythrocyte. 


regions. As discussed in Chapter 3, diffusion depends on the ran- 
dom thermal motion of solutes and is an exergonic process driven by 
an increase in entropy. Individual diffusing molecules do not “know” 
which way to move—an individual molecule will be equally likely to 
move into a region of higher or lower concentration. But because 
there are more molecules in a given volume of a high concentration 
region than in a low concentration region, on average more molecules 
will move from higher concentration regions to lower concentration 
regions than vice versa. We will restrict the following discussion to 
diffusion of substances across membranes. 

To understand the energetics of diffusion, we recall that AG 
depends on how far away a state is from equilibrium. In diffusion, 
equilibrium occurs when the concentration is equal on both sides of 
the membrane. The free-energy change when an uncharged solute (a 
nonelectrolyte) diffuses across a membrane thus depends on how 
different the concentration is on the two sides of the membrane. This 
difference is known as the concentration gradient. The following 
relationship describes the movement of a nonelectrolyte into the cell: 


AG =RT In [ci] 
[C 


oO 


[Ci] 
AG =2.303 RT | — 
810 [Co] 


where AG is the free-energy change (Section 3.2), Ris the gas constant, 
T is the absolute temperature, and [C;]/[C,] is the ratio of the 
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of the membrane. At 25°C, 


[Ci] 


AG =1.4 kcal/mol «1 
O1* 10810 [Co] 


Clearly, if the concentrations inside and outside are equal, AG = 0, 
thus the system is at equilibrium. If the ratio of [C;]/[C,] is less than 
1.0, then the log of the ratio is negative, AG is negative, and the net 
influx of solute is thermodynamically favored (exergonic). If, for 
example, the external concentration of solute is 10 times the internal 
concentration, AG = -1.4 kcal/mole. Thus, the maintenance of a 
tenfold concentration gradient represents a storage of 1.4 kcal/mol. 
As solute moves into the cell, the concentration gradient decreases, 
the stored energy is dissipated, and AG decreases until, at equilib- 
rium, AG is zero. (To calculate AG for movement of a solute out of 
the cell, the term for concentration ratios becomes [C,]/[C,]). 


Formation of an Electrochemical Gradient 


If the solute is an electrolyte (a charged species), the overall charge 
difference between the two compartments must also be considered. 
As a result of the mutual repulsion of ions of like charges, it is ther- 
modynamically unfavorable for an electrolyte to move across a 
membrane from one compartment into another compartment hav- 
ing a net charge of the same sign. Conversely, if the charge of the 
electrolyte is opposite in sign to the compartment into which it is 
moving, the process is thermodynamically favored. The greater the 
difference in charge (the potential difference or voltage) between the 
two compartments, the greater the difference in free energy. Thus, 
the tendency of an electrolyte to diffuse between two compartments 
depends on two gradients: a chemical gradient, determined by the 
concentration difference of the substance between the two compart- 
ments, and the electric potential gradient, determined by the differ- 
ence in charge. Together these differences are combined to form an 
electrochemical gradient. The free-energy change for the diffu- 
sion of an electrolyte into the cell is 


AG =RT In +2F AE, 


oO 


where z is the charge of the solute, F is the Faraday constant (23.06 
kcal/V - equivalent, where an equivalent is the amount of the electro- 
lyte having one mole of charge), and AE,, is the potential difference 
(in volts) between the two compartments. We saw in the previous 
example that a tenfold difference in concentration of a nonelectrolyte 
across a membrane at 25°C generates a AG of —1.4 kcal/mol. Suppose 
the concentration gradient consisted of Na* ions, which were present 
at tenfold higher concentration outside the cell than in the cytoplasm. 
Because the voltage across the membrane of a cell is typically about 
-70 mV (page 159), the free-energy change for the movement of a 
mole of Na* ions into the cell under these conditions would be 


AG =-1.4 kcal/mol +zFAE,, 
AG =-1.4 kcal/mol + (1)(23.06 kcal/V -mol)(— 0.07V) 
=-3.1 kcal/mol 


Thus under these conditions, the concentration difference and the 
electric potential make similar contributions to the storage of free 
energy across the membrane. 

The interplay between concentration and potential differences 
is seen in the diffusion of potassium ions (K*) out of a cell. The efflux 
of the ion is favored by the K* concentration gradient, which has a 
higher K* concentration inside the cell, but hindered by the electri- 
cal gradient that its diffusion creates, which leaves a higher negative 
charge inside the cell. We will discuss this subject further when we 
consider the topic of membrane potentials and nerve impulses in 
Section 4.16. 


REVIEW PO 


1. Compare and contrast the four basically different 
ways that a substance can move across the plasma 
membrane (as indicated in Figure 4.33). 

2. Contrast the energetic difference between the 
diffusion of an electrolyte versus a nonelectrolyte 
across the membrane. 


eo 


4.10 Diffusion through 
the Lipid Bilayer 


The plasma membrane is a selectively permeable barrier that allows 
solute passage by several mechanisms, including simple diffusion 
through the lipid bilayer. Diffusion is an energy-independent pro- 
cess in which a solute moves down an electrochemical gradient, dis- 
sipating the free energy stored in the gradient. Small inorganic 
solutes, such as 03, CO,, and H,O, penetrate the lipid bilayer readily, 
as do solutes with high lipid solubility. Ions and polar organic sol- 
utes, such as sugars and amino acids, require special transporters to 
enter or leave the cell. 


Diffusion of Substances 
through Membranes 


Two qualifications must be met before a nonelectrolyte can diffuse 
passively across a plasma membrane. The substance must be present 
at higher concentration on one side of the membrane than the other, 
and the membrane must be permeable to the substance. A membrane 
may be permeable to a given solute either (1) because that solute 
can pass directly through the lipid bilayer, or (2) because that solute can 
traverse an aqueous pore that spans the membrane. Let us begin by 
considering the former route in which a substance must dissolve in 
the lipid bilayer on its way through the membrane. 

Discussion of simple diffusion leads us to consider the polarity 
of a solute. One simple measure of the polarity (or nonpolarity) of a 
substance is its partition coefficient, which is the ratio of its solu- 
bility in a nonpolar solvent, such as octanol or a vegetable oil, to that 
in water under conditions where the nonpolar solvent and water are 
mixed together. FIGURE 4.34 shows the relationship between parti- 
tion coefficient and membrane permeability of a variety of chemicals 
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FIGURE 4.34 The relationship between partition coefficient and 
membrane permeability. In this case, measurements were made of the 
penetration of a variety of chemicals and drugs across the plasma mem- 
branes of the cells that line the capillaries of the brain. Substances 
penetrate by passage through the lipid bilayer of these cells. The partition 
coefficient is expressed as the ratio of solubility of a solute in octanol to its 
solubility in water. Permeability is expressed as penetrance (P) in cm/sec. 
For all but a few compounds, such as vinblastine and vincristine, 
penetrance is directly proportional to lipid solubility. 


Source: From N. J. Abbott and |. A. Romero, Molec. Med. Today, 2:110, 
1996; Copyright 1996, with permission from Elsevier Science. 


and drugs. It is evident that the greater the lipid solubility, the faster 
the penetration. 

Another factor determining the rate of penetration of a 
compound through a membrane is its size. If two molecules have 


Net water loss 
Cell shrinks 


Net water gain 
Cell swells 


(a) Hypotonic solution (b) Hypertonic solution 


(c) lsotonic solution 


approximately equivalent partition coefficients, the smaller mole- 141 


cule tends to penetrate the lipid bilayer of a membrane more rapidly 
than the larger one. Very small, uncharged molecules penetrate very 
rapidly through cellular membranes. Consequently, membranes are 
highly permeable to small inorganic molecules, such as Oj, COs, 
NO, and H,0, which are thought to slip between adjacent phospho- 
lipids. In contrast, larger polar molecules, such as sugars, amino 
acids, and phosphorylated intermediates, exhibit poor membrane 
penetrability. As a result, the lipid bilayer of the plasma membrane 
provides an effective barrier that keeps these essential metabolites 
from diffusing out of the cell. Some of these molecules (e.g., sugars 
and amino acids) must enter cells from the bloodstream, but they 
cannot do so by simple diffusion. Instead, special mechanisms must 
be available to mediate their penetration through the plasma mem- 
brane. The use of such mechanisms allows a cell to regulate the 
movement of substances across its surface barrier. We will return to 
this feature of membranes later. 


The Diffusion of Water 
through Membranes 


Water molecules move much more rapidly through a cell membrane 
than do dissolved ions or small polar organic solutes, which are 
essentially nonpenetrating. Because of this difference in the penetra- 
bility of water versus solutes, membranes are said to be semiper- 
meable. Water moves readily through a semipermeable membrane 
from a region of lower solute concentration to a region of higher 
solute concentration. This process is called osmosis, and it is readily 
demonstrated by placing a cell into a solution containing a nonpen- 
etrating solute at a concentration different than that present within 
the cell itself. 

When two compartments of different solute concentration are 
separated by a semipermeable membrane, the compartment of higher 
solute concentration is said to be hypertonic (or hyperosmotic) 
relative to the compartment of lower solute concentration, which is 
described as being hypotonic (or hypoosmotic). When a cell is 
placed into a hypotonic solution, the cell rapidly gains water by 


No net loss or gain 


FIGURE 4.35 The effects of differences in the concentra- 
tion of solutes on opposite sides of the plasma membrane. 
(a) A cell placed in a hypotonic solution (one having a lower 
solute concentration than the cell) swells because of a net gain 
of water by osmosis. (b) A cell in a hypertonic solution shrinks 
because of a net loss of water by osmosis. (c) A cell placed in 
an isotonic solution maintains a constant volume because the 


inward flux of water is equal to the outward flux. 
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hypertonic solution rapidly loses water by osmosis and shrinks 
(Figure 4.35b). These simple observations show that a cell’s volume 
is controlled by the difference between the solute concentration 
inside the cell and that in the extracellular medium. The swelling 
and shrinking of cells in slightly hypotonic and hypertonic media are 
usually only temporary events. Within a few minutes, the cells 
recover and return to their original volume. In a hypotonic medium, 
recovery occurs as the cells lose ions, thereby reducing their internal 
osmotic pressure. In a hypertonic medium, recovery occurs as the 
cells gain ions from the medium. Once the internal solute concentra- 
tion (which includes a high concentration of dissolved proteins) 
equals the external solute concentration, the internal and external 
fluids are isotonic (or isosmotic), and no net movement of water 
into or out of the cells occurs (Figure 4.35c). 

Osmosis is an important factor in a multitude of bodily func- 
tions. Your digestive tract, for example, secretes several liters of fluid 
daily, which is reabsorbed osmotically by the cells that line your 
intestine. If this fluid weren't reabsorbed, as happens in cases of 
extreme diarrhea, you would face the prospect of rapid dehydration. 
Indeed, the bacterial disease cholera kills over 100,000 people every 
year, mostly because of water loss into the intestines, driven by 
osmosis. Plants utilize osmosis in different ways. Unlike animal cells, 
which are generally isotonic with the medium in which they are 
bathed, plant cells are generally hypertonic compared to their fluid 
environment. As a result, there is a tendency for water to enter the 
cell, causing it to develop an internal (turgor) pressure that pushes 
against its surrounding wall (FIGURE 4.36a). Turgor pressure pro- 
vides support for nonwoody plants and for the nonwoody parts of 
trees, such as the leaves. If a plant cell is placed into a hypertonic 
medium, its volume shrinks as the plasma membrane pulls away 
from the surrounding cell wall, a process called plasmolysis 
(Figure 4.36b). The loss of water due to plasmolysis causes plants to 
lose their support and wilt. 

Not all cells are equally permeable to water. In fact, many cells 
are much more permeable to water than can be explained by simple 
diffusion through the lipid bilayer. A family of small integral proteins, 
called aquaporins, allow the passive movement of water from one 
side of the plasma membrane to the other. Each aquaporin subunit 
(in the four-subunit protein) contains a central channel that is lined 
primarily by hydrophobic amino acid residues and is highly specific 
for water molecules. A billion or so water molecules can pass—in 
single file—through each channel every second. At the same time, H* 
ions, which normally hop along a chain of water molecules, are not 
able to penetrate these open pores. The apparent mechanism by which 
these channels are able to exclude protons has been suggested by 
a combination of X-ray crystallographic studies, which has revealed 
the structure of the protein, and molecular dynamics simulations 
(page 58), which has put this protein structure into operation. 
A model based on such simulations is shown in FIGURE 4.37a. Very 
near its narrowest point, the wall of an aquaporin channel contains a 
pair of precisely positioned positive charges (residues N203 and N68 
in Figure 4.37b) that attract the oxygen atom of each water molecule 
as it speeds through the constriction in the protein. This interaction 
reorients the central water molecule in a position that prevents it 
from maintaining the hydrogen bonds that normally link it to its 
neighboring water molecules. This removes the bridge that would 
normally allow protons to move from one water molecule to the next. 

Aquaporins are particularly prominent in cells, such as those of 
a kidney tubule or plant root, where the passage of water plays a 


Hypotonic: 
Normal turgor 
pressure 


ae ‘ Hypertonic: 
No turgor 
pressure 


(b) 


FIGURE 4.36 The effects of osmosis on a plant cell. (a) Aquatic plants 
living in freshwater are surrounded by a hypotonic environment. Water 
therefore tends to flow into the cells, creating turgor pressure. (b) If the 
plant is placed in a hypertonic solution, such as seawater, the cell loses 
water, and the plasma membrane pulls away from the cell wall. 


Source: Ed Reschke. 


crucial role in the tissue’s physiologic activities. The hormone vaso- 
pressin, which stimulates water retention by the collecting ducts of 
the kidney, acts by way of one of these proteins (AQP2). Some cases 
of the inherited disorder congenital nephrogenic diabetes insipidus 
arise from mutations in this aquaporin channel. Persons suffering 
from this disease excrete large quantities of urine because their kid- 
neys do not respond to vasopressin. 


REVIEW | 


1. Describe the relationship between partition coefficient 
and molecular size with regard to membrane 
permeability. 

2. Explain the effects of putting a cell into a hypotonic, 
hypertonic, or isotonic medium. 


ee 


FIGURE 4.37 Passage of water molecules 
through an aquaporin channel. (a) Snapshot 
from a molecular dynamics simulation of a 
stream of water molecules (red and white 
spheres) passing in single file through the 
channel in one of the subunits of an aquaporin 
molecule residing within a membrane. (b) A 
model describing the mechanism by which 
water molecules pass through an aquaporin 
channel with the simultaneous exclusion of 
protons. Nine water molecules are shown to be 
lined up in single file along the wall of the 
channel. Each water molecule is depicted as a 
red circular O atom with two associated Hs. In 
this model, the four water molecules at the top 
and bottom of the channel are oriented, as the 
result of their interaction with the carbonyl 
(C—O) groups of the protein backbone 

(page 49), with their H atoms pointed away from 
the center of the channel. These water mol- 
ecules are able to form hydrogen bonds (dashed 
lines) with their neighbors. In contrast, the single 


; ; (a) —=0O---H-@ 
water molecule in the center of the channel is i 
oriented in a position that prevents it from forming hydrogen bonds with other water molecules, which has the effect of oS 
interrupting the flow of protons through the channel. Animations of aquaporin channels can be found at www.nobelprize +, 


.org/nobel_prizes/chemistry/laureates/2003/animations.htm| 


(b) 


Source: (a) From Benoit Roux and Klaus Schulten, Structure 12:1344, 2004, with permission from Elsevier; (b) From 
R. M. Stroud et al., Curr. Opin. Struct. Biol. 13:428, © 2003, with permission from Elsevier. 


4.11 The Diffusion of lons 
through Membranes 


The lipid bilayer that constitutes the core of biological membranes is 
highly impermeable to charged substances, including small ions 
such as Na*, K*, Ca**, and CI’. Yet the rapid movement (conduct- 
ance) of these ions across membranes plays a critical role in a mul- 
titude of cellular activities, including formation and propagation ofa 
nerve impulse, secretion of substances into the extracellular space, 
muscle contraction, regulation of cell volume, and the opening of 
stomatal pores on plant leaves. 

In 1955, Alan Hodgkin and Richard Keynes of Cambridge 
University first proposed that cell membranes contain ion channels, 


FIGURE 4.38 Measuring ion channel conductance by 
patch-clamp recording. (a) In this technique, a highly 
polished glass micropipette is placed against a portion of 
the outer surface of a cell, and suction is applied to seal the 
rim of the pipette against the plasma membrane. Because 
the pipette is wired as an electrode (a microelectrode), a 
voltage can be applied across the patch of membrane 
enclosed by the pipette, and the responding flow of ions 
through the membrane channels can be measured. As 
indicated in the figure, the micropipette can enclose a 
patch of membrane containing a single ion channel, which 
allows investigators to monitor the opening and closing of a 
single gated channel, as well as its conductance at different 
applied voltages. (b) The micrograph shows patch-clamp 
recordings being made from a single photoreceptor cell of 
the retina of a salamander. One portion of the cell is drawn (a) 
into a glass micropipette by suction, while a second 


that is, openings in the membrane that are permeable to specific ions. 
Direct proof of the existence of ion channels emerged through the 
work of Bert Sakmann and Erwin Neher at the Max-Planck Institute 
in Germany in the late 1970s and early 1980s who developed tech- 
niques to monitor the ionic current passing through a single ion chan- 
nel. This is accomplished using very fine micropipette-electrodes 
made of polished glass that are placed on the outer cell surface and 
sealed to the membrane by suction. The voltage across the membrane 
can be maintained (clamped) at any particular value, and the current 
originating in the small patch of membrane surrounded by the pipette 
can be measured (FIGURE 4.38). These landmark studies marked the 
first successful investigations into the activities of individual protein 
molecules. Today, biologists have identified a bewildering variety of 


(b) 35 um 


micropipette-electrode (lower right) is sealed against a small patch of the plasma membrane on another portion of the cell. 
Source: (b) From T. D. Lamb, H. R. Matthews, and V. Torre, J. Physiology 372:319, 1986. © 1986, reproduced with permission of John Wiley & Sons. 
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a central aqueous pore. As might be predicted, mutations in the genes 
encoding ion channels can lead to many serious diseases (see Table 1 
of the Human Perspective, page 158). 

Most ion channels are highly selective in allowing only one par- 
ticular type of ion to pass through the pore. As with the passive dif- 
fusion of other types of solutes across membranes, the diffusion of 
ions through a channel is always downhill, that is, from a state of 
higher energy to a state of lower energy. Most of the ion channels that 
have been identified can exist in either an open or a closed confor- 
mation; such channels are said to be gated. The opening and clos- 
ing of the gates are subject to complex physiologic regulation and can 
be induced by a variety of factors depending on the particular chan- 
nel. Three major categories of gated channels are distinguished: 


1. Voltage-gated channels whose conformational state depends 
on the difference in ionic charge on the two sides of the membrane. 


2. Ligand-gated channels whose conformational state depends 
on the binding of a specific molecule (the ligand), which is usu- 
ally not the solute that passes through the channel. Some ligand- 
gated channels are opened (or closed) following the binding of a 
molecule to the outer surface of the channel; others are opened 
(or closed) following the binding of a ligand to the inner surface 
of the channel. For example, neurotransmitters, such as acetylcho- 
line, act on the outer surface of certain cation channels, whereas? 
cyclic nucleotides, such as cAMP, act on the inner surface of cer- 
tain calcium ion channels. 


3. Mechano-gated channels whose conformational state depends 
on mechanical forces (e.g., stretch tension) that are applied to the 
membrane. Members of one family of cation channels, for exam- 
ple, are opened by the movements of stereocilia (see Figure 9.46) 
on the hair cells of the inner ear in response to sound or motions 
of the head. 


We will focus in the following discussion on the structure and func- 
tion of voltage-gated potassium ion channels because these are the 
best understood. 

In 1998, Roderick MacKinnon and his colleagues at Rockefeller 
University provided the first atomic-resolution image of an ion 
channel protein, in this case, a bacterial K* ion channel called KcsA. 
The relationship between structure and function is evident every- 
where in the biological world, but it would be difficult to find a bet- 
ter example than that of the K* ion channel depicted in FIGURE 4.39. 
As we will see shortly, the formulation of this structure led directly 
to an understanding of the mechanism by which these remarkable 
molecular machines are able to select overwhelmingly for K* ions 
over Na’ ions, yet at the same time allow an incredibly rapid con- 
ductance of K* ions through the membrane. We will also see that the 
mechanisms of ion selectivity and conductance in this bacterial 
channel are virtually identical to those operating in the much larger 
mammalian channels. Evidently, the basic challenges in operating an 
ion channel were solved relatively early in evolution, although many 
refinements appeared over the following one or two billion years. 

The KcsA channel consists of four subunits, two of which are 
shown in Figure 4.39. Each subunit of Figure 4.39 is seen to contain 
two membrane-spanning helices (M1 and M2) and a pore region (P) 
at the extracellular end of the channel. P consists of a short pore helix 
that extends approximately one-third the width of the channel and a 
nonhelical loop (colored light brown in Figure 4.39) that forms the 
lining of a narrow selectivity filter, so named because of its role in 
allowing only the passage of K* ions. 


The lining of the selectivity filter contains a highly conserved 
pentapeptide—Gly-Tyr-Gly-Val-Thr (or GYGVT in single-letter 
nomenclature). The X-ray crystal structure of the KcsA channel shows 
that the backbone carbonyl (C==O) groups from the conserved pen- 
tapeptide (see the backbone structure on page 49) create five succes- 
sive rings of oxygen atoms (four rings are made up of carbonyl oxygens 
from the polypeptide backbone, and one ring consists of oxygen 
atoms from the threonine side chain). Each ring contains four oxygen 
atoms (one from each subunit) and has a diameter of approximately 
3 A, which is slightly larger than the 2.7 A diameter of a K* ion that 
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FIGURE 4.39 Three-dimensional structure of the bacterial KcsA 
channel and the selection of K* ions. This K* ion channel consists of 

four subunits, two of which are shown here. Each subunit is comprised 

of M1 and M2 helices joined by a P (pore) segment consisting of a short 
helix and a nonhelical portion that lines the channel through which the 

ions pass. A portion of each P segment contains a conserved pentapeptide 
(GYGVT) whose residues line the selectivity filter that screens for K* ions. 
The oxygen atoms of the carbonyl groups of these residues project into 
the channel where they can interact selectively with K* ions (indicated by 
the red mesh objects) within the filter. As indicated in the top inset, the 
selectivity filter contains four rings of carbonyl O atoms and one ring of 
threonyl O atoms; each of these five rings contains four O atoms, one 
donated by each subunit. The diameter of the rings is just large enough 

so that eight O atoms can coordinate a single K* ion, replacing its 

normal water of hydration. Although four K* binding sites are shown, 

only two are occupied at one time. 

Source: From Roderick MacKinnon, Nature Med. 5:1108, © 1999, reprinted 
by permission from Macmillan Publishers Ltd. 


has lost its normal shell of hydration. Consequently, the electronega- 
tive O atoms that line the selectivity filter can substitute for the shell of 
water molecules that are displaced as each K" ion enters the pore. In 
this model, the selectivity filter contains four potential K* ion binding 
sites. As indicated in the top inset of Figure 4.39, a K* ion bound at 
any of these four sites would occupy the center of a “box” having four 
O atoms in a plane above the ion and four O atoms in a plane below 
the atom. As a result, each K* ion in one of these sites could coordi- 
nate with eight O atoms of the selectivity filter. Whereas the selectivity 
filter is a precise fit for a dehydrated K* ion, it is much larger than the 
diameter of a dehydrated Na* ion (1.9 A). Consequently, a Na* ion 
cannot interact optimally with the eight oxygen atoms necessary to 
stabilize it in the pore. As a result, the smaller Na* ions cannot over- 
come the higher energy barrier required to penetrate the pore. 

Although there are four potential K* ion binding sites, only two are 
occupied at any given time. Potassium ions are thought to move, two at 
a time—from sites 1 and 3 to sites 2 and 4—as indicated in the top inset 
of Figure 4.39. The entry of a third K* ion into the selectivity filter cre- 
ates an electrostatic repulsion that ejects the ion bound at the opposite 
end of the line. Studies indicate that there is virtually no energy barrier 
for an ion to move from one binding site to the next, which accounts for 
the extremely rapid flow of ions across the membrane. Taken together, 
these conclusions concerning K* ion selectivity and conductance pro- 
vide a superb example of how much can be learned about biological 
function through an understanding of molecular structure. 

The KcsA channel depicted in Figure 4.39 has a gate, just like 
eukaryotic channels. The opening of the gate of the KcsA channel in 
response to very low pH was illustrated in Figure 4.22. The structure 
of KcsA shown in Figure 4.39 is actually the closed conformation of 
the protein (despite the fact that it contains ions in its channel). 
Comparison of the open and closed structures of KcsA strongly sug- 
gests that gating of these molecules is accomplished by conforma- 
tional changes of the cytoplasmic ends of the inner (M2) helices. In 
the closed conformation, as seen in Figure 4.39 and FIGURE 4.40 in 
blue, the M2 helices are straight and cross over one another to form 
a “helix bundle” that seals the cytoplasmic face of the pore. In the 
model shown in Figure 4.40, the channel opens when the M2 helices 
bend at a specific hinge point where a glycine residue is located. 

Now that we have seen how these prokaryotic K* channels oper- 
ate, we are in a better position to understand the structure and function 
of the more complex eukaryotic versions, which are thought to per- 
form in a similar manner. Genes that encode a variety of distinct volt- 
age-gated K* (or Kv) channels have been isolated and the molecular 
anatomy of their proteins scrutinized. The Kv channels of plants play 
an important role in salt and water balance and in regulation of cell 
volume. The Kv channels of animals are best known for their role in 
muscle and nerve function, which is explored at the end of the chapter. 
Eukaryotic Kv channel subunits contain six membrane-associated heli- 
ces, named $1-S6, which are shown two-dimensionally in Figure 4.41. 
These six helices can be grouped into two functionally distinct domains: 


1. A pore domain, which has the same basic architecture as that 
of the entire bacterial channel illustrated in Figure 4.39 and con- 
tains the selectivity filter that permits the selective passage of K* 
ions. Helices M1 and M2 and the P segment of the KcsA channel 
of Figure 4.39 are homologous to helices $5 and S6 and the 
P segment of the voltage-gated eukaryotic channel illustrated in 
FIGURE 4.41. Like the four M2 helices of KcsA, the four S6 helices 
line much of the pore, and their configuration determines whether 
the gate to the channel is open or closed. 


FIGURE 4.40 Schematic illustration of the hinge-bending model for 
the opening of the KcsA channel. The M2 helices from each subunit bend 
outward at a specific glycine residue, which opens the gate at the 
intracellular end of the channel to K* ions. 


Source: Serdar Uysal, et al. 2011. Mechanism of activation gating in the 
full-ength KcsA K* channel. Proc. Natl. Acad. Sci. U.S.A. 108, 11896-9. 


2. A voltage-sensing domain consisting of helices S1-S4 that 
senses the voltage across the plasma membrane (as discussed 
below). 


The three-dimensional crystal structure of a complete eukaryotic 
Kv channel purified from rat brain is shown in FIGURE 4.42a. 
Determination of this structure was made possible by use of a mixture 
of detergent and lipid throughout the purification and crystallization 
process. The presence of negatively charged phospholipids is thought 
to be important in maintaining the native structure of the membrane 
protein and promoting its function as a voltage-gated channel. Like 
the KcsA channel, a single eukaryotic Kv channel consists of four 
homologous subunits arranged symmetrically around the central ion- 
conducting pore (Figure 4.425). The selectivity filter, and thus the pre- 
sumed mechanism of K* ion selection, is virtually identical in the 
prokaryotic KcsA and eukaryotic Kv proteins. The gate leading into a 
Kv channel is formed by the inner ends of the S6 helices and is thought 
to open and close in a manner roughly similar to that of the M2 helices 
of the bacterial channel (shown in Figure 4.40). The protein depicted 
in Figure 4.42 represents the open state of the channel.® 

The S4 helix, which contains several positively charged amino 
acid residues spaced along the polypeptide chain (inset of Figure 4.41), 
acts as the key element of the voltage sensor. The voltage-sensing 
domain is seen to be connected to the pore domain by a short linker 
helix denoted as S4—-S5 in the model in Figure 4.42b. Under resting 
conditions, the negative potential across the membrane (page 159) 
keeps the gate closed. A change in the potential to a more positive 
value (a depolarization, page 160) exerts an electric force on the S4 
helix. This force is thought to cause the transmembrane S4 helix to 


°Other studies strongly suggest that K* channels are gated by two distinct mecha- 
nisms: one involving the opening and closing of the inner helix bundle as described 
here and another involving the selectivity filter, which is not discussed (see PNAS 
107:7623, 2010 and Nature 466:203, 2010). 
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FIGURE 4.41 The structure of one subunit of a 
eukaryotic, voltage-gated K* channel. A 
two-dimensional portrait of a K* channel subunit 
showing its six transmembrane helices and a 
portion of the polypeptide (called the pore helix 
or P) that dips into the protein to form part of the pagina hal 
channel's wall. The inset shows the sequence of ; 
amino acids of the positively charged S4 helix of 
the Drosophila K* Shaker ion channel, which 
serves as a voltage sensor. The positively charged - . - 
side chains are situated at every third residue Inside cell 
along the otherwise hydrophobic helix. This 
member of the Kv family is called a Shaker 
channel because flies with certain mutations in the 
protein shake vigorously when anesthesized with 
ether. The Shaker channel was the first K* channel 


to be identified and cloned in 1987. 
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FIGURE 4.42 Three-dimensional structure of a voltage-gated mammalian K* channel. (a) The crystal structure of the entire tetrameric Kv1.2 channel, a 
member of the Shaker family of K* ion channels found in nerve cells of the brain. The transmembrane portion is shown in red, and the cytoplasmic portion in 
blue. The potassium ion binding sites are indicated in green. (b) Ribbon drawing of the same channel shown in a, with the four subunits that make up the 
channel shown in different colors. If you focus on the red subunit, you can see (1) the spatial separation between the voltage-sensing and pore domains of 
the subunit and (2) the manner in which the voltage-sensing domain from each subunit is present on the outer edge of the pore domain of a neighboring 
subunit. The cytoplasmic portion of this particular channel consists of a T1 domain, which is part of the channel polypeptide itself, and a separate 

6B polypeptide. (c) Ribbon drawing of a single subunit showing the spatial orientation of the six membrane-spanning helices (S1-Sé) and also the presence of 
the S4—S5 linker helix, which connects the voltage-sensing and pore domains. This linker transmits the signal from the $4 voltage sensor that opens the 
channel. The inner surface of the channel below the pore domain is lined by the S6 helix (roughly similar to the M2 helix of the bacterial channel shown in 
Figure 4.39). The channel shown here is present in the open configuration with the S6 helices curved outward (compare to Figure 4.40) at the site marked 
PVP (standing for Pro-Val-Pro, which is likely the amino acid sequence of the “hinge”). 


Source: (a—c) From Stephen B. Long et al., Science 309:867, 899, 2005, courtesy of Roderick Mackinnon; © 2005, reprinted with permission from AAAS. 


move in such a way that its positively charged residues shift from a 
position where they were exposed to the cytoplasm to a new position 
where they are exposed to the outside of the cell. Voltage sensing is a 
dynamic process whose mechanism cannot be resolved by a single 
static view of the protein such as that shown in Figure 4.42. In fact, 
several competing models describing the mechanism of action of the 
voltage sensor are currently debated. However it occurs, the move- 
ment of the S4 helix in response to membrane depolarization initiates 


a series of conformational changes within the protein that opens the 
gate at the cytoplasmic end of the channel. 

Once opened, more than 10 million potassium ions can pass 
through the channel per second, which is nearly the rate that would 
occur by free diffusion in solution. Because of the large ion flux, the 
opening of a relatively small number of K* channels has significant 
impact on the electrical properties of the membrane. After the channel is 
open for a few milliseconds, the movement of K* ions is “automatically” 
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stopped by a process known as inactivation. To understand channel 
inactivation, we have to consider an additional portion of a Kv channel 
besides the two transmembrane domains discussed above. 

Eukaryotic Kv channels typically contain a large cytoplasmic 
structure whose composition varies among different channels. As 
indicated in FIGURE 4.43a, inactivation of the channel is accomplished 
by movement of a small inactivation peptide that dangles from the 
cytoplasmic portion of the protein. The inactivation peptide is 
thought to gain access to the cytoplasmic mouth of the pore by snak- 
ing its way through one of four “side windows” indicated in the fig- 
ure. When one of these dangling peptides moves up into the mouth 
of the pore (Figure 4.43a), the passage of ions is blocked, and the 
channel is inactivated. At a subsequent stage of the cycle, the inacti- 
vation peptide is released, and the gate to the channel is closed. It 
follows from this discussion that the potassium channel can exist in 
three different states—open, inactivated, and closed—which are 
illustrated schematically in Figure 4.43b. 

Potassium channels come in many different varieties. It is 
remarkable that C. elegans, a nematode worm whose body consists of 
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FIGURE 4.43 Conformational states of a voltage-gated K* ion 
channel. (a) Three-dimensional model of a eukaryotic K* ion channel. 
Inactivation of channel activity occurs as one of the inactivation peptides, 
which dangle from the cytoplasmic portion of the complex, fits into the 
cytoplasmic opening of the channel. (6) Schematic representation of a view 
into a K* ion channel, perpendicular to the membrane from the cytoplasmic 
side, showing the channel in the closed (resting), open, and inactivated state. 
Source: (b) Reprinted from Neuron, vol. 20, C. M. Armstrong and B. Hille, 
Voltage-Gated lon Channels and Electrical Escitability, p. 377; copyright 
1998, with permission from Elsevier Science. 


only about 1000 cells, contains approximately 80 different genes that 
encode K* channels. It is evident that a single cell—whether in a 
nematode, human, or plant—is likely to possess a variety of different 
K* channels that open and close in response to different voltages. In 
addition, the voltage required to open or close a particular K* channel 
can vary depending on whether the channel protein is phosphoryl- 
ated, which in turn is regulated by hormones and other factors. It is 
apparent that ion channel function is under the control of a diverse 
and complex set of regulatory agents. The structure and function of a 
very different type of ion channel, the ligand-gated nicotinic acetyl- 
choline receptor, is the subject of the Experimental Pathways section. 


REVIEW | 


1. What are three different types of gated channels? 

2. Describe how a potassium channel can allow 
potassium ions to flow but prevent sodium ions 
from flowing. 
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4.12 EXPERIMENTAL PATHWAYS 


The Acetylcholine Receptor 


The discovery of the acetylcholine receptor illustrates the impor 
tant roles that natural toxins and unusual organisms can play in 
dissecting a fundamental biochemical process with central impor- 
tance to health and medicine. The story begins in 1843, when 
the pharmacist and aspiring playwright Claude Bernard moved 
from a small French town to Paris, where he planned to pursue his 
literary career. Instead, Bernard enrolled in medical school and 
went on to become the foremost physiologist of the nineteenth 
century. Among his many interests was the mechanism by which 
nerves stimulate the contraction of skeletal muscles. His studies 


included the use of curare, a highly toxic drug isolated from 
tropical plants and utilized for centuries by native South American 
hunters to make poisonous darts. Bernard found that curare par- 
alyzed a skeletal muscle without interfering with either the ability 
of nerves to carry impulses to that muscle or the ability of the 
muscle to contract on direct stimulation. Bernard concluded that 
curare somehow acted on the region of contact between the 

nerve and muscle. 
This conclusion was confirmed and extended by John 
Langley, a physiologist at Cambridge University. Langley was 
continued 
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studying the ability of nicotine, another substance derived from 
plants, to stimulate the contraction of isolated frog skeletal 
muscles and the effect of curare in inhibiting nicotine action. 
In 1906, Langley concluded that “the nervous impulse should 
not pass from nerve to muscle by an electric discharge, but by 
the secretion of a special substance on the end of the nerve.”' 
Langley proposed that this “chemical transmitter” was binding 
to a “receptive substance” on the surface of the muscle cells, the 
same site that bound nicotine and curare. These proved to be 
farsighted proposals. 

Langley's suggestion that the stimulus from nerve to muscle 
was transmitted by a chemical substance was confirmed in 1921 in 
an ingenious experiment by the Austrian-born physiologist, Otto 
Loewi, the design of which came to Loewi during a dream. The 
heart rate of a vertebrate is regulated by input from two opposing 
(antagonistic) nerves. Loewi isolated a frog's heart with both 
nerves intact. When he stimulated the inhibitory (vagus) nerve, 
a chemical was released from the heart preparation into a salt 
solution, which was allowed to drain into the medium bathing a 
second isolated heart. The rate of the second heart slowed dra- 
matically, as though its own inhibitory nerve had been activated.” 
Loewi called the substance responsible for inhibiting the frog's 
heart “Vagusstoff.” Within a few years, Loewi had shown that the 
chemical and physiologic properties of Vagusstoff were identical 
to acetylcholine, and he concluded that acetylcholine (ACh) was 
the substance released by the tips of the nerve cells that made up 
the vagus nerve. 

In 1937, David Nachmansohn, a neurophysiologist at the 
Sorbonne, was visiting the World's Fair in Paris where he observed 
several living electric fish of the species Torpedo marmarota that 
were on display. These rays have electric organs that deliver 
strong shocks (40-60 volts) capable of killing potential prey. At 
the time, Nachmansohn was studying the enzyme acetylcholin- 
esterase, which acts to destroy ACh after its release from the 
tips of motor nerves. Nachmansohn was aware that the electric 
organs of these fish were derived from modified skeletal muscle 
tissue (FIGURE 1), and he asked if he could have a couple of the 
fish for study once the fair had ended. The results of the first test 
showed the electric organ was an extraordinarily rich source of 
acetylcholinesterase.° It was also a very rich source of the nicotinic 
acetylcholine receptor (nAChR),* the receptor present on the 
postsynaptic membranes of skeletal muscle cells that binds ACh 
molecules released from the tips of a motor nerve. Finding an 
ideal system can prove invaluable in the study of a particular 
aspect of cell structure or function. As will be evident from this 
discussion, the electric organs of fish have been virtually the sole 
source of material for the study of the nAChR. 

The nAChR is an integral membrane protein, and it wasn’t 
until the 1970s that techniques were developed for the isolation 
of such proteins. As discussed in Chapter 18, the purification of 
a particular protein requires a suitable assay to determine the 
amount of that protein present in any particular fraction. The 
ideal assay for nAChR was a compound that bound selectively 


*The receptor is described as nicotinic because it can be activated by nicotine as 
well as by acetylcholine. This contrasts with muscarinic acetylcholine receptors 
of the parasympathetic nerve synapses, which can be activated by muscarine, but 
not nicotine, and are inhibited by atropine, but not curare. Smokers’ bodies 
become accustomed to high levels of nicotine, and they experience symptoms of 
withdrawal when they stop smoking because the postsynaptic neurons that pos- 
sess nAChRs are no longer stimulated at their usual level. The drug Chantix, 
which is marketed as an aid to stop smoking, acts by binding to the most com- 
mon version of brain nAChR (one with «482 subunits). Once bound, the Chantix 
molecule partially stimulates the receptor while preventing binding of nicotine. 


Electric ray 
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FIGURE 11 The electric organs of Torpedo consist of stacks of 
modified neuromuscular junctions located on each side of the body. 


Source: From Z. W. Hall, An Introduction to Neurobiology, Sinauer 
Associates, Inc., Sunderland, MA © 1992. 


and tightly to this particular protein. Such a compound was dis- 
covered in 1963 by Chen-Yuan Lee and his colleagues of the 
National Taiwan University. The compound was «-bungarotoxin, a 
substance present in the venom of a Taiwanese snake. The a-bun- 
garotoxin causes paralysis by binding tightly to the nAChRs on 
the postsynaptic membrane of skeletal muscle cells, blocking the 
response of the muscle to ACh.4 

Equipped with labeled a-bungarotoxin to use in an assay, 
electric organs as a source, and a detergent capable of solubilizing 
membrane proteins, a number of investigators were able to iso- 
late the acetylcholine receptors in the early 1970s. In one of these 
studies,° the membranes containing the nAChR were isolated by 
homogenizing the electric organs in a blender and centrifuging 
the suspension to pellet the membrane fragments. Membrane 
proteins were extracted from the membrane fragments using 
Triton X-100 (page 126), and the mixture was passed through a 
column containing tiny beads coated with a synthetic compound 
whose end bears a structural resemblance to ACh (FIGURE 2a). 
As the mixture of dissolved proteins passed through the column, 
two proteins that have binding sites for acetylcholine, nAChR and 
acetylcholinesterase (AChE), stuck to the beads. The remaining 
90 percent of the protein in the extract failed to bind to the beads 
and simply passed through the column and was collected (Figure 
2b). Once this group of proteins had passed through, a solution of 
10° M flaxedil was passed through the column, which selectively 
removed the nAChR from the beads, leaving the AChE behind. 
Using this procedure, the acetylcholine receptor as measured 
by bungarotoxin binding was purified by more than 150-fold in 
a single step. This type of procedure is known as affinity chroma- 
tography, and its general use is discussed in Section 18.11. 

The next step was to determine something about the struc- 
ture of the acetylcholine receptor. Studies in the laboratory of 
Arthur Karlin at Columbia University determined that the nAChR 
was a pentamer, a protein consisting of five subunits. Each receptor 
contained two copies of a subunit called «, and one copy each 
of three other subunits. The subunits could be distinguished by 
extracting membrane proteins in Triton X-100, purifying the nAChR 
by affinity chromatography, and then subjecting the purified pro- 
tein to electrophoresis through a polyacrylamide gel (SDS-PAGE, 
as discussed in Section 18.13), which separates the individual 
polypeptides according to size (FIGURE 3).° The four different sub- 
units proved to be homologous to one another, each subunit con- 
taining four homologous transmembrane helices (M1—M4). 


FIGURE 2 Steps used in the isolation of the 
nAChR. (a) Structure of a synthetic compound, 
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form an affinity column. The ends of the compound 
projecting from the beads resemble acetylcholine, SEPHAROSE 
causing both acetylcholinesterase (AChE) and the 2B 
nicotinic acetylcholine receptor (nAChR) to bind to iis 

the beads. (b) When the Triton X-100 extract was 
passed through the column, both of the 
acetylcholine-binding proteins stuck to the beads, 
while the remaining dissolved protein (about 90 
percent of the total protein in the extract) passed 
directly through the column. Subsequent passage 
of a solution of 10-3 M flaxedil through the column 
released the bound nAChR, without disturbing the 
bound AChE (which was subsequently eluted with 
1M NaCl). 1.0 
Source: (a) From R. W. Olsen, J.-C. Meunier, and 

J.-P. Changeux, FEBS Lett. 28:99, 1972. © 1972, 1.0 
with permission of Elsevier. 
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FIGURE 3 The top portion of the figure shows an SDS-polyacrylamide 
gel following electrophoresis of a preparation of the purified nAChR. 
The receptor consists of four different subunits whose molecular weights 
are indicated. Prior to electrophoresis, the purified receptor preparation 
was incubated with a radioactive compound (3H-MBTA) that resembles 
acetylcholine and binds to the acetylcholine-binding site of the nAChR. 
Following electrophoresis, the gel was sliced into 1-mm sections and the 
radioactivity of each slice determined. All of the radioactivity was bound 
to the 39,000 dalton subunit, indicating this subunit contains the ACh- 
binding site. The dotted line indicates the light absorbance of each fraction, 
which provides a measure of the total amount of protein present in that 
fraction. The heights of the peaks provide a measure of the relative amounts 
of each of the subunits in the protein. All of the subunits are present in 
equal numbers except the smallest subunit (the « subunit, which contains 
the ACh-binding site), which is present in twice the number of copies. 
Source: From C. L. Weill, M. G. McNamee, and A. Karlin, Biochem. 
Biophys. Res. Commun. 61:1002, 1974, Reproduced with permission of 
Elsevier. 
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Another important milestone in the study of the nAChR was 
the demonstration that the purified receptor acted as both a site for 
binding ACh and a channel for the passage of cations. It had been 
postulated years earlier by Jean-Pierre Changeux of the Pasteur 
Institute in Paris that the binding of ACh to the receptor caused 
a conformational change that opened an ion channel within the 
protein. The inward flux of Na‘ ions through the channel could 
then lead to a depolarization of the membrane and the activation 
of the muscle cell. During the last half of the 1970s, Changeux and 
his colleagues succeeded in incorporating purified nAChR mole- 
cules into artificial lipid vesicles.” Using vesicles containing various 
concentrations of labeled sodium and potassium ions, they dem- 
onstrated that binding of ACh to the receptors in the lipid bilayer 
initiated a flux of cations through the “membrane.” It was evi- 
dent that “the pure protein does indeed contain all the structural 
elements needed for the chemical transmission of an electrical 
signal—namely, an acetylcholine binding site, an ion channel and 
a mechanism for coupling their activity.” A personal narrative of 
the early work on nAChR can be found in Reference 8. 

Since that time, researchers have focused on determining 
the structure of the nAChR and the mechanism by which binding 
of acetylcholine induces the opening of the ion gate. Analysis of 
the structure has taken several different paths. In one approach, 
scientists have used purified genes, determination of amino 
acid sequences, and site-directed mutagenesis to determine 
the specific parts of the polypeptides that span the membrane, 
or bind the neurotransmitter, or form the ion channel. These 
noncrystallographic studies on the molecular anatomy of a pro- 
tein are similar in principle to those described in Section 4.5. 

Another approach employed the electron microscope. The 
first glimpses of the nAChR were seen in electron micrographs of 
the membranes of the electric organs (FIGURE 4).? The receptors 


continued 


149 


skemujed |Pe}USWHAdXy e Z|'p 


150 


aueiqua| eluse|q4 a4} JO UO!}DUNY Puke sINjoON4IS PUL e ~YALdVHD 


appeared to be ring shaped, with a diameter of 8 nm and a 
central channel of 2-nm diameter, and protruded out from the 
lipid bilayer into the external space. An increasingly detailed 
model of the nAChR has been developed by Nigel Unwin and 
his colleagues of the Medical Research Council of England.'°"* 
These researchers employed the technique of electron crystal- 
lography (Section 18.16), which involves a mathematical analysis 
of large numbers of electron micrographs of frozen membranes 
containing the proteins being studied. Electric organs are well 
suited for electron crystallography because they contain highly 
ordered (crystalline) arrays of the nAChR within the plane of the 
lipid bilayer. As a result, Unwin was able to analyze the structure of 
the nAChR as it exists in its native lipid environment. He described 
the arrangement of the five subunits around a central channel 
(FIGURE 5). The ion channel consists of a narrow pore lined by a 
wall composed of five inner (M2) « helices, one from each of the 
surrounding subunits. The gate to the pore lies near the middle of 
the membrane, where each of the M2 a helices bends inward (at 
the apex of the V-shaped bars in Figure 5a) to form a kink in the 
unactivated receptor. The side chain of a leucine residue projects 
inward from each kink. In this model, the leucine residues from 
the five inner helices form a tight hydrophobic ring that prevents the 
ions from crossing the membrane. The gate opens following the 
binding of two ACh molecules, one per a subunit. Each ACh 
binds to a site located within a pocket of an a subunit (Figure 5b). 

To study the changes in the nAChR during channel open- 
ing, Unwin carried out the following experiment.'* Preparations 
of nAChR-rich membranes were applied to a support grid that 
was allowed to fall into a bath of liquid nitrogen-cooled ethane, 
which freezes the membranes. Approximately 5 msec before they 
reached the surface of the freezing bath, the grids were sprayed 
with a solution of ACh, which bound to the receptors and trig- 
gered the conformational change required to open the channel. 
By comparing electron micrographs of nAChRs trapped in the 


Cell 
‘membrane 


(a) (b) 


0.1 um 


FIGURE 4 Electron micrograph of negatively stained, receptor-rich 
membranes from the electric organ of an electric fish showing the 
dense array of nAChR molecules. Each receptor molecule is seen as 

a small whitish circle with a tiny black dot in its center; the dot corresponds 
to the central channel, which has collected electron-dense stain. 

Source: From Werner Schiebler and Ferdinand Hucho, Eur. J. Biochem. 
85:58, 1978, reprinted with permission John Wiley & Sons. 


open versus closed state, Unwin found that ACh binding trig- 
gers a small conformational change in the extracellular domains 
of the receptor subunits near the two ACh binding sites. This 
conformational change is then propagated down the protein, 
causing the M2 helices that line the ion-conduction pore to 
straighten out (FIGURE 6), widening the channel and exposing 
more hydrophilic residues, thus allowing Na* ions to enter the 
cell.'54 Other models for channel opening have been proposed 


lon channel 


FIGURE 5 (a) An electron density map of a slice through the nAChR obtained by analyzing electron micrographs of tubular crystals of Torpedo 
membranes embedded in ice. These analyses have allowed researchers to reconstruct the three-dimensional appearance of a single nAChR protein as 
it resides within the membrane. The continuous contours indicate lines of similar density greater than that of water. The two dark, bar-shaped lines 
represent the « helices that line the channel at its narrowest point. (b) Schematic diagram of the nAChR showing the arrangement of the subunits and a 
cross-sectional representation of the protein. Each of the five subunits contains four membrane-spanning helices (M1—M4). Of these, only the inner 


helix (M2) lines the pore, and is the subject of the remainder of the discussion. 


Source: From N. Unwin, J. Mol. Biol. 229:1118, 1993; © 1993, with permission of Elsevier. 
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FIGURE 6 Ribbon drawings illustrating the changes that occur within 
the nAChR conducting channel upon binding of acetylcholine. Only the 
two alpha subunits of the receptor are shown. In the closed state (left) 

the pore is blocked (yellow patch) by the close apposition of a ring of 
hydrophobic residues. The diameter of the pore at its narrowest point is 
about 6 A, which is not sufficient for a hydrated Na’ ion to pass. This 
constriction arises because the inner transmembrane helices are slightly 
bent. Although it is wide enough for passage of a dehydrated Na* ion (blue 
sphere), the wall of the channel lacks the polar groups that would be 
required to substitute for the displaced shell of water molecules (as occurs in 
the selectivity filter of the K* channel, Figure 4.39). Following binding of 
ligand, a conformational change in the ligand-binding domain of the alpha 
subunits (not shown), causes the inner transmembrane helices of the subunits 
(blue) to straighten out, expanding the diameter of the pore, which allows 
the flow of Na* ions through the open channel (right). 


Source: From N. Unwin, Quart Rev Biophys 46:283, 2013. 


by other laboratories, and a more definitive statement about 
the mechanism of gating will probably require a high-resolution 
X-ray crystal structure of the protein in both the open and closed 
states. Discussion of various models and the bases for them can 
be found in References 15-17. 
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4.13 Facilitated Diffusion 


Substances always diffuse across a membrane from a region of 
higher concentration on one side to a region of lower concentration 
on the other side, but they do not always diffuse through the lipid 
bilayer or through a channel. In many cases, the diffusing substance 
first binds selectively to a membrane-spanning protein, called a 
facilitative transporter, that facilitates the diffusion process. The 
binding of the solute to the facilitative transporter on one side of the 
membrane is thought to trigger a conformational change in the pro- 
tein, exposing the solute to the other surface of the membrane, from 
where it can diffuse down its concentration gradient. This mecha- 
nism is illustrated in FIGURE 4.44. Because they operate passively, 
that is, without being coupled to an energy-releasing system, facili- 
tated transporters can mediate the movement of solutes equally well 
in both directions. The direction of net flux depends on the relative 
concentration of the substance on the two sides of the membrane. 
Facilitated diffusion, as this process is called, is similar in 
many ways to an enzyme-catalyzed reaction. Like enzymes, facilita- 
tive transporters are specific for the molecules they transport, 


discriminating, for example, between D and L stereoisomers (page 43). 
In addition, both enzymes and transporters exhibit saturation-type 
kinetics (FIGURE 4.45). Unlike ion channels, which can conduct mil- 
lions of ions per second, most facilitative transporters can move only 
hundreds to thousands of solute molecules per second across the 
membrane, so once the concentration of ions becomes very large, the 
transport rate levels off to this maximum value, and the transporters 
are said to be saturated. Another important feature of facilitative 
transporters is that, like enzymes and ion channels, their activity can 
be regulated. Facilitated diffusion is particularly important in 
mediating the entry and exit of polar solutes, such as sugars and 
amino acids, that do not penetrate the lipid bilayer. This is illustrated 
in the following section. 

The glucose transporter is an example of facilitated diffusion. 
Glucose is the body’s primary source of direct energy, and most 
mammalian cells contain a membrane protein that facilitates the dif- 
fusion of glucose from the bloodstream into the cell (as depicted in 
Figures 4.44 and 4.49). A gradient favoring the continued diffusion 
of glucose into the cell is maintained by phosphorylating the sugar 
after it enters the cytoplasm, thus lowering the intracellular glucose 
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152 Glucose specifically GLUT4. When insulin levels are low, these cells contain 


— relatively few glucose transporters on their plasma membrane. 
Instead, the transporters are present within the membranes of cyto- 
Binding plasmic vesicles. Rising insulin levels act on target cells to stimulate 


the fusion of the cytoplasmic vesicles to the plasma membrane, which 
moves transporters to the cell surface where they can bring glucose 
into the cell (see Figure 15.26). 
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4.14 Active Transport 


Life cannot exist under equilibrium conditions (section 3.4). Now- 

Peer: here is this more apparent than in the imbalance of ions across the 

Dissociation ‘ ; ; , 
plasma membrane. The differences in concentration of the major 


— ions between the outside and inside of a typical mammalian cell are 


shown in Table 4.3. The ability of a cell to generate such steep con- 
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FIGURE 4.44 Facilitated diffusion. A schematic model for the facili- centration gradients across its plasma membrane cannot occur by 
tated diffusion of glucose depicts the alternating conformation of a either simple or facilitated diffusion. Rather, these gradients must be 
carrier that exposes the glucose binding site to either the inside or generated by active transport. 

outside of the membrane. Like facilitated diffusion, active transport depends on integral 
Source: Reprinted from S. A. Baldwin and G. E. Lienhard, Trends Biochem. membrane proteins that selectively bind a particular solute and 
Sci. 6:210, 1981. Used with permission from Elsevier, Ltd. move it across the membrane in a process driven by changes in the 


protein’s conformation. Unlike facilitated diffusion, however, move- 
ment of a solute against a gradient requires the coupled input of 
energy. Consequently, the endergonic movement of ions or other 
solutes across the membrane against a concentration gradient is 
coupled to an exergonic process, such as the hydrolysis of ATP, the 
absorbance of light, the transport of electrons, or the flow of other 
substances down their gradients. Proteins that carry out active 
transport are often referred to as “pumps.” 
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Protein-mediated transport 
(facilitated diffusion) 
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Primary Active Transport: Coupling 
Transport to ATP Hydrolysis 


Simple diffusion 


In 1957, Jens Skou, a Danish physiologist, discovered an ATP- 
= hydrolyzing enzyme in the nerve cells of a crab that was only active in 
Solute concentration the presence of both Na‘ and K* ions. Skou proposed, and correctly so, 


that this enzyme, which was responsible for ATP hydrolysis, was the 
FIGURE 4.45 The kinetics of facilitated diffusion as compared to that 


of simple physical diffusion. 
TABLE 4.3 lon Concentrations Inside and Outside of a Typical 

concentration. Humans have at least five related proteins (isoforms) Mammalian Cell 

that act as facilitative glucose transporters. These isoforms, termed. 


ae : : : i Extracellular Intracellular lonic 
GLUT1 to GLUTS, are distinguished by the tissues in which they are concentration concenttaGon gradient 
located, as well as their kinetic and regulatory characteristics. onan an i 
Insulin is a hormone produced by endocrine cells of the pancreas 5 mM Linen ee 
that plays a key role in maintaining proper blood sugar levels. An 120 mM 10 mM 12x 
increase in blood glucose levels triggers the secretion of insulin, 10°M 107M 10,000x 


10’4M 10’7?M Nearly 2x 
(pH of 7.4) (pH of 7.2) 


which stimulates the uptake of glucose into various target cells, most 
notably skeletal muscle and fat cells (adipocytes). Insulin-responsive 
cells share a common isoform of the facilitative glucose transporter, The ion concentrations for the squid axon are given on page 167. 
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same protein that was active in transporting the two ions; the enzyme 
was called the Na*/K*-ATPase, or the sodium-—potassium pump. 

Unlike the protein-mediated movement of a facilitated diffu- 
sion system, which will carry the substance equally well in either 
direction, active transport drives the movement of ions in only one 
direction. It is the Na*/K*-ATPase that is responsible for the large 
excess of Na* ions outside of the cell and the large excess of K* ions 
inside the cell. The positive charges carried by these two cations are 
balanced by negative charges carried by various anions so that the 
extracellular and intracellular compartments are, for the most part, 
electrically neutral. Cl" ions are present at greater concentration out- 
side of cells, where they balance the extracellular Na* ions. The 
abundance of intracellular K* ions is balanced primarily by excess 
negative charges carried by proteins and nucleic acids. 

The ratio of Na*:K* pumped by the Na*/K*-ATPase is not 1:1, 
but 3:2 (see FIGURE 4.46). In other words, for each ATP hydrolyzed, 
three sodium ions are pumped out as two potassium ions are pumped 


FIGURE 4.46 The Na*/K*-ATPase. (a) Simplified schematic model of the 
transport cycle as described in the text. Note that the actual Na*/K*- 
ATPase is composed of at least two different membrane-spanning subunits: 
a larger a subunit, which carries out the transport activity, and a smaller 

6 subunit, which functions primarily in the maturation and assembly of the 
pump within the membrane. A third (y) subunit may also be present. (b) A 
model of the E2 conformation of the protein based on an X-ray crystallo- 
graphic study. The cation binding sites are located deep within the 
transmembrane domain, which consists of 10 membrane-spanning helices. 
The two rubidium ions are located where the potassium ions would 
normally be bound. 

Source: (b) From Ayako Takeuchi, courtesy of David C. Gadsby, Nature 
450: 958, 2007, © 2007 reprinted by permission from Macmillan 
Publishers Ltd. 


in. Because of this pumping ratio, the Na*/K*-ATPase is electrogenic, 
which means that it contributes directly to the separation of charge 
across the membrane. The Na*/K*-ATPase is an example of a P-type 
ion pump. The “P” stands for phosphorylation, indicating that, dur- 
ing the pumping cycle, the hydrolysis of ATP leads to the transfer of 
the released phosphate group to an aspartic acid residue of the 
transport protein. As denoted by the name of the protein, this reac- 
tion is catalyzed by the pump itself. 

Consider the activity of the protein. It must pick up sodium or 
potassium ions from a region of low concentration, which means that 
the protein must have a relatively high affinity for the ions. Then the 
protein must release the ions on the other side of the membrane into 
a much greater concentration of each ion. To do this, the affinity of 
the protein for that ion must decrease. Thus, the affinity for each ion 
on the two sides of the membrane must be different. This is achieved 
by ATP hydrolysis and the subsequent release of ADP, which induces 
a large conformational change within the protein molecule. The 
change in protein structure—between the E, and E, conformations— 
also serves to alternately expose the ion binding sites to the opposite 
side of the membrane, as discussed in the following paragraph. 

A proposed scheme for the pumping cycle of the Na*/K*-ATPase 
is shown in Figure 4.46a, which is based on X-ray crystallographic 
structures of P-type ion pumps. In step 1, Figure 4.46a, the pump is in 
the E, conformation, and the ion binding sites are accessible to the 
inside of the cell. In this step, the protein has bound three Na” ions 
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pump into an occluded E, state in which the Na* ions can no longer 
flow back into the cytosol. Hydrolysis of the bound ATP (step 2-3) 
and release of ADP (step 3-4) cause the change from the E, to the E, 
conformation. In doing so, the binding site becomes accessible to the 
extracellular compartment, and the protein loses its affinity for Na* 
ions, which are then released outside the cell. Once the three Na* ions 
have been released, the protein picks up two Kions (step 5). The clos- 
ing of another gate within the protein shifts the pump into an occluded 
state (step 6), preventing back flow of the K* ions into the extracellu- 
lar space. Occlusion is followed by dephosphorylation (step 67) and 
ATP binding (step 8), which induces the return of the protein to the 
original E, conformation. In this state, the binding site is open to the 
internal surface of the membrane and has lost its affinity for K* ions, 
leading to the release of these ions into the cell. The cycle is then 
repeated. A model of the crystal structure of the Na‘/K*-ATPase 
structure is shown in Figure 4.46b. Because they require complex 
conformational changes, these active transport pumps move ions 
across membranes at rates that are several orders of magnitude lower 
than their flow through ion channels. 

The sodium-potassium pump is found only in animal cells. This 
protein is thought to have evolved in primitive animals as the primary 
means to maintain cell volume and as the mechanism to generate the 
steep Na* and K* gradients that play such a key role in the formation 
of impulses in nerve and muscle cells. These same ionic gradients are 
used in nonexcitable cells to power the movement of other solutes as 
discussed below. The importance of the sodium-potassium pump 
becomes evident when one considers that it consumes approximately 
one-third of the energy produced by most animal cells and two- 
thirds of the energy produced by nerve cells. Digitalis, a steroid 
obtained from the foxglove plant that has been used for 200 years as 
a treatment for congestive heart disease, binds to the Na*/K*-ATPase. 
Digitalis strengthens the heart’s contraction by inhibiting the Na*/K* 
pump, which leads to a chain of events that increases Ca”* availability 
inside the muscle cells of the heart. 
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Other Primary lon Transport Systems 


The best studied P-type pump is the Ca?*-ATPase whose three- 
dimensional structure has been determined at several stages of the 
pumping cycle. The calcium pump is present in the membranes of 
the endoplasmic reticulum, where it actively transports calcium ions 
out of the cytosol into the lumen of this organelle. Plant cells have a 
H*-transporting, P-type, plasma membrane pump. In plants, this 
proton pump plays a key role in the secondary transport of solutes 
(discussed later), in the control of cytosolic pH, and possibly in con- 
trol of cell growth by means of acidification of the plant cell wall. 

The epithelial lining of the stomach also contains a P-type 
pump, the H*/K*-ATPase, which secretes a solution of concentrated 
acid (up to 0.16 N HCl) into the stomach chamber. In the resting 
state, these pump molecules are situated in cytoplasmic membranes 
of the parietal cells of the stomach lining and are nonfunctional 
(FIGURE 4.47). When food enters the stomach, a hormonal message 
is transmitted to the parietal cells that causes the pump-containing 
membranes to move to the apical cell surface, where they fuse with 
the plasma membrane and begin secreting acid (Figure 4.47). In 
addition to functioning in digestion, stomach acid can also lead to 
heartburn. Prilosec and related drugs are widely used to prevent 
heartburn by inhibiting the stomach’s H*/K*-ATPase. Other acid- 
blocking heartburn medications (e.g., Zantac, Pepcid, and Tagamet) 
do not inhibit the H*/K*-ATPase directly, but block a receptor on 
the surface of the parietal cells, thereby stopping the cells from 
becoming activated by the hormone. 

Unlike P-type pumps, V-type pumps utilize the energy of ATP 
without forming a phosphorylated protein intermediate. V-type 
pumps actively transport hydrogen ions across the walls of cytoplas- 
mic organelles and vacuoles (hence the designation V-type). They 
occur in the membranes that line lysosomes, secretory granules, and 
plant cell vacuoles where they maintain the low pH of the contents. 
V-type pumps have also been found in the plasma membranes of a 
variety of cells. For example, a V-type pump in the plasma membranes 
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FIGURE 4.47 Control of acid secretion in the stomach. In the resting state, the H*/K*-ATPase molecules are present in the walls of cytoplasmic vesicles. 
Food entering the stomach triggers a cascade of hormone-stimulated reactions in the stomach wall leading to the release of histamine, which binds to a 
receptor on the surface of the acid-secreting parietal cells. Binding of histamine to its receptor stimulates a response that causes the H*/K*-ATPase- 
containing vesicles to fuse to the plasma membrane, forming deep folds, or canaliculi. Once at the surface, the transport protein is activated and pumps 
protons into the stomach cavity against a concentration gradient (indicated by the size of the letters). The heartburn drugs Prilosec, Nexium, and Prevacid 
block acid secretion by directly inhibiting the H*/K*-ATPase, whereas several other acid-blocking medications interfere with activation of the parietal cells. 
Acid-neutralizing medications provide basic anions that combine with the secreted protons. 


of kidney tubules helps maintain the body’s acid-base balance by 
secreting protons into the forming urine. V-type pumps are large 
multi-subunit complexes (shown in Figure 4.4d) and similar in 
structure to that of the ATP synthase shown in Figure 5.23. 

Another diverse group of proteins that actively transport ions is 
the ATP-binding cassette (ABC) transporters, so called because all of 
the members of this superfamily share a homologous ATP-binding 
domain. The best studied ABC transporter is described in the 
accompanying Human Perspective. 


Using Light Energy to Actively 
Transport lons 
Halobacterium salinarium (or H. halobium) is an archaebacterium 


that lives in extremely salty environments, such as that found in 
the Great Salt Lake. When grown under anaerobic conditions, the 


FIGURE 4.48 Bacteriorhodopsin: a light-driven proton pump. The 
protein contains seven membrane-spanning helices and a centrally located 
retinal group (shown in purple), which serves as the light-absorbing element 
(chromophore). Absorption of a photon of light causes a change in the 
electronic structure of retinal, leading to the transfer of a proton from the 
—NH* group to a closely associated, negatively charged aspartic acid 
residue (#85) (step 1). The proton is then released to the extracellular side 
of the membrane (step 2) by a relay system consisting of several amino acid 
residues (Asp82, Glu204, and Glu194). The spaces between these residues 
are filled with hydrogen-bonded water molecules that help shuttle protons 
along the pathway. The deprotonated retinal is returned to its original state 
(step 3) when it accepts a proton from an undissociated aspartic acid 
residue (Asp96) located near the cytoplasmic side of the membrane. Asp96 
is then reprotonated by a H* from the cytoplasm (step 4). Asp85 is 
deprotonated (step 5) prior to receiving a proton from retinal in the next 
pumping cycle. As a result of these events, protons move from the 
cytoplasm to the cell exterior through a central channel in the protein. 


Source: From Hartmut Luecke et al., courtesy of Janos K. Lanyi, Science 
286:255, 1999; © 1999, reprinted with permission from AAAS. 


plasma membranes of these prokaryotes take on a purple color 155 
due to the presence of one particular protein, bacteriorhodopsin, 
which acts as a light-driven proton pump. As shown in FIGURE 4.48, 
bacteriorhodopsin contains retinal, the same prosthetic group present 
in rhodopsin, the light-absorbing protein of the rods of the vertebrate 
retina. The absorption of light energy by the retinal group induces a 
series of conformational changes in the protein that cause a proton 
to move from the retinal group, through a channel in the protein, to 
the cell exterior (Figure 4.48). The proton donated by the photo- 
excited retinal is replaced by another proton transferred to the pro- 
tein from the cytoplasm. In effect, this process results in the 
translocation of protons from the cytoplasm to the external environ- 
ment, thereby generating a steep H* gradient across the plasma 
membrane. This gradient is subsequently used by an ATP-synthesizing 
enzyme to phosphorylate ADP, as described in the next chapter. 
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Secondary Active Transport 
(or Cotransport): Coupling 
Transport to Existing lon Gradients 


The establishment of concentration gradients, such as those of Na’, 
Kk", and H*, provides a means by which free energy can be stored in 
a cell. The potential energy stored in ionic gradients is utilized by a 
cell in various ways to perform work, including the transport of other 
solutes. Consider the physiologic activity of the intestine. Within its 
lumen, enzymes hydrolyze high-molecular-weight polysaccharides 
into simple sugars, which are absorbed by the epithelial cells that line 
the intestine. The movement of glucose across the apical plasma mem- 
brane of the epithelial cells, against a concentration gradient, occurs 
by cotransport with sodium ions, as illustrated in FIGURE 4.49. 
The Na* concentration is kept very low within the cells by the action 
of a primary active transport system (the Na*/K*-ATPase), located 
in the basal and lateral plasma membrane, which pumps sodium 
ions out of the cell against a concentration gradient. The tendency 
for sodium ions to diffuse back across the apical plasma membrane 
down their concentration gradient is “tapped” by the epithelial cells 
to drive the cotransport of glucose molecules into the cell against a 
concentration gradient. The glucose molecules are said to be driven 
by secondary active transport. In this case, the transport protein, 
called a Na*/glucose cotransporter, moves two sodium ions and one 
glucose molecule with each cycle. Once inside, the glucose mole- 
cules diffuse through the cell and are moved across the basal mem- 
brane by facilitated diffusion (page 152). 

To appreciate the power of an ion gradient in accumulating 
other types of solutes in cells, we can briefly consider the energetics 
of the Na*/glucose cotransporter. Recall from page 140 that the free- 
energy change for the movement of a mole of Na” ions into the cell 
is equal to —3.1 kcal/mol, and thus 6.2 kcal for two moles of Na* 
ions, which would be available to transport one mole of glucose 
uphill into the cell. Recall also from page 139 that the equation for 
the movement of a nonelectrolyte, such as glucose, across the 
membrane is 


AG =RTine 
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FIGURE 4.49 Secondary transport: the use of energy stored in an ionic 
gradient. The Na*/K*-ATPase residing in the plasma membrane of the lateral 
surface maintains a very low cytosolic concentration of Na*. The Na* gradient 
across the plasma membrane represents a storage of energy that can be 
tapped to accomplish work, such as the transport of glucose by a Na*/glucose 
cotransporter located in the apical plasma membrane. Once transported 
across the apical surface into the cell, the glucose molecules diffuse to the 
basal surface where they are carried by a glucose facilitative transporter out of 
the cell and into the bloodstream. The relative size of the letters indicates the 
directions of the respective concentration gradients. Two Na* ions are 
transported for each glucose molecule; the 2:1 Na*/glucose provides a much 
greater driving force for moving glucose into the cell than a 1:1 ratio. 


Using this equation, we can calculate how steep a concentration gra- 
dient of glucose (X) that this cotransporter can generate. At 25°C, 


-6.2 kcal/mol =1.4 kcal/mol - logy) X 
logio X =—4.43 
ol 
~ 23,000 


This calculation indicates that the Na*/glucose cotransporter is 
capable of transporting glucose into a cell against a concentration 
gradient greater than 20,000-fold. 


FIGURE 4.50 A schematic model of the transport cycle of a 
secondary transporter. Four different conformational states during the 
transport cycle of a bacterial symporter of the LeuT family are shown. The 
protein actively transports the amino acid leucine into the cell using the 
established Na* ion gradient as its source of energy. In step 1, the outer 
gate in the protein is open, whch allows both Na* and leucine to reach 
their binding sites from the extracellular space. In step, 2, the outer gate 
closes, occluding the substrates within the protein. In step 3, a second 
leucine molecule binds to another site just outside the outer gate. In step 
4, the inner gate opens and the substrates are released into the 
cytoplasm. The protein returns to its original state when the inner gate is 
closed and the outer gate is opened. 

Source: Reprinted by permission from Macmillan Publishers Ltd: Nature, 465: 
171, 2010 by N. K. Karpowich and Da-Neng Wang (NYU School of Medicine). 


Plant cells rely on secondary active transport systems to take up 
a variety of nutrients, including sucrose, amino acids, and nitrate. In 
plants, uptake of these compounds is coupled to the downhill, 
inward movement of H* ions rather than Na* ions. The secondary 
active transport of glucose into the epithelial cells of the intestine 
and the transport of sucrose into a plant cell are examples of symport, 
in which the two transported species (Na* and glucose or H* and 
sucrose) move in the same direction. Numerous cotransporters have 
been isolated that engage in antiport, in which the two transported 
species move in opposite directions. For example, cells often main- 
tain a proper cytoplasmic pH by coupling the inward, downhill 
movement of Na* with the outward movement of H*. Cotransporters 
that mediate antiport are usually called exchangers. The three- 
dimensional structures of a number of secondary transporters have 
been solved in recent years, and, like the Na‘/K*-ATPase, they 
exhibit a transport cycle in which the proteins binding sites gain 
alternating access to the cytoplasm and the extracellular space. A 
proposed model of the transport cycle of one of the major families of 
secondary transporters is shown in FIGURE 4.50. 


REVIEW #7 


1. Describe two ways in which energy is utilized to move 
ions and solutes against a concentration gradient. 


2. How does the Na*/K*-ATPase illustrate the asymmetry 
of the plasma membrane? 


3. What is the role of phosphorylation in the mechanism 
of action of the Na*/K*-ATPase? 


4. What is the structural relation between the parts of the 
prokaryotic KcsA K* channel and the eukaryotic 
voltage-regulated K* channel? Which part of the 
channel is involved in ion selectivity, which part in 
channel gating, and which part in channel inactivation? 
How does each of these processes (ion selectivity, 
gating, and inactivation) occur? 


5. Because of its smaller size, one would expect Na” ions to 
be able to penetrate any pore large enough for a K" ion. 


How does the K* channel select for this specific ion? 
oe“ 


Extracellular 
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Cytoplasm 


4.15 THE HUMAN PERSPECTIVE 


Defects in lon Channels and Transporters 
as a Cause of Inherited Disease 


Several severe, inherited disorders have been traced to muta- 
tions in genes that encode ion channel proteins (Table 1). Most of 
the disorders listed in Table 1 affect the movement of ions across 
the plasma membranes of excitable cells (i.e., muscle, nerve, and 
sensory cells), reducing the ability of these cells to develop or 
transmit impulses (page 160). In contrast, cystic fibrosis, the best 
studied and most common inherited ion channel disorder, results 
from a defect in the ion channels of epithelial cells. 

On average, 1 out of every 25 persons of Northern European 
descent carries one copy of the mutant gene that can cause 
cystic fibrosis. Because they show no symptoms of the mutant 
gene, most heterozygotes are unaware that they are carriers. 
Consequently, approximately 1 out of every 2500 infants in this 
Caucasian population (1/25 x 1/25 x 1/4) is homozygous reces- 
sive at this locus and born with cystic fibrosis (CF). Although cystic 
fibrosis affects various organs, including the intestine, pancreas, 
sweat glands, and reproductive tract, the respiratory tract usually 
exhibits the most severe effects. Victims of CF produce a thick- 
ened, sticky mucus that is very hard to propel out of the airways. 
Afflicted individuals typically suffer from chronic lung infections and 
inflammation, which progressively destroy pulmonary function. 

The gene responsible for cystic fibrosis was isolated in 
1989. Once the sequence of the CF gene was determined and 
the amino acid sequence of the corresponding polypeptide was 
deduced, it was apparent that the polypeptide was a member of 
the ABC transporter superfamily. The protein was named cystic 
fibrosis transmembrane conductance regulator (CFTR), an ambig- 
uous term that reflected the fact that researchers weren't sure of 
its precise function. The question was thought to be answered 
after the protein was purified, incorporated into artificial lipid 
bilayers, and shown to act as a cyclic AMP-regulated chloride 
channel, not a transporter. But subsequent studies have added 
numerous complications to the story as it has been shown that, in 
addition to functioning as a chloride channel, CFTR also (1) con- 


FIGURE 1. An explanation for the 
debilitating effects on lung function 


from the absence of the CFTR protein. 
Hydrated 


In the airway epithelium of a normal mucous layer CFTR 


individual, water flows out of the epithelial 
cells in response to the outward move- 
ment of ions, thus hydrating the surface 
mucous layer. The hydrated mucous layer, 
with its trapped bacteria, is readily moved 
out of the airways. In the airway epithe- 
lium of a person with cystic fibrosis, the 
abnormal movement of ions causes water 
to flow in the opposite direction, thus 
dehydrating the mucous layer. As a result, 
trapped bacteria cannot be moved out of 
the airways, which allows them to 
proliferate as a biofilm (page 13) and 
cause chronic infections. 
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! Epithelial 
cell 


Airway epithelium from normal individual 


ducts bicarbonate (HCO3) ions, (2) suppresses the activity of an 
epithelial Na* ion channel (ENaC), and (3) stimulates the activity 
of a family of epithelial chloride/bicarbonate exchangers. As the 
role of CFTR has become more complex, it has become difficult 
to establish precisely how a defect in this protein leads to the 
development of chronic lung infections. While there is consider- 
able debate, many researchers would agree with the following 
statements. 

Because the movement of water out of epithelial cells by 
osmosis follows the movement of salts, abnormalities in the flux 
of Cl’, HCO37, and/or Na* caused by CFTR deficiency leads to 
a decrease in the fluid that bathes the epithelial cells of the air- 
ways (FIGURE 1). A reduction in volume of the surface liquid, and 
a resulting increase in viscosity of the secreted mucus, impair 
the function of the cilia that push mucus and bacteria out of 
the respiratory tract. Many CF patients are helped by inhaling a 
mist of hypertonic saline, which helps to draw more water into 
the airways and reduce the viscosity of the mucus. Clinical trials 
are also being carried out on compounds that have the poten- 
tial to increase the volume of surface fluid by altering the move- 
ments of ions in and out of the epithelial cells (Figure 1). One such 
compound is the ENaC Na*-channel inhibitor named GS-9411, 
which was hoped to help treat CF by reducing Na’® ion absorption 
from the airway fluid into the epithelium. Unfortunately, this drug 
caused elevated potassium in the blood, and the clinical trials 
had to be terminated, illustrating the challenges of treating a dis- 
ease by targeting ion channels with broad roles in physiology. It 
was therefore hoped that better understanding of the genetics of 
CF might lead to development of more precisely targeted drugs 
that could treat CF without causing wide-ranging alterations in 
body chemistry. 

In the past decade, researchers have identified more 
than 1000 different mutations that give rise to cystic fibrosis. 
However, approximately 70 percent of the alleles responsible 
for cystic fibrosis in the United States contain the same ge- 
netic alteration (designated AF508)—they are all missing three 
base pairs of DNA that encode a phenylalanine at position 508, 
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TABLE 1 


Inherited disorder Type of channel 


Gene 


Clinical consequences 


Familial hemiplegic migraine (FHM) Gas 
Episodic ataxia type-2 (EA-2) Ca”* 
Hypokalemic periodic paralysis Ca™* 
Episodic ataxia type-1 Kt 
Benign familial neonatal convulsions K* 
Nonsyndromic dominant deafness K* 
Long QT syndrome 


Hyperkalemic periodic paralysis 
Liddle Syndrome 

Myasthenia gravis 

Dent’s disease 

Myotonia congenita 

Bartter’s syndrome type IV 
Cystic fibrosis 

Cardiac arrhythmias 


See Nature Cell Biol. 6:1040, 2004, or Nature 440:444, 2006, for a more complete list. 


within one of the cytoplasmic domains of the CFTR polypeptide. 
Subsequent research has revealed that CFTR polypeptides 
lacking this particular amino acid fail to be processed normally 
within the membranes of the endoplasmic reticulum and, in 
fact, never reach the surface of epithelial cells. As a result, CF 
patients who are homozygous for the AF508 allele completely 
lack the CFTR channel in their plasma membranes and have a 
severe form of the disease. When cells from these patients ar 
S 


grown in culture at lower temperature, the mutant protein 
transported to the plasma membrane where it functions quite 
well. This finding has prompted a number of drug companies to 
screen for small molecules that can bind to these mutant CFTR 
molecules, preventing their destruction in the cytoplasm and 
allowing them to reach the cell surface. One of these candidate 
drugs, VX-809, failed to show improvements in patient outcome 
in clinical trials. Another drug named Kalydeco (also known 
as ivacaftor), binds to CFTR and keeps the ion channel open. 
This drug is clinically approved for treatment of CF in patients 
who carry the G551D amino acid substitution, which doesn’t 
prevent CFTR from reaching the surface but instead reduces 
the ability of the channel to open. Kalydeco overcomes this 
defect. Unfortunately only 4 percent of CF patients carry the 
G551D mutation. However, clinical trials suggest that Kalydeco 
in combination with VX-809 can be used to treat patients with 
the AF508 allele, presumably by helping those channels that do 
reach the surface to stay open longer. 

According to one estimate, the AF508 mutation had to have 
originated more than 50,000 years ago to have reached such a 
high frequency in the population. The fact that the CF gene has 
reached this frequency suggests that heterozygotes may receive 
some selective advantage over those lacking a copy of the defec- 
tive gene. It has been proposed that CF heterozygotes may be 
protected from the effects of cholera. One difficulty with this pro- 
posal is that there is no record of cholera epidemics in Europe until 
the 1820s. An alternate proposal suggests that heterozygotes are 
protected from typhoid fever because the bacterium responsible 


CACNL1A4 
CACNL1A4 
CACNL1A3 
KCNA1 
KCNQ2 
KCNQ4 
HERG 
KCNQI, or 
SCN5A 
SCN4A 
B-ENaC 
nAChR 
CLCN5 
CLC-1 
CLC-Kb 
CFTR 
many different genes 


Migraine headaches 

Ataxia (lack of balance and coordination) 

Periodic myotonia (muscle stiffness) and paralysis 

Ataxia 

Epileptic convulsions 

Deafness 

Dizziness, sudden death from ventricular 
fibrillation 


Periodic myotonia and paralysis 
Hypertension (high blood pressure) 
Muscle weakness 

Kidney stones 

Periodic myotonia 

Kidney dysfunction, deafness 


Lung congestion and infections 
Irregular or rapid heartbeat 


for this disease adheres poorly to the wall of an intestine having a 
reduced number of CFTR molecules. 

Ever since the isolation of the gene responsible for CF, the 
development of a cure by gene therapy—that is, by replacement 
of the defective gene with a normal version—has been a major 
goal of CF researchers. Cystic fibrosis is a good candidate for gene 
therapy because the worst symptoms of the disease result from 
the defective activities of epithelial cells that line the airways and, 
therefore, are accessible to agents that can be delivered by inha- 
lation of an aerosol. Clinical trials have been conducted using sev- 
eral different types of delivery systems. In one group of trials, the 
normal CFTR gene was incorporated into the DNA of a defective 
adenovirus, a type of virus that normally causes upper respiratory 
tract infections. The recombinant virus particles were then allowed 
to infect the cells of the airway, delivering the normal gene to the 
genetically deficient cells. The primary disadvantage in using ade- 
novirus is that the viral DNA (along with the normal CFTR gene) 
does not become integrated into the chromosomes of the infected 
host cell so that the virus must be readministered frequently. As 
a result, the procedure often induces an immune response within 
the patient that eliminates the virus and leads to lung inflamma- 
tion. Researchers are hesitant to employ viruses that integrate their 
genomes for fear of initiating the formation of cancers. In other 
trials, the DNA encoding the normal CFTR gene has been linked 
to positively charged liposomes (page 121) that can fuse with the 
plasma membranes of the airway cells, delivering their DNA con- 
tents into the cytoplasm. Lipid-based delivery has an advantage 
over viruses in being less likely to stimulate a destructive immune 
response following repeated treatments, but has the disadvan- 
tage of being less effective in achieving genetic modification of 
target cells. To date, none of the clinical trials of gene therapy has 
resulted in significant improvement of either physiologic processes 
or disease symptoms. The development of more effective DNA 
delivery systems, which are capable of genetically altering a greater 
percentage of airway cells, will be required if a treatment for CF 
based on gene therapy is to be achieved. 


4.16 Membrane Potentials 


All organisms respond to external stimulation, a property referred to 
as irritability. Even a single-celled amoeba, if poked with a fine glass 
needle, responds by withdrawing its pseudopodia, rounding up, and 
moving off in another direction. Irritability in an amoeba depends 
on the same basic properties of membranes that lead to the forma- 
tion and propagation of nerve impulses, which is the subject of the 
remainder of the chapter. 

Nerve cells (or neurons) are specialized for the collection, con- 
duction, and transmission of information, which is coded in the 
form of fast-moving electrical impulses. The basic parts of a typi- 
cal neuron are illustrated in FIGURE 4.51a. The nucleus of the neu- 
ron is located within an expanded region called the cell body, 
which is the metabolic center of the cell and the site where most of 
its material contents are manufactured. Extending from the cell 
bodies of most neurons are a number of fine extensions, called 
dendrites, which receive incoming information from external 
sources, typically other neurons. Also emerging from the cell body 
is a single, more prominent extension, the axon, which conducts 
outgoing impulses away from the cell body and toward the target 
cell(s). Although some axons may be only a few micrometers in 
length, others extend over much longer distances (Figure 4.51b) 
and in the body of a large vertebrate, such as a giraffe or whale, 
axons can be many meters long. Most axons split near their ends 
into smaller processes, each ending in a terminal knob—a special- 
ized site where impulses are transmitted from neuron to target cell. 
Many neurons in the brain end in thousands of terminal knobs, 
allowing these brain cells to communicate with thousands of 
potential targets. As discussed on page 118, most neurons in the 
vertebrate body are wrapped in a lipid-rich myelin sheath, whose 
function is described below. 
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The Resting Potential 


A voltage (or electric potential difference) between two points, such 
as the inside and outside of the plasma membrane, results when there 
is an excess of positive ions at one point and an excess of negative ions 
at the other point. Voltages across plasma membranes can be meas- 
ured by inserting one fine glass electrode (or microelectrode) into the 
cytoplasm ofa cell, placing another electrode in the extracellular fluid 
outside the cell, and connecting the electrodes to a voltmeter, an 
instrument that measures a difference in charge between two points 
(FIGURE 4.52). When this experiment was first carried out on a giant 
axon of the squid, a potential difference of approximately 70 millivolts 
(mV) was recorded, the inside being negative with respect to the out- 
side (indicated with a minus sign, -70 mV). The presence of a mem- 
brane potential is not unique to nerve cells; such potentials are 
present in all types of cells, the magnitude varying between about —15 
and —100 mV. When a nerve or muscle cell is in an unexcited state, the 
membrane potential is referred to as the resting potential because 
it is subject to dramatic change, as discussed in the following section. 

The magnitude and direction of the voltage across the plasma 
membrane are determined by the differences in concentrations of 
ions on either side of the membrane and their relative permeabili- 
ties. As described earlier in the chapter, the Na*/K*-ATPase pumps 
Na* out of the cell and K* into the cell, thereby establishing steep 
gradients of these two ions across the plasma membrane. Because of 
these gradients, you might expect that potassium ions would leak 
out of the cell and sodium ions would leak inward through their 
respective ion channels. However, the vast majority of the ion chan- 
nels in the plasma membrane of a resting nerve cell are closed. Those 
that are open are selective for K*; they are often referred to as K* leak 
channels. K* leak channels are members of the K2P family of K* 
channels, which lack the $4 voltage sensor (page 145) and fail to 
respond to changes in voltage. 


(b) 


FIGURE 4.51 The structure of a nerve cell. (a) Schematic drawing of a simple neuron with a myelinated axon. As the inset shows, the myelin sheath 
comprises individual Schwann cells that have wrapped themselves around the axon. The sites where the axon lacks myelin wrapping are called nodes of 
Ranvier. (Note: Myelin-forming cells within the central nervous system are called oligodendrocytes rather than Schwann cells.) (6) A composite micrograph 
of a single rat hippocampal neuron with cell body and dendrites (purple) and an axon 1 cm in length (red). Motor nerve cells in larger mammals can be 


100 times this length. 


Source: (b) From Carlos F. lbafiez, Trends Cell Biol. 17:520, 2007, © 2007, with permission from Elsevier. 
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FIGURE 4.52 Measuring a membrane’s resting potential. A potential is 
measured when a difference in charge is detected between the reference 
and recording electrodes. In (a), both electrodes are on the outside of the 
cell, and no potential difference (voltage) is measured. As one electrode 
penetrates the plasma membrane of the axon in (b), the potential immedi- 
ately drops to —70 mV (inside negative), which approaches the equilibrium 
potential for potassium ions, that is, the potential that would result if the 
membrane were impermeable to all ions except potassium. 


Because K* ions are the only charged species with significant 
permeability in a resting nerve cell, their outflow through the mem- 
brane leaves an excess of negative charges on the cytoplasmic side of 
the membrane. Although the concentration gradient across the 
membrane favors continued efflux of K", the electrical gradient 
resulting from the excess negative charge on the inside of the mem- 
brane favors the retention of K* ions inside the cell. When these two 
opposing forces are balanced, the system is at equilibrium, and there 
is no further net movement of K* ions across the membrane. Using 
the following equation, which is called the Nernst equation, one can 
calculate the membrane potential (V,,) that would be measured at 
equilibrium if the plasma membrane of a nerve cell were permeable 
only to K* ions.’ In this case, V,, would be equal to the potassium 
equilibrium potential (Ex): 


[Ko] 


[Ki'] 


RF 
Ex =2.303— -logyo 
ZF 


For a squid giant axon, the internal [K;*] is approximately 350 mM, 
while the external [K,*] is approximately 10 mM; thus at 25°C (298 
K) and z = +1 (for the univalent K* ion), 


Ex =59 log,) 0.028 =—-91 mV 


A similar calculation of the Na* equilibrium potential (Ey,) would 
produce a value of approximately +55 mV. Because measurements of 


’The Nernst equation is derived from the equation provided on page 140, by setting 
AG at zero, which is the case when the movement of the ions is at equilibrium. 


the voltage across the resting nerve membrane are similar in sign and 
magnitude (—70 mV) to the potassium equilibrium potential just cal- 
culated, the movement of potassium ions across the membrane is 
considered the most important factor in determining the resting 
potential. The difference between the calculated K* equilibrium poten- 
tial (-91 mV) and the measured resting potential (-70 mV, Figure 
4.52) is due to a slight permeability of the membrane to Na* through 
a recently described Na* leak channel. 


The Action Potential 


Our present understanding of membrane potentials and nerve 
impulses rests on a body of research carried out on the giant axons of 
the squid in the late 1940s and early 1950s by a group of British physi- 
ologists, most notably Alan Hodgkin, Andrew Huxley, and Bernard 
Katz. These axons, which are approximately 1 mm in diameter, carry 
impulses at high speeds, enabling the squid to escape rapidly from 
predators. The large size of these axons also made it easier to measure 
their electrical activity, compared to the microscopic axons of mam- 
malian neurons. If the membrane of a resting squid axon is stimu- 
lated by poking it with a fine needle or jolting it with a very small 
electric current, some of its sodium channels open, allowing a limited 
number of sodium ions to diffuse into the cell. This opportunity for 
positively charged ions to move into the cell reduces the membrane 
potential, making it less negative. Because the positive change in 
membrane voltage causes a decrease in the polarity between the two 
sides of the membrane, it is called a depolarization. 

If the stimulus causes the membrane to depolarize by only a few 
millivolts, say from —70 to -60 mV, the membrane rapidly returns to 
its resting potential as soon as the stimulus has ceased (FIGURE 4.53a, 
left box). If, however, the stimulus depolarizes the membrane beyond 
a certain point, called the threshold, which occurs at about —50 mV, 
then a new series of events is launched. The change in voltage causes 
the voltage-gated sodium channels to open. As a result, sodium ions 
diffuse freely into the cell (Figure 4.53a, middle box) down both 
their concentration and electric gradients. The increased permeability 
of the membrane to Na* ions and the corresponding movement of 
positive charge into the cell causes the membrane to reverse poten- 
tial briefly (Figure 4.53b), becoming positive at about +40 mV, which 
approaches the equilibrium potential for Na* (Figure 4.52). 

After approximately 1 msec, the sodium channels spontane- 
ously inactivate, blocking further influx of Na* ions. According to 
the prevailing view, inactivation results from the random diffusion 
of an inactivation peptide into the opening of the channel pore in a 
manner similar to that described for K* channels on page 147. 
Meanwhile, the change in membrane potential caused by Na* influx 
triggers the opening of the voltage-gated potassium channels 
(Figure 4.53a, right box) discussed on page 145. As a result, potas- 
sium ions diffuse freely out of the cell down their steep concentra- 
tion gradient. The decreased permeability of the membrane to Na* 
and the increased permeability to K* cause the membrane potential 
to swing back to a negative value of about —80 mV, approaching that 
of the K* equilibrium potential (Figure 4.52). The large negative 
membrane potential causes the voltage-gated potassium channels to 
close (see Figure 4.43b), which returns the membrane to its resting 
state. Collectively, these changes in membrane potential are called an 
action potential (Figure 4.53). The entire series of changes dur- 
ing an action potential takes only about 5 msec in the squid axon and 
less than 1 msec in a myelinated mammalian nerve cell. Following 
an action potential, the membrane enters a brief refractory period 


Resting potential Depolarization phase —_Repolarization phase during which it cannot be restimulated. The refractory period occurs 161 
sodium gates closed —_sodium gates open —_ potassium gates open because the sodium channels that were inactivated during the initial 
voltage = -70 mV voltage = +40 mV voltage = -80 mV : 

sine Ge a stage of the action potential must close before they can be reopened 

in response to another stimulus. As depicted in Figure 4.43, the 

transformation of the ion channel from the inactivated to the closed 

conformation can only occur after the inactivating peptide has 
swung out of the opening of the pore. 

Although the action potential changes membrane voltage dra- 

matically, only a minute percentage of the ions on the two sides of 


gate 


closed 


Pot es TR thettk --- vaneeie see the membrane are involved in any given action potential. The strik- 
leak channel ~~~ «3 % _— 7 ing changes in membrane potential seen in Figure 4.53b are not 
Time Time caused by changes in Na* and K* ion concentrations on the two sides 

: 2 of the membrane (such changes are insignificant). Rather, they are 


caused by the movements of charge in one direction or the other that 
result from the fleeting changes in permeability to these ions. Those 
Na* and K* ions that do change places across the membrane during 
an action potential are eventually pumped back by the Na*/K*- 
ATPase. Even if the Na*/K*-ATPase is inhibited, a neuron can often 
continue to fire thousands of impulses before the ionic gradients 
established by the pump’s activity are dissipated. 

Once the membrane of a neuron is depolarized to the threshold 
value, a full-blown action potential is triggered without further stim- 
ulation. This feature of nerve cell function is known as the all-or- 
none law. There is no in-between; subthreshold depolarization is 
incapable of triggering an action potential, whereas threshold depo- 
larization automatically elicits a maximum response. The energy 

0 1 2 required to create an action potential is stored ahead of time by the 
(a) Time (ms) Na*/K*-ATPase, which generates steep ionic gradients across the 
plasma membrane. Once that is accomplished, the various ions are 
poised to flow through the membrane down their respective electro- 
chemical gradients as soon as their ion channels are opened, just like 
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‘a water flowing from a dam once the flood-gates are released. 
2 The movements of ions across the plasma membrane of nerve 
2 cells form the basis for neural communication. Certain local anes- 
Fy thetics, such as procaine and novocaine, act by closing ion channels 
5 in the membranes of sensory cells and neurons. As long as these ion 
g channels remain closed, the affected cells are unable to generate 
action potentials and thus unable to inform the brain of events 
occurring at the skin or teeth. 
0 1 2 
(b) Time (ms) 
FIGURE 4.53 Formation of an action potential. (a) Time 1, upper left REVIEW! | 
box: The membrane in this region of the nerve cell exhibits the resting 1. What is a resting potential? How is it established as a 
potential, in which only the K* leak channels are open and the membrane result of ion flow? 


It i imately —70 mV. Time 2 iddle b h th ‘ é : 
Fe Ne ee ae 2. What is an action potential? What are the steps that 


depolarization phase: The membrane has depolarized beyond the . 
. e . lead to its various phases? 


threshold value, opening the voltage-regulated sodium gates, leading to 
an influx of Na* ions (indicated in the permeability change in the lower 

graph). The increased Na* permeability causes the membrane voltage to 

temporarily reverse itself, reaching a value of approximately +40 mV in the 


squid giant axon (time 2). It is this reversal of membrane potential that 


constitutes the action potential. Time 3, upper right box, shows the 4.1 7 Propagation of Action 
repolarization phase: Within a tiny fraction of a second, the sodium gates Potentials as an Impulse 


are inactivated and the potassium gates open, allowing potassium ions to 

diffuse across the membrane (lower part of the drawing) and establish an Up to this point, we have restricted the discussion to events occur- 
even more negative potential at that location (-80 mV) than that of the ring at a particular site on the nerve cell membrane where experi- 
mental depolarization has triggered an action potential. Once an 
action potential has been initiated, it does not remain localized at a 
particular site but is propagated as a nerve impulse down the 
length of the cell to the nerve terminals. 


resting potential. Almost as soon as they open, the potassium gates close, 
leaving the potassium leak channels as the primary path of ion movement 
across the membrane and reestablishing the resting potential. (6) A summary 
of the voltage changes that occur during an action potential, as described 
in part a. 
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FIGURE 4.54 Propagation of an impulse results from the local flow of 
ions. An action potential at one site on the membrane depolarizes an 
adjacent region of the membrane, triggering an action potential at the 
second site. The action potential can only flow in the forward direction 
because the portion of the membrane that has just experienced an action 
potential remains in a refractory period. 


Nerve impulses are propagated along a membrane because an 
action potential at one site has an effect on the adjacent site. The 
large depolarization that accompanies an action potential creates a 
difference in charge along the inner and outer surfaces of the plasma 
membrane (FIGURE 4.54). As a result, positive ions move toward the 
site of depolarization on the outer surface of the membrane and 
away from that site on the inner surface (Figure 4.54). This local flow 
of current causes the membrane in the region just ahead of the action 
potential to become depolarized. Because the depolarization accom- 
panying the action potential is very large, the membrane in the adja- 
cent region is readily depolarized to a level greater than the threshold 
value, which opens the sodium channels in this adjacent region, gen- 
erating another action potential. Thus, once triggered, a succession 
of action potentials passes down the entire length of the neuron 
without any loss of intensity, arriving at its target cell with the same 
strength it had at its point of origin. 

Because all impulses traveling along a neuron exhibit the same 
strength, stronger stimuli cannot produce “bigger” impulses than 
weaker stimuli. Yet, we are clearly capable of detecting differences in 
the strength of a stimulus. The ability to make sensory discrimina- 
tions depends on several factors. For example, a stronger stimulus 
such as scalding water activates more nerve cells than does a weaker 
stimulus such as warm water. It also activates “high-threshold” neu- 
rons that would remain at rest if the stimulus were weaker. Stimulus 
strength is also encoded in the pattern and frequency by which action 
potentials are launched down a particular neuron. In most cases, the 
stronger the stimulus, the greater the number of impulses generated. 

The greater the diameter of an axon, the less the resistance to 
local current flow and the more rapidly an action potential at one 
site can activate adjacent regions of the membrane. Some inverte- 
brates, such as squid and tube worms, have evolved giant axons that 
facilitate the animal’s escape from danger. There is, however, a limit 
to this evolutionary approach. Because the speed of conduction 
increases with the square root of the increase in diameter, an axon 
that is 480 um in diameter can conduct an action potential only four 
times faster than one that is 30 um in diameter. 

During the evolution of vertebrates, an increase in conduction 
velocity was achieved when the axon became wrapped in a myelin 
sheath (see Figures 4.5 and 4.51). Because it is composed of many 
layers of lipid-containing membranes, the myelin sheath is ideally 
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FIGURE 4.55 Saltatory conduction. During saltatory conduction, only the 
membrane in the nodal region of the axon becomes depolarized and 
capable of forming an action potential. This is accomplished as current flows 
directly from an activated node to the next resting node along the axon. 


suited to prevent the passage of ions across the plasma membrane. In 
addition, nearly all of the Na* ion channels of a myelinated neuron 
reside in the unwrapped gaps, or nodes of Ranvier, between adjacent 
Schwann cells or oligodendrocytes that make up the sheath (see 
Figure 4.51). Consequently, the nodes of Ranvier are the only sites 
where action potentials can be generated. An action potential at one 
node triggers an action potential at the next node (FIGURE 4.55), 
causing the impulse to jump from node to node without having to 
activate the intervening membrane. Propagation of an impulse by 
this mechanism is called saltatory conduction. Impulses are con- 
ducted along a myelinated axon at speeds up to 120 meters per sec- 
ond, which is more than 20 times faster than the speed that impulses 
travel in an unmyelinated neuron of the same diameter. 

The importance of myelination is dramatically illustrated by 
multiple sclerosis (MS), a disease associated with deterioration of 
the myelin sheath that surrounds axons in various parts of the nerv- 
ous system. Manifestations of the disease usually begin in young 
adulthood; patients experience weakness in their hands, difficulty in 
walking, and problems with their vision. 


REVIEW | 


1. How is an action potential propagated along an axon? 
What is saltatory conduction, and how does such a 
process occur? 


2. What is the role of the myelin sheath in conduction 
of an impulse? 


eo 


4.18 Neurotransmission: 
Jumping the Synaptic Cleft 


Neurons are linked with their target cells at specialized junctions 
called synapses. Careful examination of a synapse reveals that the 
two cells do not make direct contact but are separated from each 


other by a narrow gap of about 20 to 50 nm. This gap is called the _ transmitters that act on postsynaptic cells. Two of the best studied 163 
synaptic cleft. A presynaptic cell (a receptor cell or a neuron) neurotransmitters are acetylcholine and norepinephrine, 


conducts impulses toward a synapse, and a postsynaptic cell O 
(a neuron, muscle, or gland cell) always lies on the receiving side of | , 
a synapse. FIGURE 4.56 shows a number of synapses between the ter- CH3—C—O— CH2— CH2—N(CHs3)3 
minal branches of an axon and a skeletal muscle cell; synapses of this Accivichotins (ACH) 
type are called neuromuscular junctions. 
How does an impulse in a presynaptic neuron jump across the OH H 


synaptic cleft and affect the postsynaptic cell? Studies carried out | + 
decades ago indicated that a chemical substance is involved in HO i CH2—NH3 
the transmission of an impulse from one cell to another (page 148). The 

OH 
very tips (terminal knobs) of the branches of an axon appear in the 
electron microscope to contain large numbers of synaptic vesicles 
(Figure 4.56, left inset) that serve as storage sites for the chemical _ which transmit impulses to the body’s skeletal and cardiac muscles. 


Norepinephrine 


Membrane of postsynaptic target cell 


Synaptic vesicles 
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Don W. Fawcett/T. Reese/Photo Researchers, Inc. 
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FIGURE 4.56 The neuromuscular junction is the site where branches from a motor axon form synapses with the muscle fibers of a skeletal muscle. 
The left inset shows the synaptic vesicles residing within the terminal knob of the axon and the narrow synaptic cleft between the terminal knob and the 
postsynaptic target cell. The right inset shows the terminal knob pressed closely against the muscle cell plasma membrane. Neurotransmitter molecules (red) 
released from synaptic vesicles of the presynaptic neuron are binding to receptors (yellow) on the surface of the muscle cell (blue). 
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FIGURE 4.57 The sequence of events during synaptic transmission with acetylcholine as the neurotransmitter. During steps 1-4, a nerve impulse 
reaches the terminal knob of the axon, calcium gates open leading to an influx of Ca**, and acetylcholine is released from synaptic vesicles and binds to 
receptors on the postsynaptic membrane. If the binding of the neurotransmitter molecules causes a depolarization of the postsynaptic membrane (as in 5a), 
a nerve impulse may be generated there (6). If, however, the binding of neurotransmitter causes a hyperpolarization of the postsynaptic membrane (5b), the 
target cell is inhibited, making it more difficult for an impulse to be generated in the target cell by other excitatory stimulation. The breakdown of the 


neurotransmitter by acetylcholinesterase is not shown. 


The sequence of events during synaptic transmission can be 
summarized as follows (FIGURE 4.57). When an impulse reaches a 
terminal knob (step 1, Figure 4.57), the accompanying depolarization 
induces the opening of a number of voltage-gated Ca** channels in 
the plasma membrane of this part of the presynaptic nerve cell (step 2, 
Figure 4.57). Calcium ions are normally present at very low concen- 
tration within the neuron (about 100 nM), as in all cells. When the 
gates open, calcium ions diffuse from the extracellular fluid into the 
terminal knob of the neuron, causing the [Ca”*] to rise more than a 
thousandfold within localized microdomains near the channels. The 
elevated [Ca**] triggers the rapid fusion of one or a few nearby 
synaptic vesicles with the plasma membrane, causing the release of 
neurotransmitter molecules into the synaptic cleft (step 3, Figure 4.57). 

Once released from the synaptic vesicles, the neurotransmitter 
molecules diffuse across the narrow gap and bind selectively to 
receptor molecules that are concentrated directly across the cleft in 
the postsynaptic plasma membrane (step 4, Figure 4.57). A neuro- 
transmitter molecule can have one of two opposite effects depending 
on the type of receptor on the target cell membrane to which it binds:* 


1. The bound transmitter can trigger the opening of cation-selec- 
tive channels in the membrane, leading primarily to an influx of 
sodium ions and a less negative (more positive) membrane 


8Tt is important to note that this discussion ignores an important class of neurotrans- 
mitter receptors that are not ion channels and thus do not directly affect membrane 
voltage. This other group of receptors are members of a class of proteins called 
GPCRs, which are discussed at length in Section 15.3. When a neurotransmitter binds 
to one of these receptors, it can initiate a variety of responses, which often includes the 
opening of ion channels by an indirect mechanism. 


potential. This depolarization of the postsynaptic membrane 
excites the cell, making the cell more likely to respond to this or 
subsequent stimuli by generating an action potential of its own 
(Figure 4.57, steps 5a and 6). 


2. The bound transmitter can trigger the opening of anion-selective 
channels, leading mainly to an influx of chloride ions, and 
a more negative (hyperpolarized) membrane potential. Hyper- 
polarization of the postsynaptic membrane makes it less likely the 
cell will generate an action potential because greater sodium influx 
is subsequently required to reach the membrane’s threshold 
(Figure 4.57, step 5b). 


Most nerve cells in the brain receive both excitatory and inhibitory 
signals from many different presynaptic neurons. It is the summa- 
tion of these opposing influences that determine whether or not an 
impulse will be generated in the postsynaptic neuron. 

All terminal knobs of a given neuron release the same neuro- 
transmitter(s). However, a given neurotransmitter may have a stim- 
ulatory effect on one particular postsynaptic membrane and an 
inhibitory effect on another. Acetylcholine, for example, inhibits 
contractility of the heart but stimulates contractility of skeletal 
muscle. Within the brain, glutamate serves as the primary excitatory 
neurotransmitter and gamma-aminobutyric acid (GABA) as the pri- 
mary inhibitory neurotransmitter. A number of general anesthetics, 
as well as Valium and its derivatives, act by binding to the GABA 
receptor and enhancing the activity of the brain’s primary “off” 
switch. As will be evident in the following section, many different 
drugs affecting disorders, such as anxiety, depression, insomnia, and 
schizophrenia, exert their action at the synapse. 


Actions of Drugs on Synapses 


It is important that a neurotransmitter has only a short half-life follow- 
ing its release from a presynaptic neuron; otherwise, the effect of the 
neurotransmitter would be extended, and the postsynaptic neuron 
would not recover. A neurotransmitter is eliminated from the synapse 
in two ways: by enzymes that destroy neurotransmitter molecules in 
the synaptic cleft and by proteins that transport neurotransmitter mol- 
ecules back to the presynaptic terminals—a process called reuptake. 
Because of the destruction or reuptake of neurotransmitter molecules, 
the effect of each impulse lasts no more than a few milliseconds. 

Interfering with the destruction or reuptake of neuro- 
transmitters can have dramatic physiologic and behavioral effects. 
Acetylcholinesterase is an enzyme localized within the synaptic cleft 
that hydrolyzes acetylcholine. We began this chapter describing the 
horrific effects of the nerve gas sarin, which works by inhibiting ace- 
tylcholinesterase. When this enzyme is inhibited by exposure to 
sarin, the subsequent buildup of acetylcholine in the neuromuscular 
junctions throughout the body causes overstimulation of the adjoining 
postsynaptic muscle fibers. As a result, the skeletal muscles of the 
body, including those required for breathing, undergo violent 
contraction followed by muscle weakness and paralysis. Milder 
inhibitors of acetylcholinesterase are used to treat the symptoms of 
Alzheimer’s disease, which is characterized by the loss of acetylcho- 
line-releasing neurons. 

Many drugs act by inhibiting the transporters that sweep neuro- 
transmitters out of the synaptic cleft. A number of widely prescribed 
antidepressants, including Prozac and Zoloft, inhibit the reuptake of 
serotonin, a neurotransmitter implicated in mood disorders. 
Cocaine, on the other hand, interferes with the reuptake of the neu- 
rotransmitter dopamine that is released by certain nerve cells in a 
portion of the brain known as the limbic system. The limbic system 
contains the brain’s “pleasure” or “reward” centers. The sustained 
presence of dopamine in the synaptic clefts of the limbic system pro- 
duces a short-lived feeling of euphoria, as well as a strong desire to 
repeat the activity. Mice that have been genetically engineered to 
lack the dopamine transporter (DAT)—the protein responsible for 
dopamine reuptake—show behavior similar to that of normal mice 
that have been given cocaine or amphetamines. Administration of 
cocaine or amphetamines has no additional behavioral effects on 
animals lacking the DAT gene. Numerous other drugs act on presyn- 
aptic dopamine-releasing or postsynaptic dopamine-responding 
neurons. Amphetamines are thought to stimulate the excessive 
release of dopamine from presynaptic terminals and also interfere 
with the reuptake of the neurotransmitter molecules from the synaptic 
cleft. A number of antipsychotic drugs bind to certain subtypes of 
the dopamine receptor on postsynaptic neurons and block their 
stimulation by dopamine. 

The active compound in marijuana (A*-tetrahydrocannabinol) 
acts by a totally different mechanism. It binds to cannabinoid (CB1) 
receptors located on the presynaptic terminals of certain neurons of 
the brain, which reduces the likelihood that these neurons will 
release neurotransmitters. CB1 receptors normally interact with 
compounds produced in the body called endocannabinoids. 
Endocannabinoids are produced by postsynaptic neurons following 
depolarization. These substances diffuse “backwards” across the 
synaptic cleft to the presynaptic membrane, where they bind to CB1 
receptors, suppressing synaptic transmission. CB1 receptors are 
located in many areas of the brain, including the hippocampus, 


cerebellum, and hypothalamus, which explains the effects of mari- 165 


juana on memory, motor coordination, and appetite, respectively. If 
marijuana increases appetite by binding to CB1 receptors, it follows 
that blocking these receptors might decrease appetite. This line of 
reasoning led to the development of a CB1-blocking weight-loss 
drug called Acomplia, which has been pulled from the market 
because of side effects. 


Synaptic Plasticity 


Synapses are more than simply connecting sites between adjacent 
neurons; they are key determinants in the routing of impulses 
through the nervous system. The human brain is thought to contain 
at least one hundred trillion synapses. These synapses act like gates 
stationed along the various pathways, allowing some pieces of infor- 
mation to pass from one neuron to another, while holding back 
other pieces or rerouting them in another direction. While synapses 
are often perceived as fixed, unchanging structures, they can display 
aremarkable dynamic quality known as “synaptic plasticity.” Synaptic 
plasticity is particularly important during infancy and childhood, 
when the neuronal circuitry of the brain achieves its mature 
configuration. 

Synaptic plasticity is most readily observed in studies on neu- 
rons from the hippocampus, a portion of the brain that is vitally 
important in learning and short-term memory. When hippocampal 
neurons are repeatedly stimulated over a short period of time, the 
synapses that connect these neurons to their neighbors become 
“strengthened” by a process known as long-term potentiation (LTP), 
which may last for days, weeks, or even longer. Research into LTP 
has focused on the NMDA receptor, which is one of several receptor 
types that bind the excitatory neurotransmitter glutamate. When 
glutamate binds to a postsynaptic NMDA receptor, it opens an inter- 
nal cation channel within the receptor that allows the influx of Ca’* 
ions into the postsynaptic neuron, triggering a cascade of biochemi- 
cal changes that lead to synaptic strengthening. Synapses that have 
undergone LTP are able to transmit weaker stimuli and evoke 
stronger responses in postsynaptic cells. These changes are thought 
to play a major role as newly learned information or memories are 
encoded in the neural circuits of the brain. When laboratory animals 
are treated with drugs that inhibit LTP, such as those that interfere 
with the activity of the NMDA receptor, their ability to learn new 
information is greatly reduced. 

There are numerous other reasons the study of synapses is so 
important. For example, a number of diseases of the nervous system, 
including myasthenia gravis, Parkinson’s disease, schizophrenia, 
and even depression, are thought to have their roots in synaptic 
dysfunction. 


REVIEW | 


1. Describe the steps between the time an impulse 
reaches the terminal knob of a presynaptic neuron and 
an action potential is initiated in a postsynaptic cell. 

2. Contrast the roles of ion pumps and channels in 
establishing and using ion gradients, particularly as it 
applies to nerve cells. 
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10. 


WL. 


What types of integral proteins would you expect to reside in 
the plasma membrane of an epithelial cell that might be absent 
from that of an erythrocyte? How do such differences relate to 
the activities of these cells? 


. Many different types of cells possess receptors that bind steroid 


hormones, which are lipid-soluble molecules. Where in the cell 
do you think such receptors might reside? Where in the cell 
would you expect the insulin receptor to reside? Why? 


. We calculated on page 140 that when a solute is 10 times more 


concentrated outside a cell than inside, the free energy for mov- 
ing the solute into the cell is -1.4 kcal/mol. What would the free 
energy change be for moving the solute into the cell if the exter- 
nal concentration (outside the cell) was a thousand times more 
concentrated than the internal concentration (inside the cell)? 
How many times higher would the external concentration have 
to be compared to the internal concentration for the free energy 
of moving the solute into the cell to be -2.8 kcal/mol? If instead 
the free energy for moving solute into the cell was +2.8 kcal/mol, 
what would that tell you about the external concentration relative 
to the internal concentration? (See Quantitative Tutorial Video). 


. Suppose you were planning to use liposomes in an attempt to 


deliver drugs to a particular type of cell in the body, for exam- 
ple, a fat or muscle cell. Is there any way you might be able to 
construct the liposome to increase its target specificity? 


. How is it that, unlike polysaccharides such as starch and glyco- 


gen, the oligosaccharides on the surface of the plasma mem- 
brane can be involved in specific interactions? How is this 
feature illustrated by determining a person's blood type prior to 
receiving a transfusion? 


. Trypsin is an enzyme that can digest the hydrophilic portions of 


membrane proteins, but it is unable to penetrate the lipid 
bilayer and enter a cell. Because of these properties, trypsin has 
been used in conjunction with SDS-PAGE to determine which 
proteins have an extracellular domain. Describe an experiment 
using trypsin to determine the sidedness of proteins of the 
erythrocyte membrane. 


. Look at the scanning electron micrograph of erythrocytes in 


Figure 4.32a. These cells, which are flattened and have circular 
depressions on each side, are said to have a biconcave shape. 
What is the physiologic advantage of a biconcave erythocyte 
over a spherical cell with respect to O2 uptake? 


. Suppose you were culturing a population of bacteria at 15°C 


and then raised the temperature of the culture to 37°C. What 
effect do you think this might have on the fatty acid composi- 
tion of the membrane? on the transition temperature of the 
lipid bilayer? on the activity of membrane desaturases? 


. Looking at Figure 4.6, which lipids would you expect to have the 


greatest rate of flip-flop? which the least? Why? If you deter- 
mined experimentally that phosphatidylcholine actually exhib- 
ited the greatest rate of flip-flop, how might you explain this 
finding? How would you expect the rate of phospholipid flip- 
flop to compare with that of an integral protein? Why? 

What is the difference between a two-dimensional and a three- 
dimensional representation of a membrane protein? How are 
the different types of profiles obtained, and which is more use- 
ful? Why do you think there are so many more proteins whose 
two-dimensional structure is known? 

If you were to inject a squid giant axon with a tiny volume of 
solution containing 0.1 M NaCl and 0.1 M KCI in which both 
the Na* and K* ions were radioactively labeled, which of the 


12. 


13. 


14. 


15. 


16. 


IW. 


18. 


a9: 


20. 


Pa 


radioactively labeled ions would you expect to appear most 
rapidly within the seawater medium while the neuron remained 
at rest? After the neuron had been stimulated to conduct a 
number of action potentials? 

It has been difficult to isolate proteins containing water chan- 
nels (i.e., aquaporins) due to the high rate of diffusion of water 
through the lipid bilayer. Why would this make aquaporin isola- 
tion difficult? Is there any way you might be able to distinguish 
diffusion of water through the lipid bilayer versus that through 
aquaporins? The best approach to studying aquaporin behavior 
has been to express the aquaporin genes in frog oocytes. Is 
there any reason why the oocytes of a pond-dwelling amphibian 
might be particularly well suited for such studies? 

How is it that diffusion coefficients measured for lipids within 
membranes tend to be closer to that expected for free diffusion 
than those measured for integral proteins in the same 
membranes? 

Assume that the plasma membrane of a cell was suddenly per- 
meable to the same degree to both Na* and K* and that both 
ions were present at a concentration gradient of the same mag- 
nitude. Would you expect these two ions to move across the 
membrane at the same rates? Why or why not? 

Most marine invertebrates show no loss or gain of water by 
osmosis, whereas most marine vertebrates experience contin- 
ual water loss in their high-salt environment. Speculate on the 
basis for this difference and how it might reflect different path- 
ways of evolution of the two groups. 

How would you expect the concentrations of solute inside a 
plant cell to compare to that of its extracellular fluids? Would 
you expect the same to be true of the cells of an animal? 

What would be the consequence for impulse conduction if the 
Na* channels were able to reopen immediately after they had 
closed during an action potential? 


What would be the value of the potassium equilibrium potential 
if the external K* concentration was 200 mM and the internal 
concentration was 10 mM at 25°C? at 37°C? 

As discussed on page 155, the Na*/glucose cotransporter trans- 
ports two Na* ions for each glucose molecule. What if this ratio 
were 1:1 rather than 2:1; how would this affect the glucose con- 
centration that the transporter could work against? 

A transmembrane protein usually has the following features: (1) 
the portion that transits the membrane bilayer is at least 20 
amino acids in length, which are largely or exclusively nonpolar 
residues; (2) the portion that anchors the protein on the external 
face has two or more consecutive acidic residues; and (3) the 
portion that anchors the protein on the cytoplasmic face has 
two or more consecutive basic residues. Consider the trans- 
membrane protein with the following sequence: 


NH -MLSTGVKRKGAVLLILLFPWMVAGGPLFWLAADESTYKGS- 
COOH 


Draw this protein as it would reside in the plasma membrane. 
Make sure you label the N- and C-termini and the external and 
cytoplasmic faces of the membrane. (Single-letter code for 
amino acids is given in Figure 2.26.) 

Many marine invertebrates, such as squid, have extracellular flu- 
ids that resemble seawater and therefore have much higher 
intracellular ion concentrations than mammals. For a squid neu- 
ron, the ionic concentrations are roughly 


extracellular concentration 


10 mM 
440 mM 
100 mM 560 mM 
2x 10-* mM 10 mM 
76 8.0 


intracellular concentration 


350 mM 
40 mM 


f the resting potential of the plasma membrane, V,,, is —70 mV, 
are any of the ions at equilibrium? How far out of equilibrium, in 
mV, is each ion? What is the direction of net movement of each 
ion through an open channel permeable to that ion? 


22. The membrane potential V,, of a cell is determined by the relative 


permeability of the membrane to various ions. When acetylcho- 
ine binds to its receptors on the postsynaptic muscle membrane, 
it causes a massive opening of channels that are equally perme- 
able to sodium and potassium ions. Under these conditions, 


Vin = (Viet Vat) / 2 


23. 


24. 


25. 


If [K*,,] = 140 mM and [Na‘,,] = 10 mM for the muscle cell, and 167 


[Na* out) = 150 mM and [K* oy] = 5 mM, what is the membrane 
potential of the neuromuscular junction of an acetylcholine- 
stimulated muscle? 


Transmembrane domains consist of individual « helices or a B 
sheet formed into a barrel. Looking at Figures 2.30 and 2.31, why 
is a single « helix more suited to spanning the bilayer than a 
single B strand? 

Knowing how the K* channel selects for K* ions, suggest a 
mechanism by which the Na* channel is able to select for its ion. 


How would you compare the rate of movement of ions passing 
through a channel versus those transported actively by a P-type 
pump? Why? 
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Aerobic Respiration 
and the Mitochonarion 


Blood vessels of the heart and lungs. Oxygen poor blood (shown by the 
blue vessels) is exposed to oxygen in the lungs, where it becomes oxygen 
rich as indicated by the red vessels. 


Source: Ralph Hutchings / Getty Images, Inc. 
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WHY WE NEED TO BREATHE 


aoe breath you take, and every beat of your heart, 
happens because your cells need oxygen. To satisfy cellular 
oxygen demand, our blood runs red with hemoglobin. Much 
of human anatomy and physiology is devoted to ensuring an 
adequate oxygen supply, and providing enough oxygen is a 
major preoccupation of anesthesiologists, astronauts, and 
scuba divers. Exercise increases our need for oxygen, as 
everyone knows from experience. This oxygen is used to 
power cellular metabolism by providing energy through the 
biochemical pathway of respiration, much of which takes 
place within mitochondria which, as we recall from 

Chapter 1, arose from bacteria that took up residence in the 
early ancestors of eukaryotic cells. So the next time you find 
yourself gasping for breath after running up a flight of stairs, 
take a moment and contemplate the fact that the oxygen 
you are so desperately trying to suck in is being consumed 
by bacterially derived endosymbionts living inside your cells. 
In this chapter we will learn how oxygen is coupled to energy 
production, and what role the mitochondria play in the 
process. 


Mitochondrial Structure and 5.4 Oxidative Phosphorylation in the 5.9 Using the Proton Gradient 
Function Formation of ATP 5.10  Peroxisomes 
5.2 Aerobic Metabolism in the 5.5 Electron-Transport Complexes 5.11 THE HUMAN PERSPECTIVE: 
Mitochondrion 5.6 — Establishment of a Proton-Motive Diseases that Result from Abnormal 
5.3 THE HUMAN PERSPECTIVE: Force Mitochondrial or Peroxisomal 
The Role of Anaerobic and 5.7 The Structure of ATP Synthase Function 
Aerobic Metabolism 5.8 The Binding Change Mechanism 
in Exercise of ATP Formation 


5.1 Mitochondrial Structure 
and Function 


During the first two billion years or so that life existed on Earth, the 
atmosphere consisted largely of reduced molecules, such as molecu- 
lar hydrogen (H,), ammonia (NH;), and H,O. The Earth of this 
period was populated by anaerobes—organisms that captured and 
utilized energy by means of oxygen-independent (anaerobic) metab- 
olism, such as glycolysis and fermentation (Figures 3.24 and 3.29). 
Then, between 2.4 and 2.7 billion years ago, the cyanobacteria 
appeared—a new kind of organism that carried out a new type of 
photosynthetic process, one in which water molecules were split 
apart and molecular oxygen (O,) was released. At some point fol- 
lowing the appearance of the cyanobacteria, the Earth’s atmosphere 
began to accumulate significant levels of oxygen, which set the stage 
for a dramatic change in the types of organisms that would come to 
inhabit the planet. 

As discussed on page 34, molecular oxygen can be a very toxic 
substance, taking on extra electrons and reacting with a variety of 
biological molecules. The presence of oxygen in the atmosphere 
must have been a powerful agent for natural selection. Over time, 
species evolved that were not only protected from the damaging 
effects of molecular oxygen, but possessed metabolic pathways that 
utilized the molecule to great advantage. Without the ability to use 
oxygen, organisms could only extract a limited amount of energy 
from their foodstuffs, excreting energy-rich products such as lactic 
acid and ethanol, which they were unable to metabolize further. In 
contrast, organisms that incorporated O, into their metabolism 
could completely oxidize such compounds to CO, and H,O and, in 
the process, extract a much larger percentage of their energy con- 
tent. These organisms that became dependent on oxygen were the 
Earth’s first aerobes, and they eventually gave rise to all of the 
oxygen-dependent prokaryotes and eukaryotes living today. In 
eukaryotes, the utilization of oxygen as a means of energy extraction 
takes place in a specialized organelle, the mitochondrion. As discussed 
in Chapter 1, a massive body of data indicates that mitochondria 


Mitochondria — 


(a) 


have evolved from an ancient aerobic bacterium that took up 169 


residence inside the cytoplasm of an anaerobic host cell. 

Mitochondria are large enough to be seen in the light micro- 
scope (FIGURE 5.1a), and their presence within cells has been known 
for over a hundred years. Depending on the cell type, mitochondria 
can have a very different overall structure. At one end of the spec- 
trum, mitochondria can appear as individual, bean-shaped organelles 
(Figure 5.1b), ranging from 1 to 441m in length. At the other end of 
the spectrum, mitochondria can appear as a highly branched, inter- 
connected tubular network. Observations of fluorescently labeled 
mitochondria within living cells have shown them to be dynamic 
organelles capable of dramatic changes in shape. Most importantly, 
mitochondria can fuse with one another, or split in two (FIGURE 5.2a). 

It has been known for many years that mitochondria and the 
endoplasmic reticulum (ER) engage in extensive interactions, but it 
was a surprise to discover that mitochondrial fission is apparently 
induced by contact with thin tubules from the ER. As shown in 
Figure 5.2b, ER tubules can encircle the mitochondrion like a noose. 
These ER tubules appear to initiate constriction, which is then com- 
pleted through the action of soluble proteins that are recruited to the 
outer surface of the mitochondrion from the cytosol (Figure 5.2c). 
The balance between fusion and fission is likely a major determinant 
of mitochondrial number, length, and degree of interconnection. 
When fusion becomes more frequent than fission, the mitochondria 
tend to become more elongated and interconnected, whereas a pre- 
dominance of fission leads to the formation of more numerous and 
distinct mitochondria. A number of inherited neurologic diseases 
are caused by mutations in genes that encode components of the 
mitochondrial fusion machinery. 

Mitochondria occupy 15 to 20 percent of the volume of an aver- 
age mammalian liver cell and contain more than a thousand differ- 
ent proteins. These organelles are best known for their role in 
generating the ATP that is used to run most of the cell’s energy- 
requiring activities. To accomplish this function, mitochondria are 
often associated with fatty acid-containing oil droplets from which 
they derive raw materials to be oxidized. A particularly striking 


Mitochondria 


(c) 


FIGURE 5.1 Mitochondria. (a) A living fibroblast viewed with a phase-contrast microscope. Mitochondria are seen as elongated, dark bodies. 
(b) Transmission electron micrograph of a thin section through a mitochondrion revealing the internal structure of the organelle, particularly the membranous 
cristae of the inner membrane. (c) Localization of mitochondria in the midpiece surrounding the proximal portion of the flagellum of a bat sperm. 


Source: (a) Courtesy of Norman K. Wessels, Stanford University; (b) K. R. Porter/Photo Researchers, Inc.; (c) Don W. Fawcett/Photo Researchers, Inc. 
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(a) 


mouse fibroblast whose mitochondria have been labeled with a fluorescent 


protein. In the first three frames, two pairs of mitochondria (which have been 
artificially colored) contact end-to-end and immediately fuse. In the last three 
frames, the lower fusion product undergoes fission, and the daughter mitochon- 


dria move apart. (b) Three-dimensional model of contacts between the ER 
(green) and mitochondria (purple) in a yeast cell based on EM tomography 


(b) 


FIGURE 5.2 Mitochondrial fusion and fission. (a) The dynamic nature of these 
organelles is captured in the frames of this movie, which shows a portion of a 
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(Section 18.5). ER tubules are seen to encircle portions of the yeast mitochondrial 


network, marking sites of future fission. Similar events occur in mammalian cells. 


Bar equals 200 nm. (c) In the model depicted here, contact with ER tubules initiates the process of mitochondrial constriction (step 1). Fission is subsequently 
accomplished by proteins (e.g., Drp1 in mammals) that assemble into helices around the outer surface of the mitochondrion (step 2). Drp1 is a member of 
the dynamin family of GTP-binding proteins that are involved in severing membranous structures (see Figure 8.41). GTP binding and hydrolysis causes a 
conformational change in the Drp1 helices that fully constricts the mitochondrion, splitting it into two smaller organelles (step 3). 


Source: (a) From David C. Chan, Cell 125:1242, 2006; © 2006, with permission of Elsevier; (b) Image by M. West, from Jonathan R. Friedman, et al., Science 


334:359, 2011, reprinted with permission from AAAS. 


arrangement of mitochondria occurs in sperm cells, where they are 
often located in the midpiece, just behind the nucleus (Figure 5.1c). 
The movements of a sperm are powered by ATP produced in these 
mitochondria. Mitochondria are also prominent in many plant cells 
where they are the primary suppliers of ATP in nonphotosynthetic 
tissues, as well as being a source of ATP in photosynthetic leaf cells 
during periods of dark. 

While energy metabolism has been the focus of interest in the 
study of mitochondria, these organelles are also involved in other 
activities. For example, mitochondria are the sites of synthesis of 
numerous substances, including certain amino acids and the heme 
groups shown in Figures 2.35 and 5.12. Mitochondria also play a 
vital role in the uptake and release of calcium ions. Calcium ions are 
essential triggers for cellular activities (Section 15.5), and mitochon- 
dria (along with the endoplasmic reticulum) play an important role 
in regulating the Ca** concentration of the cytosol. For example, 
when Ca”* concentrations rise to abnormally high levels in the cyto- 
sol, a Ca”* transporter in the inner mitochondrial membrane takes 
up some of the excess Ca* ions. The process of cell death, which 
plays an enormous role in the life of all multicellular animals, is also 
regulated to a large extent by events that occur within mitochondria 
(Section 15.8). 


Mitochondrial Membranes 


If you take a close look at the electron micrograph of Figure 5.1), 
it can be seen that the outer boundary of a mitochondrion contains 
two membranes: the outer mitochondrial membrane and the inner 
mitochondrial membrane. The outer mitochondrial membrane 


completely encloses the mitochondrion, serving as its outer boundary. 
The inner mitochondrial membrane is subdivided into two major 
domains that have different protein residents and carry out distinct 
functions. One of these domains, called the inner boundary mem- 
brane, lies just inside the outer mitochondrial membrane, forming a 
double-membrane outer envelope. The inner boundary membrane is 
particularly rich in the proteins responsible for the import of mito- 
chondrial proteins (see Figure 8.47). The other domain of the inner 
mitochondrial membrane is present within the interior of the orga- 
nelle as a series of invaginated membranous sheets, called cristae. 
The role of mitochondria as energy transducers is intimately tied to 
the membranes of the cristae that are so prominent in electron micro- 
graphs of these organelles. The cristae contain a large amount of mem- 
brane surface, which houses the machinery needed for aerobic 
respiration and ATP formation (see Figure 5.22). The organization of 
the cristae is shown in a clearer profile in the scanning electron micro- 
graph of FIGURE 5.3a and the three-dimensional reconstruction of 
Figure 5.3b. The inner boundary membrane and internal cristal mem- 
branes are joined to one another by narrow tubular connections, or 
cristae junctions, as shown in the schematic illustrations of Figure 5.3c. 
Exactly how such complex organization is created is not yet fully 
known, but an inner membrane associated protein complex called 
MitOS, (also known as MICOS or MINOS) is located at the cristae 
junctions and is required for normal organization of cristae. 

The membranes of the mitochondrion divide the organelle into 
two aqueous compartments, one within the interior of the mitochon- 
drion, called the matrix, and a second between the outer and inner 
membrane, called the intermembrane space. The matrix has a 
gel-like consistency owing to the presence of a high concentration 
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Cristae junction 


FIGURE 5.3 The structure of a mitochondrion. (a) Scanning electron micrograph of a macerated mitochondrion, showing the internal matrix enclosed 
by folds of the inner membrane. (b) Three-dimensional reconstruction of a mitochondrion based on a series of micrographs taken with a high-voltage 
electron microscope of a single thick section of brown fat tissue that had been tilted at various angles. High-voltage instruments accelerate electrons at 
velocities that allow them to penetrate thicker tissue sections (up to 1.5 jum). This technique suggests that the cristae are present as flattened sheets 
(lamellae) that communicate with the intermembrane space by way of narrow tubular openings, rather than “wide-open” channels as is typically depicted. 
In this reconstruction, the inner mitochondrial membrane is shown in blue at the peripheral regions and in yellow when it penetrates into the matrix to 
form the cristae. (c) Schematic diagrams showing the three-dimensional internal structure (top) and a thin section (bottom) of a mitochondrion from bovine 


heart tissue. 


Source: (a) From K. Tanaka and T. Naguro, Int. Rev. Cytol. 68:111, 1980; (6) Courtesy of Guy A. Perkins and Terrence G. Frey. 


(up to 500mg/ml) of water-soluble proteins. The proteins of the 
intermembrane space are best known for their role in initiating cell 
suicide, a subject discussed in Section 15.17. 

The outer and inner membranes have very different properties. 
The outer membrane is composed of approximately 50 percent lipid 


by weight and contains a curious mixture of enzymes involved in 
such diverse activities as the oxidation of epinephrine, the degradation 
of tryptophan, and the elongation of fatty acids. In contrast, the 
inner membrane contains more than 100 different polypeptides and 
has a very high protein/lipid ratio (more than 3:1 by weight, which 
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The inner membrane is virtually devoid of cholesterol and rich in 
an unusual phospholipid, cardiolipin (diphosphatidylglycerol; see 
Figure 4.6 for the structure), which are characteristics of bacterial 
plasma membranes, from which the inner mitochondrial membrane 
has presumably evolved. Cardiolipin plays an important role in facil- 
itating the activity of several of the large protein complexes involved 
in electron transport and ATP synthesis. The outer mitochondrial 
membrane is thought to be homologous to an outer membrane 
present as part of the cell wall of certain bacterial cells. The outer 
mitochondrial membrane and outer bacterial membrane both contain 
porins, integral proteins that have a relatively large internal channel 
(e.g., 2-3 nm) surrounded by a barrel of B strands (FIGURE 5.4). The 
porins of the outer mitochondrial membrane are not static structures 
as was once thought but can undergo reversible closure in response 
to conditions within the cell. When the porin channels are wide 
open, the outer membrane is freely permeable to molecules such as 
ATP, NAD, and coenzyme A, which have key roles to play in energy 
metabolism within the mitochondrion. In contrast, the inner mito- 
chondrial membrane is highly impermeable; virtually all molecules 
and ions require special membrane transporters to gain entrance to 
the matrix. 

As will be discussed in following sections, the composition and 
organization of the inner mitochondrial membrane are the keys to 
the bioenergetic activities of the organelle. The architecture of the 
inner membrane and the apparent fluidity of its bilayer facilitate the 
interactions of components that are required for ATP formation. 


FIGURE 5.4 Porins. Gram-negative bacteria have a lipid-containing 
outer membrane outside of their plasma membrane as part of their 

cell wall. This outer membrane contains proteins, called porins, that 
consist of a barrel of B sheet and form an opening through which 
moderate-sized molecules can penetrate. This image shows the 

protein OmpW embedded in the outer membrane of E. coli. The 

porin contains a small hydrophobic compound within its central channel. 
A variety of porins having different sized channels and selectivities 

are also found in the outer mitochondrial membrane in eukaryotic 

cells. 

Source: From Heedeok Hong, et al., J. Biol. Chem. 281, cover of #11, 2006; 
© 2006, The American Society for Biochemistry and Molecular Biology; 
Image courtesy of Bert van den Berg. 


The Mitochondrial Matrix 


In addition to an array of enzymes, the mitochondrial matrix also 
contains ribosomes (of considerably smaller size than those found in 
the cytosol) and several molecules of DNA, which is circular in 
higher plants and animals (Figure 5.3c). Thus, mitochondria possess 
their own genetic material and the machinery to manufacture their 
own RNAs and proteins. This nonchromosomal DNA is important 
because it encodes a small number of mitochondrial polypeptides 
(13 in humans) that are tightly integrated into the inner mitochon- 
drial membrane along with polypeptides encoded by genes residing 
within the nucleus. Human mitochondrial DNA also encodes 2 ribo- 
somal RNAs and 22 tRNAs that are used in protein synthesis within 
the organelle. Mitochondrial DNA (mtDNA) is a relic of ancient his- 
tory. It is thought to be the legacy from a single aerobic bacterium 
that took up residence in the cytoplasm of a primitive cell that ulti- 
mately became an ancestor of all eukaryotic cells (page 26). Most of 
the genes of this ancient symbiont were either lost or transferred 
over the course of evolution to the nucleus of the host cell, leaving 
only a handful of genes to encode some of the most hydrophobic 
proteins of the inner mitochondrial membrane. It is interesting to 
note that the RNA polymerase that synthesizes the various mito- 
chondrial RNAs is not related to the multisubunit enzyme found in 
prokaryotic and eukaryotic cells. Instead, the mitochondrial RNA 
polymerase is a single subunit enzyme similar in many respects to 
certain bacterial viruses (bacteriophages) from which it appears to 
have evolved. For a number of reasons, mtDNA is well suited for use 
in the study of human migration and evolution. A number of com- 
panies, for example, employ mtDNA sequences to trace the ancestry 
of clients who are searching for their ethnic or geographic roots. 

We will return later in the chapter to discuss the molecular archi- 
tecture of mitochondrial membranes, but first let us consider the role 
of these organelles in the basic oxidative pathways of eukaryotic cells, 
which is summarized in FIGURE 5.5. It might help to examine this 
overview figure and read the accompanying legend before moving on 
to the following detailed descriptions of these pathways. 


REVIEW | 


1. Describe the metabolic changes in oxidative 
metabolism that must have accompanied the 
evolution and success of the cyanobacteria. 


2. Compare the properties and evolutionary history of 
the inner and outer mitochondrial membranes; the 
intermembrane space and the matrix. 


eo 


5.2 Aerobic Metabolism 
in the Mitochondrion 


In Chapter 3 we described the initial stages in the oxidation of carbo- 
hydrates. Beginning with glucose, the first steps of the oxidation pro- 
cess are carried out by the enzymes of glycolysis, which are located in 
the cytosol (Figure 5.5). The 10 reactions that constitute the glycolytic 
pathway were illustrated in Figure 3.24; the major steps of the 
pathway are summarized in FIGURE 5.6. Only a small fraction of the 
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membrane 


FIGURE 5.5 An overview of carbohydrate metabolism in eukaryotic cells. The reactions of glycolysis generate pyruvate and NADH in the cytosol. In the 
absence of Oo, the pyruvate is reduced by NADH to lactate (or another product of fermentation, such as ethanol in yeast; see Figure 3.29 for details). The 
NAD* formed in the reaction is reutilized in the continuation of glycolysis. In the presence of Oz, the pyruvate moves into the matrix (facilitated by a 
membrane transporter), where it is decarboxylated and linked to coenzyme A (CoA), a reaction that generates NADH. The NADH produced during 
glycolysis donates its high-energy electrons to a compound that crosses the inner mitochondrial membrane (as shown in Figure 5.9). The acetyl CoA passes 
through the TCA cycle (as shown in Figure 5.7), which generates NADH and FADH:z. The electrons in these various NADH and FADH2 molecules are passed 
along the electron-transport chain, which is made up of carriers that are embedded in the inner mitochondrial membrane, to molecular oxygen (O,).The 
energy released during electron transport is used in the formation of ATP by a process discussed at length later in the chapter. If all of the energy from 
electron transport were to be utilized in ATP formation, approximately 36 ATPs could be generated from a single molecule of glucose. 


free energy available in glucose is made available to the cell during 
glycolysis—enough for the net synthesis of only two molecules of 
ATP per molecule of glucose oxidized (Figure 5.6). Most of the energy 
remains stored in pyruvate. Each molecule of NADH produced 
during the oxidation of glyceraldehyde 3-phosphate (reaction 6, 
Figure 5.6) also carries a pair of high-energy electrons.' The two 
products of glycolysis—pyruvate and NADH—can be metabolized in 
two very different ways, depending on the type of cell in which they 
are formed and the presence or absence of oxygen. 

In the presence of O, aerobic organisms are able to extract large 
amounts of additional energy from the pyruvate and NADH 
produced during glycolysis—enough to synthesize more than 30 
additional ATP molecules. This energy is extracted in mitochondria 
(Figure 5.5). We will begin with pyruvate and return to consider the 
fate of the NADH later in the discussion. Each pyruvate molecule 
produced by glycolysis is transported across the inner mitochondrial 
membrane and into the matrix, where it is decarboxylated to form a 
two-carbon acetyl group (—CH;COO ). The acetyl group is trans- 
ferred to coenzyme A (a complex organic compound derived from 
the vitamin pantothenic acid) to produce acetyl CoA. 


Pyruvate +HS -CoA+NAD* - acetyl CoA +CO, +NADH+H* 


'The terms high-energy electrons and low-energy electrons are not always well received 
by biochemists, but they convey a useful image. As discussed on page 180, high- 
energy electrons are ones that are held with lower affinity and are more readily trans- 
ferred from a donor to an acceptor than are low-energy electrons. 


The decarboxylation of pyruvate and transfer of the acetyl group to 
CoA (Figures 5.5 and 5.7) along with the reduction of NAD* is 
catalyzed by the giant multienzyme complex pyruvate dehydrogenase, 
whose structure was shown in Figure 2.41. 


The Tricarboxylic Acid (TCA) Cycle 


Once formed, acetyl CoA is fed into a cyclic pathway, called the tri- 
carboxylic acid (TCA) cycle, where the substrate is oxidized and 
its energy conserved. Other than succinate dehydrogenase, which is 
bound to the inner membrane, all the enzymes of the TCA cycle 
reside in the soluble phase of the matrix (Figure 5.5). The TCA cycle is 
also referred to as the Krebs cycle after the British biochemist Hans 
Krebs, who worked out the pathway in the 1930s, or the citric acid 
cycle, after a key molecule formed in the pathway. 

The first step in the TCA cycle is the condensation of the two- 
carbon acetyl group with a four-carbon oxaloacetate to form a six- 
carbon citrate molecule (FIGURE 5.7, step 12). During the cycle, the 
citrate molecule is decreased in chain length, one carbon at a time, 
regenerating the four-carbon oxaloacetate molecule, which can con- 
dense with another acetyl CoA. It is the two carbons that are removed 
during the TCA cycle (which are not the same ones that were brought 
in with the acetyl group) that are completely oxidized to carbon 
dioxide. During the TCA cycle, four reactions occur in which a pair 
of electrons are transferred from a substrate to an electron-accepting 
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Glucose 4 


HO 

e 
Glucose is phosphorylated at the expense of one ATP, rearranged 
structurally to form fructose phosphate, and then phosphorylated again at 

2 ] the expense of a second ATP. The two phosphate groups are situated at 
the two ends (C1, C6) of the fructose chain. 

eo f 

CH,OPO3, 


) 
Fructose 1,6-bis- << 
phosphate H OH 
OH H 
© y The six-carbon bisphosphate is split into two three-carbon monophosphates. 
° 7 
H 
Glyceraldehyde 
3-phosphate HCOH 
(molecules) CH,0P03 
The three-carbon aldehyde is oxidized to an acid as the electrons removed 
e from the substrate are used to reduce the coenzyme NAD* to NADH. In 


1,3-Bisphospho- 


addition, the C1 acid is phosphorylated to form an acyl phosphate, which 
has a high phosphate group-transfer potential (denoted by the yellow shading). 


glycerate HCOH 
(molecules) b,0P02 
2 ADP . : 
7) The phosphate group from Cl is transferred to ADP forming ATP by _ 
Bite substrate-level phosphorylation. Two ATPs are formed per glucose oxidized. 
0 
too 
3-Phospho- co 
glycerate HCOH 
(molecules) CH,0POF 
8] y These reactions result in the rearrangement and dehydration of the 
substrate to form an enol phosphate at the C2 position that has a high 
9) y phosphate group-transfer potential. 
1 
Phosphoenol- o 
pyruvate C-0-PO 
ll 
(molecules) CH 
2 ADP The phosphate group is transferred to ADP forming ATP by substrate-evel 
® ( phosphorylation, generating a ketone at the C2 position. Two ATPs are 
0 hie formed per glucose oxidized. 
ll - 
—0O- Net reaction: 
Pyruvate ( 0 
(molecules) C=0 Glucose + 2 NAD* + 2 ADP + 2 P, —» 2 Pyruvate + 2 ATP + 2 NADH + 2 H* + 2,0 
| 
CH; 


FIGURE 5.6 An overview of glycolysis showing some of the key steps. These include the two reactions in which phosphate groups are transferred from 
ATP to the six-carbon sugar to produce fructose 1,6-bisphosphate (steps 1, 3); the oxidation and phosphorylation of glyceraldehyde 3-phosphate to 
produce 1,3-bisphosphoglycerate and NADH (step 6); and the transfer of phosphate groups from three-carbon phosphorylated substrates to ADP to 
produce ATP by substrate phosphorylation (steps 7 and 10). Keep in mind that two molecules of glyceraldehyde 3-phosphate are formed from each 
molecule of glucose, so reactions 6 through 10 shown here occur twice per glucose molecule oxidized. 


FIGURE 5.7 The tricarboxylic acid (TCA) cycle. The 
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cycle begins with the condensation of oxaloacetate bo 
(OAA) and acetyl CoA (reaction 12). The carbons of ol 
; C=0 
these two compounds are marked with numbers or 4 on 
letters. The two carbons lost during passage through CH3 . 
the cycle are derived from oxaloacetate. The standard Pyruvate > 
free energies (in kcal/mol) and names of enzymes are 8 
also provided. Five pairs of electrons are removed from HS-CoA 2 
substrate molecules by pyruvate dehydrogenase and NAD* en z 
the enzymes of the TCA cycle. These high-energy Ht dehydrogenase “ 
electrons are transferred to NAD* or FAD and then ‘ 5 
passed down the electron-transport chain for use in “NADH”: ay g 
ATP production. The reactions shown here begin with = 
number 11 because the pathway continues where the ' S—CoA + 
last reaction of glycolysis (number 10 of Figure 5.6) al = . 
leaves off. Aes a 
' CH, Q 
® | Acetyl CoA *co0- Al 
Malate tee 3 a 
via dehydrogenase COO H,0 HS-CoA CH, ° 
coenzyme. Three of the reactions reduce NAD* to HOCH imi ome H6CC00- 
NADH, and one reaction reduces FAD to FADH, | ; y| ® y! 
(Figure 5.7). The net equation for the reactions of the re NAD*  H* we Citrate ve 
TCA cycle can be written as COO- ee COO- synthase COO- 
Malate NADH; | Oxaloacetate © AG” =~-7.5 Citrate 
Acetyl CoA +2H,O0 +FAD+3NAD* +GDP +P, \ 13) 
\ : Aconitase 
— 2CO,+FADH,+3NADH+3H*+ ee H,0 iN AG = 415 
GTP +HS—CoA sie ou : 
cs ee 
The TCA cycle is a critically important meta- oo 5 pairs of “He 
bolic pathway. If the position of the TCA cycle in the CH electrons (from Hc—coo- 
overall metabolism of the cell is considered I hydrogen atoms yf 
: ; ; HC of substrate) HOCH 
(FIGURE 5.8; see also Figure 3.22), the metabolites of io to be used in gf 
this cycle are found to be the same compounds COO ATP production COO 
generated by most of the cell's catabolic pathways. pai sae , fsocitiate 
Acetyl CoA, for example, is an important end / .  NADt ® 
product of a number of catabolic paths, including © ore ‘, ‘, lsocitrate 
the disassembly of fatty acids, which are degraded go, ee ae 5, re a a 
within the matrix of the mitochondrion into two- dehydrogenase ‘ se w 
carbon units (Figure 5.8a). These two-carbon aie FAD x H+ CO, 
compounds enter the TCA cycle as acetyl CoA. The tg oe 
catabolism of amino acids also generates meta- *OO= % MARE *c00- 
bolites of the TCA cycle (Figure 5.8b), which enter coe = “He-feh GDP +P, 20H, H+ NAD* HS-CoA 20H, 
the matrix by means of special transport systems in CH, xl x! 
the inner mitochondrial membrane. It is apparent CH © he = rik 
that all of a cell’s energy-providing macromolecules ile BuccinybOek ae HB ccketogitarate 4 =0 
(polysaccharides, fats, and proteins) are broken synthase S—CoA dehydrogenase — *CQQ- 
down into metabolites of the TCA cycle. Thus, the Succinate ae Succinyl-CoA —— o-Ketoglutarate 
mitochondrion becomes the focus for the final 


energy-conserving steps in metabolism regardless of the nature of 
the starting material. 


The Importance of Reduced Coenzymes 
in the Formation of ATP 


It is evident from the net equation of the TCA cycle that the primary 
products of the pathway are the reduced coenzymes FADH), and 
NADH, which contain the high-energy electrons removed from various 
substrates as they are oxidized. NADH was also one of the products 
of glycolysis (along with pyruvate). Mitochondria are not able to 


import the NADH formed in the cytosol during glycolysis. Instead, 
the electrons of cytosolic NADH are used to reduce a low-molecular- 
weight metabolite that can either (1) enter the mitochondrion (by a 
pathway called the malate-aspartate shuttle) and reduce NAD* to 
NADH, or (2) transfer its electrons to FAD (by a pathway called the 
glycerol phosphate shuttle, which is shown in FIGURE 5.9) to produce 
FADH,. Both mechanisms allow the electrons from cytosolic NADH 
to be fed into the mitochondrial electron-transport chain and used 
for ATP formation. 

Now that we have accounted for the formation of NADH and 
FADH, by both glycolysis and the TCA cycle, we can turn to the 
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FIGURE 5.8 Catabolic pathways generate compounds that are fed 
into the TCA cycle. (a) Oxidation of fatty acids. The first step in the 
oxidation of a fatty acid is its activation by linkage to the thiol (— SH) 
group of coenzyme A, which occurs after the fatty acyl group is 
transported across the inner mitochondrial membrane in association 
with a carrier protein (not shown). In the mitochondrion, the fatty acyl 
CoA molecule undergoes stepwise disassembly in which an acetyl CoA 
(shown in blue) is removed from the fatty acid chain with each round of 
the cycle. In addition to the molecule of acetyl CoA that is fed into the 
TCA cycle, each round of the fatty acid cycle produces one NADH and 
one FADH2z. It is apparent from examination of this series of reactions 
why fats are such a rich storehouse of chemical energy. (b) Input of 
amino acids into the TCA cycle. 
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FIGURE 5.9 The glycerol phosphate shuttle. In the glycerol phosphate 
shuttle, electrons are transferred from NADH to dihydroxyacetone 
phosphate (DHAP) to form glycerol 3-phosphate, which shuttles them into 
the mitochondrion. These electrons then reduce FAD at the inner mito- 
chondrial membrane, forming FADHz, which can transfer the electrons to a 
carrier of the electron-transport chain. 
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steps that utilize these reduced coenzymes to produce ATP. The 
overall process can be divided into two steps, which can be summarized 
as follows: 


Step 1 (FIGURE 5.10). High-energy electrons are passed from 
FADH, or NADH to the first of a series of electron carriers that 
make up the electron-transport chain, which is located in the inner 
mitochondrial membrane. Electrons pass along the electron-transport 
chain in energy-releasing reactions. These reactions are coupled to 
energy-requiring conformational changes in electron carriers that 
move protons outward across the inner mitochondrial membrane. 
As a result, the energy released during electron transport is stored in 
the form of an electrochemical gradient of protons across the mem- 
brane. Eventually, the low-energy electrons are transferred to the 
terminal electron acceptor, namely, molecular oxygen (O,), which 
becomes reduced to water. 


Step 2 (Figure 5.10). The controlled movement of protons back 
across the membrane through an ATP-synthesizing enzyme pro- 
vides the energy required to phosphorylate ADP to ATP. 

The importance of proton movements during electron trans- 
port and the formation of ATP was first proposed in 1961 by Peter 
Mitchell of the University of Edinburgh, who named it the chemios- 
motic mechanism. Further analysis of the two steps summarized 
above will occupy us for much of the remainder of the chapter. 
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FIGURE 5.10 A summary of the process of oxidative phosphorylation. 
In the first step of the process, substrates such as isocitrate and succinate 
are oxidized (Figure 5.7) and the electrons are transferred to the coenzymes 
NAD* or FAD to form NADH or FADH2. These high-energy electrons are 
then transferred through a series of electron carriers of the electron 
transport chain. The energy released is used to translocate protons from 
the matrix to the intermembrane space, establishing a proton electrochemical 
gradient across the inner mitochondrial membrane. In step 2, the protons 
move down the electrochemical gradient, through an ATP-synthesizing 
complex. The energy stored in the gradient is used to synthesize ATP. These 
two essential steps of oxidative phosphorylation form the basis of the 
chemiosmotic mechanism proposed by Peter Mitchell in 1961. 


Each pair of electrons transferred from NADH to oxygen by 177 


means of the electron-transport chain releases sufficient energy to 
drive the formation of approximately three molecules of ATP. Each 
pair donated by FADH, releases sufficient energy for the formation 
of approximately two molecules of ATP. If one adds up all of the 
ATPs formed from one molecule of glucose completely catabolized 
by means of glycolysis and the TCA cycle, the net gain is about 36 
ATPs (which includes the GTP formed by each round of the TCA 
cycle, step 16, Figure 5.7). The actual number of ATPs formed per 
molecule of glucose oxidized depends on the particular activities in 
which the cell is engaged. The relative importance of glycolysis ver- 
sus the TCA cycle, that is, of anaerobic versus aerobic oxidative 
metabolism, in human skeletal muscle function is discussed in the 
accompanying Human Perspective. 


REVIEW | 


1. How are the two products of glycolysis connected to 
the reactions of the TCA cycle? 

2. Why is the TCA cycle considered to be the central 
pathway in the energy metabolism of a cell? 

3. Describe the mechanism by which NADH produced in 
the cytosol by glycolysis is able to feed electrons into 


the TCA cycle. 
oqo —_—_—— 


5.3 THE HUMAN PERSPECTIVE 


The Role of Anaerobic and Aerobic Metabolism in Exercise 


Muscle contraction expends large amounts of energy in sliding 
actin- and myosin-containing filaments over one another, as dis- 
cussed in Chapter 9. The energy that drives muscle contraction 
is derived from ATP. The rate of ATP hydrolysis increases more 
than 100-fold in a skeletal muscle undergoing maximal contrac- 
tion compared to the same muscle at rest. It is estimated that 
the average human skeletal muscle has enough ATP available to 
fuel a 2- to 5-second burst of vigorous contraction. Even as ATP 
is hydrolyzed, it is important that additional ATP be produced; 
otherwise the ATP/ADP ratio would fall and with it the free energy 
available to fuel contraction. To solve this problem, muscle cells 
contain a store of creatine phosphate (CrP), one of the com- 
pounds with a phosphate transfer potential greater than that of 
ATP (see Figure 3.28) and thus can be utilized to generate ATP in 
the following reaction: 


CrP + ADP + Cr+ ATP 
Skeletal muscles typically have enough stored creatine phosphate 


to maintain elevated ATP levels for approximately 15 seconds. 
This consumption of creatine by muscle is the reason that body 


builders sometimes take creatine supplements. Because muscle 
cells have a limited supply of both ATP and creatine phosphate, 
it follows that intense or sustained muscular activity requires the 
formation of additional quantities of ATP, which must be obtained 
by oxidative metabolism. 

Human skeletal muscles consist of elongated, contractile 
cells known as fibers. Muscle fibers are of two general types 
(FIGURE 1): fast-twitch fibers, which can contract very rapidly (e.g., 
15-40 msec), and slow-twitch fibers, which contract more slowly 
(e.g., 40-100 msec). Fast-twitch fibers are seen under the electron 
microscope to be nearly devoid of mitochondria, indicating that 
these cells are unable to produce much ATP by aerobic respira- 
tion. Slow-twitch fibers, on the other hand, contain large numbers 
of mitochondria. These two types of skeletal muscle fibers are 
suited for different types of activities. For example, lifting weights 
or running sprints depends primarily on fast-twitch fibers, which 
are able to generate more force than their slow-twitch counter- 
parts. Fast-twitch fibers produce nearly all of their ATP anaer- 
obically as a result of glycolysis. Even though glycolysis only 
produces about 5 percent as much ATP per molecule of glucose 
oxidized compared with aerobic respiration, the reactions of 
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FIGURE 1. Skeletal muscles contain a mix of fast-twitch (or type II) 
fibers (darkly stained) and slow-twitch (or type |) fibers (lightly stained). 


Source: Courtesy of Duncan MacDougall. 


glycolysis occur much more rapidly than those of the TCA cycle 
and electron transport; thus, the rate of anaerobic ATP produc- 
tion is actually higher than that achieved by aerobic respiration. 
The problems with producing ATP by glycolysis are the rapid use 
of the fiber's available glucose (stored as glycogen) and the pro- 
duction of an undesirable end product, lactic acid. Let's consider 
this latter aspect further. 

Recall that the continuation of glycolysis requires the regen- 
eration of NAD*, which occurs by fermentation (page 108). Muscle 
cells regenerate NAD* by reducing pyruvate—the end product of 
glycolysis—to lactic acid. Most of the lactic acid diffuses out of 
the active muscle cells into the blood, where it is carried to the 
liver and converted back to glucose. Glucose produced in the liver 
is released into the blood, where it can be returned to the active 
muscles to continue to fuel the high levels of glycolysis. However, 
the formation of lactic acid is associated with a drop in pH within 
the muscle tissue (from about pH 7.00 to 6.35), which may pro- 
duce the pain and cramps that accompany vigorous exercise. 
Increased acidity, along with accumulation of phosphate and 
conformational changes in myosin protein, contributes to loss of 
contractile strength in muscles during prolonged exertion. 


If, instead of trying to use your muscles to lift weights or 
run sprints, you were to engage in an aerobic exercise, such as 
bicycling or fast walking, you would be able to continue to per- 
form the activity for much longer periods of time without feeling 
muscle pain or fatigue. Aerobic exercises, as their name implies, 
are designed to allow your muscles to continue to perform aer- 
obically, that is, to continue to produce the necessary ATP by 
electron transfer and oxidative phosphorylation. Aerobic exer- 
cises depend largely on the contraction of the slow-twitch fibers 
of the skeletal muscles. Although these fibers generate less force, 
they can continue to function for long periods of time due to 
the continuing aerobic production of ATP without production of 
lactic acid. 

Aerobic exercise is initially fueled by the glucose molecules 
stored as glycogen in the muscles themselves, but after a few 
minutes, the muscles depend more and more on free fatty acids 
released into the blood from adipose (fat) tissue. The longer the 
period of exercise, the greater the dependency on fatty acids. 
After 20 minutes of vigorous aerobic exercise, it is estimated that 
about 50 percent of the calories being consumed by the muscles 
are derived from fat. Aerobic exercise, such as jogging, fast walking, 
swimming, or bicycling, is one of the best ways of reducing the 
body's fat content. 

The ratio of fast-twitch to slow-twitch fibers varies from one 
particular muscle to another. For example, postural muscles in the 
back that allow a person to stand consist of a higher proportion 
of slow-twitch fibers than arm muscles used to throw or lift an 
object. The precise ratio of fast-twitch to slow-twitch fibers in a 
particular muscle is genetically determined and quite variable 
from person to person, allowing a particular individual to excel 
in certain types of physical activities. For example, world-class 
sprinters and weight lifters tend to have a higher proportion of 
fast-twitch fibers in their muscles than long-distance runners. In 
addition, training for sports such as weight lifting leads to a dis- 
proportionate enlargement of the fast-twitch fibers. 

The muscle tissue of the heart must also increase its level 
of activity during vigorous exercise, but unlike skeletal muscle 
tissue, the heart can only produce ATP by aerobic metabolism. 
In fact, approximately 40 percent of the cytoplasmic space of a 
human heart muscle cell is occupied by mitochondria. 


5.4 Oxidative Phosphorylation 
in the Formation of ATP 


Mitochondria are often described as miniature power plants. Like 
power plants, mitochondria extract energy from organic materials 
and store it, temporarily, in the form of electrical energy. More spe- 
cifically, the energy extracted from substrates is utilized to generate 
an ionic gradient across the inner mitochondrial membrane. An 
ionic gradient across a membrane represents a form of energy that 
can be tapped to perform work. We saw in Chapter 4 how intestinal 
cells utilize an ionic gradient across their plasma membrane to trans- 
port sugars and amino acids out of the intestinal lumen, just as nerve 
cells use a similar gradient to conduct neural impulses. The use of 
ionic gradients as a form of energy currency requires several compo- 
nents, including a system to generate the gradient, a membrane 
capable of maintaining the gradient, and the machinery to utilize the 
gradient to perform work. 


Mitochondria utilize an ionic gradient across their inner mem- 
brane to drive numerous energy-requiring activities, most notably 
the synthesis of ATP. When ATP formation is driven by energy that 
is released from electrons removed during substrate oxidation, the 
process is called oxidative phosphorylation and is summarized 
in Figure 5.10. Oxidative phosphorylation can be contrasted with 
substrate-level phosphorylation, as discussed on page 105, in which 
ATP is formed directly by transfer of a phosphate group from a sub- 
strate molecule to ADP. According to one estimate, oxidative phos- 
phorylation accounts for the production of more than 2x10”° 
molecules (>60 kg) of ATP in our bodies per day. Unraveling the 
basic mechanism of oxidative phosphorylation has been one of the 
crowning achievements in the field of cell and molecular biology; 
filling in the remaining gaps continues to be an active area of 
research. To understand the mechanism of oxidative phosphoryla- 
tion, it is first necessary to consider how substrate oxidation is able 
to release free energy. 


Oxidation—Reduction Potentials 


If one compares a variety of oxidizing agents, they can be ranked in a 
series according to their affinity for electrons: the greater the affinity, the 
stronger the oxidizing agent. Reducing agents can also be ranked 
according to their affinity for electrons: the lower the affinity (the more 
easily electrons are released), the stronger the reducing agent. To put 
this into quantifiable terms, reducing agents are ranked according to 
electron-transfer potential; those substances having a high electron- 
transfer potential, such as NADH, are strong reducing agents, whereas 
those with a low electron-transfer potential, such as H,O, are weak 
reducing agents. Oxidizing and reducing agents occur as couples, such as 
NAD* and NADH, which differ in their number of electrons. Strong 
reducing agents are coupled to weak oxidizing agents, and vice versa. 
For example, NAD* (of the NAD*-NADH couple) is a weak oxidizing 
agent, whereas O, (of the O,-H,O couple) is a strong oxidizing agent. 
Because the movement of electrons generates a separation of 
charge, the affinity of substances for electrons can be measured by 
instruments that detect voltage (FIGURE 5.11). What is measured for 
a given couple is an oxidation-reduction potential (or redox 
potential) relative to the potential for some standard couple. The 
standard couple has arbitrarily been chosen as hydrogen (H*—H)). 
As with free-energy changes, where the standard free-energy change, 
AG°, was used, a similar assignment is employed for redox couples. 
The standard redox potential, Eo, for a given couple is designated as 
the voltage produced by a half-cell (with only members of one cou- 
ple present) in which each member of the couple is present under 
standard conditions (as in Figure 5.11). Standard conditions are 
1.0M for solutes and ions and latm pressure for gases (e.g., Hy) at 25°C. 
The standard redox potential for the oxidation-reduction reaction 
involving hydrogen (2 H* +2 electrons +H ) is 0.00V. Table 5.1 
gives the redox potentials of some biologically important couples. 
The value for the hydrogen couple in the table is not 0.00, but —0.42 V. 
This number represents the value when the concentration of H* is 
10-’ M(pH of 7.0) rather than 1.0M(pH of 0.0), which would be of 


Voltmeter 


| KCl bridge | 


Reference half-cell Sample half-cell 


FIGURE 5.11 Measuring the standard oxidation-reduction (redox) 
potential. The sample half-cell contains the oxidized and reduced 
members of the couple, both present at 1M. The reference half-cell contains 
a 1M solution of H* that is in equilibrium with hydrogen gas at 1 atm 
pressure. An electric circuit is formed by connecting the half-cells through a 
voltmeter and a salt bridge. If electrons flow preferentially from the sample 
half-cell to the reference half-cell, then the standard redox potential (Eo) of 
the sample couple is negative; if the electron flow is in the opposite 
direction, the standard redox potential of the sample couple is positive. 
The salt bridge, which consists of a saturated KCI solution, provides a path 
for counter-ions to move between the half-cells and maintain electrical 
neutrality in the two compartments. 


little physiologic use. When calculated at pH 7, the standard redox 
potential is indicated by the symbol Eo rather than Ey. The assign- 
ment of sign (positive or negative) to couples is arbitrary and varies 
among different disciplines. We will consider the assignment in the 
following way. Those couples whose reducing agents are better 
donors of electrons are assigned more negative redox potentials. For 
example, the standard redox potential for the NAD*-NADH couple 
is —0.32 V (Table 5.1). Acetaldehyde is a stronger reducing agent than 
NADH, and the acetate-acetaldehyde couple has a standard redox 
potential of —0.58 V. Those couples whose oxidizing agents are better 
electron acceptors than NAD‘, that is, have greater affinity for elec- 
trons than NAD‘, have more positive redox potentials. 

Just as any other spontaneous reaction is accompanied by a loss 
in free energy, so too are oxidation-reduction reactions. The stand- 
ard free-energy change during a reaction of the type 


Avox) os Bored) = A (red) + Brox) 


can be calculated from the standard redox potentials of the two cou- 
ples involved in the reaction according to the equation 


AG” =-nF AE) 


where n is the number of electrons transferred in the reaction, F is the 
Faraday constant (23.063 kcal/V- mol), and Ejis the difference in 
volts between the standard redox potentials of the two couples. The 
greater the difference in standard redox potential between two cou- 
ples, the farther the reaction proceeds under standard conditions to 
the formation of products before an equilibrium state is reached. 
Let’s consider the reaction in which NADH, a strong reducing agent, 
is oxidized by molecular oxygen, a strong oxidizing agent. 


NADH++40, +H* +H,0+NAD+* 
The standard redox potentials of the two couples can be written as 


502 +2H* +2e° +H,0 Ej) =+0.82V 


NAD* +2H* +2e° ~NADH+H?* £9 =-0.32V 


TABLE 5.1 Standard Redox Potentials of Selected Half-Reactions 


Electrode equation 


Succinate + CO, +2H*t+2e = 
a-ketoglutarate + H,O 

Acetate + 2 H* +2 & = acetaldehyde 

DSI SNE, 

a-Ketoglutarate +CO,+2H*+2e = isocitrate 

Cystine + 2 H'+2e = 2 cysteine 

NAD‘ +2 H*t+2e¢ — NADH+H* 

NADP* +2 Ht +2 e* = NADPH+ H* 


Acetaldehyde + 2 H* + 2 e = ethanol 
Pyruvate + 2 H* +2 e = lactate 

Oxaloacetate + 2 H* + 2 € — malate 

FAD +2 H* +2 e = FADH,) (in flavoproteins) 
Fumarate + 2 H* +2 & = succinate 


Ubiquinone + 2 H* + 2 e = ubiquinol 

2 cytochrome b,x) + 2 & = 2 cytochrome byeqy 
2 cytochrome ¢(o,) + 2 & = 2 cytochrome C(rea) 

2 cytochrome 43(.,) +2 & = 2 cytochrome 43(;¢4) 
4.0) +2H*+2e°= HO 
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between the two Ej values (AE)): 
AE, = +0.82 V —(-0.32 V)=1.14 V 


which is a measure of the free energy released when NADH is oxi- 
dized by molecular oxygen under standard conditions. Substituting 
this value into the above equation, 


AG” =(-2)(23.063 kcal/V -mol) (1.14 V) 
=—52.6 kcal/mol of NADH oxidized 


the standard free-energy difference (AG) is —52.6 kcal/mol. As 
with other reactions, the actual AG values depend on the relative 
concentrations of reactants and products (oxidized and reduced ver- 
sions of the compounds) present in the cell at a given instant. 
Regardless, it would appear that the drop in free energy of a pair of 
electrons as they pass from NADH to molecular oxygen 
(AG® =—52.6 kcal/mol) should be sufficient to drive the formation 
of several molecules of ATP (AG°’=+7.3 kcal/mol) even under 
conditions in the cell, where ATP/ADP ratios are much higher than 
those at standard conditions. The transfer of this energy from NADH 
to ATP within the mitochondrion occurs in a series of small, energy- 
releasing steps, which will be the primary topic of discussion in the 
remainder of the chapter. 

Electrons are transferred to NAD* (or FAD) within the mito- 
chondrion from several substrates of the TCA cycle, namely, isocitrate, 
a-ketoglutarate, malate, and succinate (reactions 14, 15-16, 19, and 17 
of Figure 5.7, respectively). The first three of these intermediates have 
redox potentials of relatively high negative values (Table 5.1)— 
sufficiently high to transfer electrons to NAD* under conditions that 
prevail in the cell.” In contrast, the oxidation of succinate to fumarate, 
which has a more positive redox potential, proceeds by the reduction 
of FAD, a coenzyme of greater electron affinity than NAD. 


Electron Transport 


Five of the nine reactions illustrated in Figure 5.7 are catalyzed by 
dehydrogenases, enzymes that transfer pairs of electrons from sub- 
strates to coenzymes. Four of these reactions generate NADH and 
one produces FADH. The NADH molecules, which are formed in 
the mitochondrial matrix, dissociate from their respective dehydro- 
genases and bind to NADH dehydrogenase, an integral protein of 
the inner mitochondrial membrane (see Figure 5.17). Unlike the other 
enzymes of the TCA cycle, succinate dehydrogenase, the enzyme 
that catalyzes the formation of FADH) (Figure 5.7, reaction 17), is a 
component of the inner mitochondrial membrane. In either case, 
the high-energy electrons associated with NADH or FADH) are 
transferred through a series of specific electron carriers that constitute 
the electron-transport chain (or respiratory chain) of the inner 
mitochondrial membrane. 


?As indicated in Table 5.1, the standard redox potential (Eq) of the oxaloacetate- 
malate couple is more positive than that of the NAD*-NADH couple. Thus, the oxida- 
tion of malate to oxaloacetate has a positive AG” (reaction 19, Figure 5.7) and can 
proceed to the formation of oxaloacetate only when the ratio of products to reactants 
is kept below that of standard conditions. The AG of this reaction is kept negative by 
maintenance of very low oxaloacetate concentrations, which is possible because the 
next reaction in the cycle (reaction 12, Figure 5.7) is highly exergonic and is catalyzed 
by one of the major rate-controlling enzymes of the TCA cycle. 


Types of Electron Carriers 


The electron-transport chain contains five types of membrane- 
bound electron carriers: flavoproteins, cytochromes, copper atoms, 
ubiquinone, and iron-sulfur proteins. With the exception of ubiqui- 
none, all of the redox centers within the respiratory chain that accept 
and donate electrons are prosthetic groups, that is, non-amino acid 
components that are tightly associated with proteins. 


e Flavoproteins consist of a polypeptide bound tightly to one of 
two related prosthetic groups, either flavin adenine dinucleotide 
(FAD) or flavin mononucleotide (FMN) (FIGURE 5.12a). The 
prosthetic groups of flavoproteins are derived from riboflavin 
(vitamin B,), and each is capable of accepting and donating two 
protons and two electrons. The major flavoproteins of the 
mitochondria are NADH dehydrogenase of the electron-transport 
chain and succinate dehydrogenase of the TCA cycle. 


e Cytochromes are proteins that contain heme prosthetic groups 
(such as that described for myoglobin on page 56). The iron atom 
of a heme undergoes reversible transition between the Fe** and 
Fe?* oxidation states as a result of the acceptance and loss of a 
single electron (Figure 5.12b). There are three distinct cytochrome 
types—a, b, and c—present in the electron-transport chain, which 
differ from one another by substitutions within the heme group 
(indicated by the blue shaded portions of Figure 5.120). 


e Three copper atoms, all located within a single protein 
complex of the inner mitochondrial membrane (see Figure 5.20), 
accept and donate a single electron as they alternate between the 
Cu* and Cu’ states. 


e Ubiquinone (UQ, or coenzyme Q) is a lipid-soluble molecule 
containing a long hydrophobic chain composed of five-carbon 
isoprenoid units (Figure 5.12c). Like the flavoproteins, each 
ubiquinone is able to accept and donate two electrons and two 
protons. The partially reduced molecule is the free radical 
ubisemiquinone, and the fully reduced molecule is ubiquinol 
(UQH,). UQ/UQH, remains within the lipid bilayer of the 
membrane, where it is capable of rapid lateral diffusion. 


e lron-sulfur proteins are iron-containing proteins in which the 
iron atoms are not located within a heme group but instead are 
linked to inorganic sulfide ions as part of an iron-sulfur center. 
The most common centers contain either two or four atoms of 
iron and sulfur—designated [2Fe-2S] and [4Fe-4S]—linked to 
the protein at cysteine residues (FIGURE 5.13). Even though a 
single center may have several iron atoms, the entire complex is 
capable of accepting and donating only a single electron. The 
redox potential of an iron-sulfur center depends on the hydro- 
phobicity and charge of the amino acid residues that make up its 
local environment. As a group, iron-sulfur proteins have 
potentials ranging from about —700mV to about +300 mV, 
corresponding to a major portion of the span over which 
electron transport occurs. More than a dozen distinct iron-sulfur 
centers have been identified within mitochondria. 


The carriers of the electron-transport chain are arranged in 
order of increasingly positive redox potential (FIGURE 5.14). Each 
carrier is reduced by the gain of electrons from the preceding carrier 
in the chain and is subsequently oxidized by the loss of electrons to 
the carrier following it. Thus, electrons are passed from one carrier 
to the next, losing energy as they move “downhill” along the chain. 
The final acceptor of this electron “bucket brigade” is O,, which 
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group of cytochrome c, and (c) ubiquinone 
(coenzyme Q). The heme groups of the 
various cytochromes of the electron- 
transport chain differ in the substitutions 
on the porphyrin rings (indicated by the 
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FIGURE 5.14 The arrangement of several carriers in the electron- 
transport chain. The diagram illustrates the approximate redox potential of 
the carriers and the decline in free energy as electron pairs move along the 


respiratory chain to molecular oxygen. The numerous iron-sulfur centers are 


FIGURE 5.13 Iron-sulfur centers. Structure of a[2Fe-2S] (a) anda not indicated in this figure for the sake of simplicity. As discussed in the 
[4Fe-4S] (b) iron-sulfur center. Both types of iron-sulfur centers are joined following section, each of the three electron transfers that are denoted by 
to the protein by linkage to a sulfur atom (shown in orange) of a cysteine red arrows yields sufficient energy to move protons across the inner 
residue. Inorganic sulfide ions (S*-) are shown in yellow. Both types of mitochondrial membrane, which in turn provides the energy required to 
iron-sulfur centers accept only a single electron, whose charge is distributed generate ATP from ADP. 


among the various iron atoms. Source: Based on a drawing from A. L. Lehninger, Biochemistry, 2e, 1975. 


182 accepts the energy-depleted electrons and is reduced to water. The 
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specific sequence of carriers that constitute the electron-transport 
chain was worked out by Britton Chance and co-workers at the 
University of Pennsylvania using a variety of inhibitors that blocked 
electron transport at specific sites along the route. The concept of 
these experiments is depicted by analogy in FIGURE 5.15. In each 
case, an inhibitor was added to cells, and the oxidation state of the 
various electron carriers in the inhibited cells was determined. This 
determination can be made using a spectrophotometer that meas- 
ures the absorption of light of a sample at various wavelengths. This 
measurement reveals whether a particular carrier is in the oxidized 
or reduced state. In the case depicted in Figure 5.15, addition of rote- 
none blocked electron transport at a site that left NADH and FMNH, 
in the reduced state, and left cytochromes b, c,and a in the oxidized 
state. This result indicates that NAD and FMN are situated 
“upstream” of the block. In contrast, an inhibitor (e.g., antimycin A) 
that acts between cytochromes b and c would leave NADH, FMNH, 
QH,, and cytochrome b in the reduced state. Thus, by identifying 
reduced and oxidized components in the presence of different inhib- 
itors, the sequences of the carriers could be determined. 

The tendency for electrons to be transferred from one carrier to 
the next depends on the potential difference between the two redox 
centers, but the rate of transfer depends on the catalytic activities of 
the proteins involved. This distinction between thermodynamics 
and kinetics is similar to that discussed on page 90 regarding the 
activity of enzymes. Studies indicate that electrons may travel con- 
siderable distances (10-20A) between adjacent redox centers and 
that electrons probably flow through special “tunneling pathways” 
consisting of a series of covalent and hydrogen bonds that stretch 


Point of inhibition 


Percent each electron carrier is reduced 


Rotenone block 


FIGURE 5.15 Experimental use of inhibitors to determine the 
sequence of carriers in the electron-transport chain. In this hydraulic 
analogy, treatment of mitochondria with the inhibitor rotenone leaves those 
carriers on the upstream (NADH) side of the point of inhibition in the fully 
reduced state and those carriers on the downstream (Oz) side of inhibition 
in the fully oxidized state. Comparison of the effects of several inhibitors 
revealed the order of the carriers within the chain. 


Source: Based on a drawing from A. L. Lehninger, Biochemistry, 2e, 1975. 


FIGURE 5.16 Electron-tunneling pathways for the yeast cytochrome c— 
cytochrome c peroxidase complex. The heme group of cytochrome c is 
blue, and that of cytochrome c peroxidase (which is not a carrier of the 
mitochondrial electron-transport chain but provides an analogous electron 
acceptor for which high-resolution crystal structures are available) is red. 
Several defined pathways (yellow) exist for the movement of electrons from 
one heme to the other. Each of the pathways carries the electrons through 
several amino acid residues situated between the heme groups. (It can be 
noted that other mechanisms of electron transfer have been proposed.) 
Source: By Jeffrey J. Regan and J. N. Onuchic, from David N. Beratan et al., 
Science 258:1741, © 1992. Reprinted with permission from AAAS. 


across parts of several amino acid residues. An example of such a 
pathway involving cytochrome c is shown in FIGURE 5.16. 


REVIEW | 


1. Describe the steps by which the transport of electrons 
down the respiratory chain leads to the formation of a 
proton gradient. 


2. Of the five types of electron carriers, which has the 
smallest molecular mass? Which has the greatest ratio 
of iron atoms to electrons carried? Which has a 
component that is located outside the lipid bilayer? 
Which are capable of accepting protons and electrons 
and which only electrons? 


3. Describe the relationship between a compound's 
affinity for electrons and its ability to act as a reducing 
agent. What is the relationship between the electron- 
transfer potential of a reducing agent and the ability 
of the other member of its couple to act as an 
oxidizing agent? 

4. Look at Figure 5.12 and describe how the semiquinone 


states of ubiquinone and FMN are similar. 
e 


5.5 Electron-Transport Complexes 


When the inner mitochondrial membrane is disrupted by detergent, 
the various electron carriers can be isolated as part of four distinct, 
asymmetric, membrane-spanning complexes, identified as complexes 
I, IL, I, and IV (FIGURE 5.17). Each of these four complexes can 
be assigned a distinct function in the overall oxidation pathway. 
Two components of the electron-transport chain, cytochrome c and 
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FIGURE 5.17 The electron-transport chain of the inner mitochondrial membrane. The respiratory chain consists of four complexes of electron carriers 
and two other carriers (ubiquinone and cytochrome c) that are independently disposed. Electrons enter the chain from either NADH (via complex |) or FADH2 
(a part of complex II). Electrons are passed from either complex | or Il to ubiquinone (UQ), which exists as a pool within the lipid bilayer. Electrons are 
subsequently passed from reduced ubiquinone (ubiquinol) to complex Ill and then to the peripheral protein cytochrome c, which is thought to be mobile. 
Electrons are transferred from cytochrome c to complex IV (cytochrome oxidase) and then to Op to form H,O. The sites of proton translocation from the matrix 
to the cytosol are indicated. The precise number of protons translocated at each site remains controversial; the number indicated is a general consensus. 
Keep in mind that the proton numbers shown are those generated by each pair of electrons transported, which is sufficient to reduce only one-half of an Oz 
molecule. (The translocation of protons by complex II occurs by way of a O cycle, which is a series of sequential interconversions of ubiquinone and ubiquinol 
coupled to movement of protons across the membrane. The O cycle can be divided into two steps, each of which leads to the release of two protons into the 


cytosol.) 


ubiquinone, are not part of any of the four complexes. Ubiquinone 
occurs as a pool of molecules dissolved in the lipid bilayer, and 
cytochrome c is a soluble protein in the intermembrane space. 
Ubiquinone and cytochrome c are thought to move within or along 
the membrane, shuttling electrons between the large, relatively immo- 
bile protein complexes. Once within one of the large, multiprotein 
complexes, electrons travel along defined pathways (of the type illus- 
trated in Figure 5.16) between adjacent redox centers whose relative 
positions are fixed in place. 

When NADH is the electron donor, electrons enter the respiratory 
chain by means of complex I, which transfers electrons to ubiquinone, 
generating ubiquinol (Figures 5.12 and 5.17). Unlike NADH, which 
can diffuse away from its soluble dehydrogenases, FADH, remains 
covalently bound to succinate dehydrogenase, a component of 
complex II. When FADH,j is the donor, electrons are passed directly 
to ubiquinone, bypassing the upstream end of the chain, which has 
too negative a redox potential to accept the less energetic electrons of 
the flavin nucleotide (Figure 5.14). 

If one examines the redox potentials of successive carriers in 
Figure 5.14, it is apparent that there are three places in which the 
transfer of electrons is accompanied by a major release of free energy. 
Each of these coupling sites, as they are called, occurs between carri- 
ers that are part of one of the three complexes, I, III, and IV. The free 
energy, which is released as electrons are passed across these three 
sites, is conserved by translocation of protons from the matrix across 


the inner membrane into the intermembrane space. These three 
protein complexes are often described as proton pumps. Unlike other 
ions (e.g., Na* or Cl’) that have to diffuse by themselves across the 
entire distance being traversed, H* ions can “hop” through a channel 
by exchanging themselves with other protons present along the 
pathway (as illustrated in Figure 5.19b). Such proton-conduction 
pathways (or “proton wires”) can be identified because they consist 
of strings of acidic residues, hydrogen-bonded residues, and/or 
trapped water molecules. The translocation of protons by these three 
electron-transporting complexes establishes the proton gradient that 
drives ATP synthesis. The ability of these remarkable molecular 
machines to act as independent proton-translocating units can be 
demonstrated by purifying each of them and incorporating them 
individually into artificial lipid vesicles. When given an appropriate 
electron donor, these protein-containing vesicles are capable of 
accepting electrons and using the released energy to pump protons 
across the vesicle membrane (see FIGURE 5.18). 

Great strides have been made in the past few years in elucidat- 
ing the molecular architecture of all the protein complexes of the 
inner membrane. Researchers are no longer asking what these pro- 
teins look like but are using the wealth of new structural information 
to understand how they work. We will briefly examine the mamma- 
lian versions of each of the four electron-transport complexes, which 
together contain about 70 different polypeptides. The bacterial ver- 
sions are considerably simpler than their mammalian counterparts 
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plex rather than electron transport. In other words, the basic process 
pH electrode of electron transport during respiration has remained virtually 
| unchanged since its evolution billions of years ago in our prokary- 


4Ht Cytochrome oxidase otic ancestors. 


2H,0 


Complex | (NADH Dehydrogenase) 


Complex I is the gateway to the electron-transport chain, catalyzing 
the transfer of a pair of electrons from NADH to ubiquinone (UQ) 
to form ubiquinol (UQH,). The mammalian version of complex I is 
a huge L-shaped conglomerate, containing 45 different subunits that 
lipesorta constitute a molecular mas mass of nearly one million daltons. As 
membrane indicated in Figure 5.19a, roughly half of the complex consists of a 
hydrophilic domain that projects into the matrix, with the remain- 
ing hydrophobic portion of the complex embedded in the mem- 
brane. Seven of the subunits—all hydrophobic, membrane-spanning 
polypeptides—are encoded by mitochondrial genes and are homo- 
FIGURE 5.18 Experimental demonstration that cytochrome oxidase is logues of bacterial polypeptides. Together, the hydrophilic and 
a proton pump. When purified cytochrome oxidase is incorporated into membrane-embedded portions of the complex carry out the two dif- 
the artificial bilayer of a lipoosome, the medium becomes acidified following ferent activities that are required of the electron-transport chain: 
the addition of reduced cytochrome c. This indicates that as electrons are electron transfer and proton translocation. 
transferred from cytochrome c to cytochrome oxidase, and Oz is being All of the components required for electron transfer reside 


reduced to water, protons are being translocated from the compartment sais ee ; 
within the vesicle to the external medium. This experiment was originally within the hydrophilic portion of the complex (FIGURE 5.198). The 


carried out in the late 1960s by Marten Wikstrém and colleagues at the 
University of Helsinki in Finland. 


Cytochrome c 


Inside aqueous phase 


Source: Reprinted with permission from Macmillan Publishers Ltd: M. |. 


Verkhovsky et al., Nature 400:481, 1999, copyright 1999. —_ 
e 
3 
and contain far fewer subunits. However, the additional subunits of ° 
the mammalian complexes do not contain redox centers and are Oxidized Hs 
ae ; HL 
thought to function in either the regulation or assembly of the com- 
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FIGURE 5.19 Structure and a proposed mechanism of action of complex | of the respiratory chain. (a) Crystal structure of a bacterial form of complex I. 
The peripheral hydrophilic domain and the membrane-embedded hydrophobic domain are evident. The electron carriers within the peripheral arm— 
namely, FMN (magenta sphere), the seven Fe-S centers (red spheres), and the quinone binding site (Q)—are indicated. Helix HL of the membrane domain, 
which is proposed to act as a connecting rod or piston, is seen running roughly parallel to the plane of the membrane. (b) Proposed mechanism by which 
electron transfer in the peripheral hydrophilic arm is coupled by conformational changes to proton translocation across the membrane domain. Reduction of 
ubiquinone at the QO site leads to a rightward movement of helix HL (indicated by the upper black arrow in the reduced version), which causes conforma- 
tional changes in the transmembrane helices, leading to the movement of protons across the membrane. Crucial charged residues in subunits of the 
membrane domain are indicated by circles. Red circles represent unprotonated glutamic acid residues, and blue circles represent protonated lysine residues. 
The protonated and unprotonated forms of these residues are seen as empty circles, respectively. The movement of protons between these residues, which 
is driven by conformational changes in the transmembrane helices, forms the basis for proton translocation. 


Source: From Rouslan G. Efremov and Leonid A. Sazanov, Nature 476:414, 2011; reprinted by permission from Macmillan Publishers Ltd. 


chain of electron carriers begins with an FMN-containing flavopro- 
tein that oxidizes NADH at the tip of the giant complex. Electrons 
are subsequently transferred through seven distinct iron-sulfur 
centers within the body of the hydrophilic domain, ending with the 
transfer of electrons to a protein-bound ubiquinone molecule situated 
near the membrane boundary. (An eighth Fe-S cluster and possibly 
a second quinone molecule are also present but not apparently in the 
direct path of electron transfer.) The passage of a pair of electrons 
from NADH to ubiquinone is thought to be accompanied by the 
movement of four protons from the matrix into the intermembrane 
space. The determination of the structure of complex I (Figure 
5.19a) has led to a proposal that explains how electron transport 
through the peripheral arm of the complex can drive proton translo- 
cation through the distant membrane-embedded portion. It can be 
seen in Figure 5.19b that the membrane-embedded domain of the 
complex contains a very long a helix, denoted HL (shown in purple), 
that is oriented roughly parallel to the membrane’s surface. This 
helix, which runs nearly the entire length of the membrane domain, 
is in direct contact with three discontinuous transmembrane helices 
(shown in purple, blue, and olive green). Each of the discontinuous 
helices is part of a half channel. Each pair of half channels is con- 
nected by a network of polar amino acid side chains to provide a 
potential whole channel for the translocation of protons across the 
membrane. A fourth proton-conducting channel is probably located 
near the interface with the soluble domain as shown in Figure 5.19b. 
It is proposed that the movement of electrons to ubiquinone induces 
a conformational change in the complex that causes the lateral 
movement of helix HL (toward the right in Figure 5.19b). This in 
turn leads to the tilting movement of the transmembrane helices, 
which changes the ionic environment of certain proton-transferring 
residues, leading to the movement of protons across the membrane 
(black downward arrows). The authors of this proposal compare the 
mechanism of action of complex I to that of a steam engine, with 
helix HL acting as the piston that drives the movement of the down- 
stream elements of the machinery. 

It can be noted that complex I dysfunction has been linked to 
certain neurodegenerative disorders, as discussed in the Human 
Perspective on page 195. 


Complex II (succinate dehydrogenase) 


Complex II consists of four polypeptides: two hydrophobic subunits 
that anchor the protein in the membrane and two hydrophilic subu- 
nits that comprise the TCA cycle enzyme, succinate dehydrogenase. 
Complex II provides a pathway for feeding lower-energy electrons 
(ones close to0mV) from succinate to FAD to ubiquinone (Figures 5.14 
and 5.17). The path from FADH, at the enzyme catalytic site to 
ubiquinone takes the electrons over a distance of 40 A through three 
iron-sulfur clusters. Complex II also contains a heme group, which is 
thought to attract escaped electrons, thereby preventing the forma- 
tion of destructive superoxide radicals (page 34). Electron transfer 
through complex II is not accompanied by proton translocation. 


Complex III (cytochrome bc,) 


Complex III catalyzes the transfer of electrons from ubiquinol to 
cytochrome c. Experimental determinations suggest that four pro- 
tons are translocated across the membrane for every pair of electrons 
transferred through complex III. The protons are released into the 


intermembrane space in two separate steps powered by the energy 185 


released as a pair of electrons are separated from one another and 
passed along different pathways through the complex. Two protons 
are derived from the molecule of ubiquinol that entered the complex. 
Two additional protons are removed from the matrix and translo- 
cated across the membrane as part of a second molecule of ubiqui- 
nol. Three of the subunits of complex II contain redox groups: 
cytochrome b contains two heme b molecules with different redox 
potentials, cytochrome c,, and an iron-sulfur protein. Cytochrome b is 
the only polypeptide of the complex encoded by a mitochondrial gene. 


Complex IV (cytochrome c oxidase) 


The final step of electron transport in a mitochondrion is the succes- 
sive transfer of electrons from reduced cytochrome c to oxygen 
according to the reaction 


2 cyt c** +2 Ht +40, 2 cyt 3+ +H,0 


To reduce a whole molecule of O,, 
4 cyt c?*+4H* +0, >4 cyt c?+2H,0 


The reduction of O, is catalyzed by complex IV, a huge assembly of 
polypeptides referred to as cytochrome oxidase. Cytochrome 
oxidase was the first component of the electron-transport chain 
shown to act as a redox-driven proton pump. This was demonstrated 
in the experiment depicted in Figure 5.18 in which the purified 
enzyme was incorporated into vesicles containing an artificial lipid 
bilayer (liposomes). Addition of reduced cytochrome c to the medium 
was accompanied by the expulsion of H* ions from the vesicles, which 
was measured as a drop in the surrounding pH. Whether inside a 
liposome or the inner mitochondrial membrane, translocation of 
protons is coupled to conformational changes generated by the 
release of energy that accompanies the transfer of electrons. For 
every molecule of O, reduced by cytochrome oxidase, eight protons 
are thought to be taken up from the matrix. Four of these protons are 
consumed in the formation of two molecules of water as indicated 
above, and the other four protons are translocated across the mem- 
brane and released into the intermembrane space (Figure 5.17). 
Consequently, the overall reaction can be written: 


4 cyt c?* +8 Hinatrix +O2 > 4 cyt ct +2 H,0+ 4 Hoist 


Humans produce about 300 ml of “metabolic water” from this reac- 
tion per day. It can be noted that a number of potent respiratory 
poisons, including carbon monoxide (CO), azide (N3), and cyanide 
(CN), have their toxic effect by binding to the heme a, site of 
cytochrome oxidase. (Carbon monoxide also binds to the heme 
group of hemoglobin.) 

Cytochrome oxidase consists of 13 subunits, three of which are 
encoded by the mitochondrial genome and contain all four of the 
protein’s redox centers. A major challenge for investigators is to 
explain how carriers that are only able to transfer single electrons 
can reduce a molecule of O, to two molecules of H,O, a process that 
requires four electrons (together with four protons). Most impor- 
tantly, the process must occur very efficiently because the cell is 
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partially reduced oxygen species has the potential to damage virtu- 
ally every macromolecule in the cell (page 34). 

The movement of electrons between redox centers of cytochrome 
oxidase is shown in FIGURE 5.20. Electrons are transferred one at a 
time from cytochrome c through a bimetallic copper center (Cua) of 
subunit II to a heme (heme a) of subunit I. From there, electrons are 
passed to a redox center located in subunit I that contains a second 
heme (heme a3) and another copper atom (Cug) situated less than 
5A apart. Acceptance of the first two electrons reduces the a;-Cug 
binuclear center according to the reaction 


3+ 2+ - 2+ i+ 
Fer +Cug’ +2e > Fe +Cug 


Once the binuclear center accepts its second electron, an O, mole- 
cule binds to the center. The O==O double bond is then broken as 
the O atoms accept a pair of electrons from the reduced a3-Cug binu- 
clear center, likely forming a reactive peroxy anion O7. 


Fe**—O — Fet+—O- 


OCul* O7 Cu?* 


In subsequent steps, the binuclear center accepts two more electrons 
from cytochrome c and four protons from the matrix, which leads 
to the formation of two molecules of water. For each proton removed 
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FIGURE 5.20 A summary of cytochrome oxidase activity. A model 
showing the flow of electrons through the four redox centers of cytochrome 
oxidase. Iron atoms are shown as red spheres, copper atoms as yellow 
spheres. Electrons are thought to be passed one at a time from cytochrome c 
to the dimeric copper center (Cua), then on to the iron of the heme group 
of cytochrome a, then on to the binuclear redox center consisting of a 
second iron (of the heme group of cytochrome a3) and copper ion (Cug). 
The structures and suggested orientations of the redox centers are indicated. 
Source: Reprinted From M. Wikstrom et al., Biochim Biophysics. ACTA 
1459:515, 2000, with permission from Elsevier.) 


from the matrix, an excess negative charge (in the form of an OH ) 
is left behind, thus contributing directly to the electrochemical gra- 
dient across the inner mitochondrial membrane. 

As noted above, protons taken up from the matrix are utilized in 
two very different ways. For every molecule of O, reduced to 2 H,O 
by cytochrome oxidase: (1) four H* ions are consumed in this chem- 
ical reaction and (2) four additional H* ions are translocated across 
the inner mitochondrial membrane. The first four protons can be 
referred to as “substrate” protons and the latter four as “pumped” 
protons. Movement of both groups of protons contributes to the 
electrochemical gradient across the membrane. 

With the publication of the three-dimensional crystal structure 
of cytochrome oxidase, it became possible to search for pathways 
through which substrate and pumped protons might move through 
the huge protein. Researchers have identified potential proton con- 
duits within the molecule, but their role has not been experimentally 
established. Unfortunately, static structural models, by themselves, 
cannot reveal the dynamic movements that take place within the 
protein as it functions. The energy released by O, reduction is pre- 
sumably used to drive conformational changes that alter the ioniza- 
tion states and precise locations of amino acid side chains within 
these channels. These changes would, in turn, promote the move- 
ment of H* ions through the protein, as occurs during electron 
transport in complex I (Figure 5.190). 


REVIEW | 
1. What is meant by the binuclear center of cytochrome 
oxidase? How does it function in the reduction of O,? 


2. What are two different ways that cytochrome oxidase 
contributes to the proton gradient? 


3. Why do some electron transfers result in a greater 
release of energy than other transfers? 


e—____! 


5.6 Establishment of a 
Proton-Motive Force 


We have seen that the free energy released during electron transport 
is utilized to move protons from the matrix to the intermembrane 
space and cytosol. Translocation of protons across the inner mem- 
brane is electrogenic (i.e., voltage producing) because it results in a 
greater number of positive charges in the intermembrane space and 
cytosol and a greater number of negative charges within the matrix. 
Thus, there are two components of the proton gradient to consider. 
One of the components is the concentration difference between 
hydrogen ions on one side of the membrane versus the other side; 
this is a pH gradient (ApH). The other component is the voltage (c) 
that results from the separation of charge across the membrane. 
A gradient that has both a concentration (chemical) and an electrical 
(voltage) component is an electrochemical gradient (page 140). The 
energy present in both components of the proton electrochemical 
gradient can be combined and expressed as the proton-motive 
force (Ap) , which is measured in millivolts. Thus, 


Ap =w —2.3 (RT/F) ApH 


Because 2.3 RT/F is equal to at 59 mV 25°C, the equation? can be 
rewritten as 


Ap =y-59 ApH 


The contribution to the proton-motive force made by the electric 
potential versus the pH gradient depends on the permeability prop- 
erties of the inner membrane. For example, if the outward movement 
of protons during electron transport is accompanied by negatively 
charged chloride ions, then the electric potential (yw) is reduced with- 
out affecting the proton gradient ApH. Measurements made in vari- 
ous laboratories suggest that actively respiring mitochondria generate 
a proton-motive force of approximately 220 mV across their inner 
membrane. In mammalian mitochondria, approximately 80 percent 
of the free energy of Ap is represented by the voltage component and 
the other 20 percent by the proton concentration difference (approxi- 
mately 0.5 tol pH unit difference). If the proton concentration differ- 
ence were much greater than this it would likely affect the activity of 
cytoplasmic enzymes. The transmembrane voltage across the inner 
mitochondrial membrane can be visualized by the use of positively 
charged, lipid-soluble dyes that distribute themselves across mem- 
branes in proportion to the electric potential (FIGURE 5.21). 

The proton-motive force describes the total energy that is 
released when a proton moves down the gradient, expressed as an 
electrical potential in units of volts. It is easy to convert a proton- 
motive force back into the more familiar units of free energy. To do 
this, recall from page 140 that the energy to move an ion through a 
potential of voltage Ap is 


AG = zFAp 


but for a proton, z=1 since it has a single positive charge. So if we 
know the proton-motive force Ap, we can always find the free energy 


AG = FAp 
where F = 23.06 kcal/mol» V 


3In other words, a difference of one pH unit, which represents a tenfold difference in 
H* concentration across the membrane, is equivalent to a potential difference of 59 
millivolts, which is equivalent to a free-energy difference of 1.37 kcal/mol. 


FIGURE 5.21 Visualizing the proton-motive force. Fluorescence 
micrograph of a cultured cell stained with a fluorescent dye known as JC-1, 
whose fluorescence emission changes from green to orange when the 
membrane potential of the mitochondrion increases. 


Source: lvan S. Chernoivanenko et al., FASEB J. 29:820-7, 2015. Panel B, 
Fig.4 


When cells are treated with certain lipid-soluble agents, most 187 


notably 2,4-dinitrophenol (DNP), they continue to oxidize sub- 
strates without being able to generate ATP. In other words, DNP 
uncouples glucose oxidation and ADP phosphorylation. Because 
of this property, DNP has become widely used in the laboratory 
to inhibit ATP formation. During the 1920s, a few physicians 
even prescribed DNP as a diet pill. When exposed to the drug, 
the cells of obese patients continued to oxidize their stores of fat 
in a vain attempt to maintain normal ATP levels. The practice 
ceased with the deaths of a number of patients taking the drug. 
With the formulation of the chemiosmotic theory and the deter- 
mination that mitochondria generate a proton gradient, the 
mechanism of DNP action became understood. This drug can 
uncouple oxidation and phosphorylation because it combines 
with protons and, because of its lipid solubility, carries the pro- 
tons across the inner mitochondrial membrane down their 
electrochemical gradient. 

The maintenance of a proton-motive force requires that the inner 
mitochondrial membrane remain highly impermeable to protons. If 
not, the gradient established by electron transport is quickly dissipated 
by the movement of protons back into the matrix, leading to the release 
of energy as heat. It came as a surprise to discover that the inner mito- 
chondrial membrane of certain cells contains proton-transporting 
proteins, called uncoupling proteins (or UCPs), that act as natural 
(endogenous) uncouplers. One of these proteins, UCP1, is espe- 
cially abundant in brown adipose tissue of mammals, which func- 
tions as a source of heat production during exposure to cold 
temperatures. Human infants also rely on “burning” brown fat depos- 
its to maintain body temperature. It was thought until recently that 
adult humans were largely devoid of brown adipose tissue, but several 
recent studies have shown the presence of brown fat cells in the neck 
and upper chest. As in infants, these cells become activated when we 
are exposed to cold temperatures. Other UCP isoforms (UCP2-UCP5) 
are present in a variety of tissues, especially in cells of the nervous sys- 
tem, but their roles are debated. For example, UCPs in the inner mito- 
chondrial membrane are thought to prevent the build-up of an 
excessively large proton-motive force. If such a high-energy state were 
to form, it could block the passage of electrons through the respiratory 
complexes, leading to the leakage of electrons and the production of 
reactive oxygen radicals. 


REVIEW | 


1. What are the two components of the proton-motive 
force, and how can their relative contribution vary from 
one cell to another? 


2. What is the effect of dinitrophenol on ATP formation 
by mitochondria? Why is this the case? 


oo —____—. 


5.7 The Structure of ATP Synthase 


Now that we have seen how the transport of electrons generates a 
proton electrochemical gradient across the inner mitochondrial 
membrane, we can turn to the molecular machinery that utilizes 
the energy stored in this gradient to drive the phosphorylation 
of ADP. 
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During the early 1960s, Humberto Fernandez-Moran of 
Massachusetts General Hospital was examining isolated mitochon- 
dria by the recently developed technique of negative staining. 
Fernandez-Moran discovered a layer of spheres attached to the inner 
(matrix) side of the inner membrane, projecting from the membrane 
and attached to it by stalks (FIGURE 5.22). A few years later, Efraim 
Racker of Cornell University isolated the inner membrane spheres, 
which he called coupling factor 1, or simply F,. Racker discovered 
that the F, spheres behaved like an enzyme that hydrolyzed ATP, that 
is, an ATPase. At first glance this seems to be a peculiar finding. Why 
should mitochondria possess a predominant enzyme that hydrolyzes 
the substance it is supposed to produce? 

If one considers that the hydrolysis of ATP is the reverse reac- 
tion from its formation, the function of the F; sphere becomes more 
evident; it contains the catalytic site at which ATP formation nor- 
mally occurs. Recall that 


1. Enzymes do not affect the equilibrium constant of the reaction 
they catalyze. 


2. Enzymes are capable of catalyzing both the forward and reverse 
reactions. 


Consequently, the direction of an enzyme-catalyzed reaction 
at any given time depends on the prevailing conditions. This is 
nicely shown in experiments with other ATPases, such as the 
Na‘*/K*t-ATPase of the plasma membrane (page 153). When this 
enzyme was discussed in Chapter 4, it was described as an enzyme 
that used the energy obtained from ATP hydrolysis to export Na+ and 
import K* against their respective gradients. In the cell this is the only 
function of the enzyme. However, under experimental conditions, 
this enzyme can catalyze the formation of ATP rather than its hydroly- 
sis (FIGURE 5.23). To obtain such conditions, red blood cell ghosts 
(page 137) are prepared with a very high internal Kt concentration 
and a very high external Na* concentration, higher than normally 
exists in the body. Under these conditions, K* moves out of the “cell” 
and Na* moves into the “cell? Both ions are moving down their 


FIGURE 5.22 The machinery for ATP synthesis. Three dimensional 
electron micrograph of isolated mitochondrial membranes showing 
spherical particles attached by thin stalks. 


Source: From Davies KM, et al., 2012 PNAS 109: 13602-13607. 


Nat/K*t-ATPase 


FIGURE 5.23 An experiment to drive ATP formation in membrane 
vesicles reconstituted with the Na*/K*-ATPase. By causing these vesicles 
to have very high internal K* concentration and very high external Na* 
concentration, the reaction is made to run in the opposite direction from 
that of its normal course in the plasma membrane. In the process, ATP is 
formed from ADP and P. The size of the letters indicates the direction of 
the concentration gradients. 


respective gradients, rather than against them as would normally 
occur in a living cell. If ADP and P, are present within the ghost, the 
movement of the ions causes ATP to be synthesized rather than 
hydrolyzed. Experiments such as this illustrate the reality of what 
would be expected on the basis of the theoretical reversibility of 
enzyme-catalyzed reactions. They also illustrate how an ionic gradient 
can be used to drive a reaction in which ADP is phosphorylated to 
ATP, which is precisely what occurs in the mitochondrion. The driv- 
ing force is the proton-motive force established by electron transport. 

Although the F, sphere is the catalytic portion of the enzyme that 
manufactures ATP in the mitochondrion, it is not the whole story. 
The ATP-synthesizing enzyme, which is called ATP synthase, is a 
mushroom-shaped protein complex (FIGURE 5.24a) composed of 
two principal components: a spherical F, head (about 90 A diameter) 
and a basal section, called F,, embedded in the inner membrane. The 
“o” in F, is not a zero, but the letter “o’, which stands for “oligomycin,” 
a toxin that works by binding to the F, subunit. High-resolution elec- 
tron micrographs indicate that the two portions are connected by 
both a central and a peripheral stalk as shown in Figure 5.24. A typi- 
cal mammalian liver mitochondrion has roughly 15,000 copies of the 
ATP synthase. Homologous versions of the ATP synthase are found 
in the plasma membrane of bacteria, the thylakoid membrane of 
plant chloroplasts, and the inner membrane of mitochondria. 

The F, portions of the bacterial and mitochondrial ATP syn- 
thases are highly conserved; both contain five different polypeptides 
with a composition of a,8, dye. The a and B subunits are arranged 
alternately within the F, head in a manner resembling the segments 
of an orange (Figure 5.244; see also 5.26b). Two aspects can be noted 
for later discussion: (1) Each F, contains three catalytic sites for ATP 
synthesis, one on each f subunit, and (2) the Y subunit runs from the 
outer tip of the F, head through the central stalk and makes contact 
with the F, basepiece. In the mitochondrial enzyme, all five polypep- 
tides of F, are encoded by the nuclear DNA, synthesized in the cyto- 
sol, and then imported (see Figure 8.47). 

The F, portion of the ATP synthase resides within the membrane 
and consists of three different polypeptides with a stoichiometry of 
ab3c 9 in E. coli (Figure 5.24b). The number of subunits in the c ring 
can vary depending on the source of the enzyme. Both the yeast 
mitochondrial and E. coli ATP synthase, for example, have 10 
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(a) 


FIGURE 5.24 The structure of the bacterial ATP synthase. (a) Schematic diagram of the ATP synthase from E. coli. The enzyme consists of two major 
portions, called F, and F,. The F; head consists of five different subunits in the ratio 3a : 3B : 15: ly : le. The @ and B subunits are organized in a circular array to 
form the spherical head of the particle; the y subunit runs through the core of the ATP synthase from the tip of F, down to F, to form a central stalk; the ¢ 
subunit helps attach the y subunit to the Fo base. The F. base, which is embedded in the bacterial plasma membrane, consists of three different subunits in 
the ratio 1a: 2b: 10c. As discussed later, the c subunits form a rotating ring within the membrane; the paired b subunits of the Fo base and the & subunit of 
the F, head form a peripheral stalk that holds the a/B subunits in a fixed position; and the a subunit contains the proton channel that allows protons to 
traverse the membrane. The mammalian enzyme contains seven to nine small additional subunits whose functions are not well established. (b) Three- 
dimensional structure of the bacterial ATP synthase. This image is composed of several partial structures of the enzyme from various organisms. 


Source: From Wolfgang Junge and Nathan Nelson, Science 308:643, 2005; © 2005, reprinted with permission from AAAS. 


c subunits, whereas a chloroplast enzyme has 14 c subunits 
(FIGURE 5.25) and, surprisingly, the mammalian enzyme has only 8c 
subunits. The F, base contains a channel through which protons are 
conducted from the intermembrane space to the matrix. 


REVIEW | 


1. Describe the basic structure of the ATP synthase. 
| 


5.8 The Binding Change 
Mechanism of ATP Formation 


How does a proton electrochemical gradient provide the energy 
required to drive the synthesis of ATP? To answer this question, Paul 
Boyer of UCLA published an innovative hypothesis in 1979, called the 
binding change mechanism, which has since gained wide acceptance. 


5nm 


FIGURE 5.25 Visualizing the oligomeric c ring of a chloroplast ATP 
synthase. Atomic force microscopy of a “field” of c rings that have been Components of the Binding 


isolated from chloroplast ATP synthases and reconstituted as a two- Change Hypothesis 


dimensional array within an artificial lipid layer. Rings of two different 


diameters are present in the field because the oligomers are thought to lie Overall, the binding change hypothesis has several distinct compo- 
in both possible orientations within the “artificial membrane.” (Inset) nents, which we will discuss in sequence. 

Higher-resolution view of one of the rings, showing that it is composed of 

14 subunits. 1. The energy released by the movement of protons is not used to 
Source: From Holger Seelert, et al., Nature 405:419, 2000; © 2000, drive ADP phosphorylation directly but principally to change 
reprinted by permission from Macmillan Publishers Ltd; Image courtesy the binding affinity of the active site for the ATP product. We 


of daniel J. Miiller and Andreas Engel. are used to thinking of cellular reactions occurring in an aqueous 


189 


UOIEWIOY JIV JO WusiUeYda/\) eGueyD Bulpuig aul e Q's 


190 


UOLPUOYION|\] @Y} PUe UO!eIIdSay DIGQOJeV e G YALdVHD 


environment in which the concentration of water is 55 M and the 
reactants and products are simply dissolved in the medium. 
Under these conditions, energy is required to drive the formation 
of the covalent bond linking ADP with inorganic phosphate to 
form ATP. It has been demonstrated, however, that once ADP and 
Pare bound within the catalytic site of the ATP synthase, the two 
reactants readily condense to form a tightly bound ATP molecule 
without the input of additional energy. In other words, even 
though the reaction 


soluble ADP + soluble P, + soluble ATP +H,O 


may require the input of considerable energy (7.3 kcal/mol 
under standard conditions), the reaction 


enzyme —bound ADP + enzyme —bound P, > 
enzyme -— bound ATP+H,O 


can occur spontaneously without the input of energy. This does 
not mean that ATP can be formed from ADP without energy 
expenditure. Instead, the energy is required for the release of the 
tightly bound product from the catalytic site, rather than the 
phosphorylation event itself. 


. Each active site progresses successively through three distinct 


conformations that have different affinities for substrates and 
product. Recall that the F, complex has three catalytic sites, one 
on each of the three B subunits. When investigators examined the 
properties of the three catalytic sites in a static enzyme (one that 
was not engaged in enzymatic turnover), the different sites exhib- 
ited different chemical properties. Boyer proposed that, at any 
one time, the three catalytic sites are present in different 
conformations, which causes them to have different affinity for 
nucleotides. At any given instant, one site is in the “loose” or 
L conformation in which ADP and P, are loosely bound; a second 
site is in the “tight” or T conformation in which nucleotides 
(ADP +P, substrates, or ATP product) are tightly bound; and the 
third site is in the “open” or O conformation, which, because it 
has a very low affinity for nucleotides, allows release of ATP. 

Although there were differences among the three catalytic 
sites in a static enzyme, Boyer obtained evidence that all of the 
ATP molecules produced by an active enzyme were synthesized 
by the same catalytic mechanism. It seemed, in other words, that 
all three of the catalytic sites of the enzyme were operating identi- 
cally. To explain the apparent contradiction between the asym- 
metry of the enzyme’s structure and the uniformity of its catalytic 
mechanism, Boyer proposed that each of the three catalytic sites 
passed sequentially through the same L, T, and O conformations 
(see Figure 5.27). 


. ATP is synthesized by rotational catalysis in which one part of 


the ATP synthase rotates relative to another part. To explain 
the sequential changes in the conformation of each of the catalytic 
sites, Boyer proposed that the o and B subunits, which form a 
hexagonal ring of subunits within the F, head (Figure 5.24), rotate 
relative to the central stalk. In this model, which is referred to as 
rotational catalysis, rotation is driven by the movement of protons 
through the membrane via the channel in the F, base. Thus, 
according to this model, electrical energy stored in the proton 


gradient is transduced into mechanical energy of a rotating stalk, 
which is transduced into chemical energy stored in ATP. 


Evidence to Support the Binding 
Change Mechanism and 
Rotary Catalysis 


The publication in 1994 of a detailed atomic model of the F, head by 
John Walker and his colleagues of the Medical Research Council in 
England provided a remarkable body of structural evidence to sup- 
port Boyer’s binding change mechanism. First, it revealed the struc- 
ture of each of the catalytic sites in a static enzyme, confirming that 
they differ in their conformation and their affinity for nucleotides. 
Structures corresponding to the L, T, and O conformations were 
identified in the catalytic sites of the three B subunits. Second, it 
revealed that the y subunit of the enzyme is perfectly positioned within 
the ATP synthase to transmit conformational changes from the F, 
membrane sector to the F, catalytic sites. The y subunit could be seen 
to extend as a shaft from the F, sector through the stalk and into a 
central cavity within the F, sphere (FIGURE 5.26a), where it contacts 
each of the three 8 subunits differently (Figure 5.26b). 

The apical end of the y subunit is highly asymmetric, and at any 
given instant, different faces of the y subunit interact with the differ- 
ent B subunits, causing them to adopt different (L, T, and O) confor- 
mations. As it rotates in steps of 120°, each binding site on the y 
subunit interacts successively with the three B subunits of F. During 
a single catalytic cycle, the y subunit rotates a full 360°, causing each 
catalytic site to pass sequentially through the L, T, and O conforma- 
tions. This mechanism is illustrated schematically in FIGURE 5.27 
and discussed in detail in the accompanying legend. As indicated in 
Figure 5.27, condensation of ADP and P; to form ATP occurs while 
each subunit is in the T conformation. As indicated in Figure 5.24a, 
the € subunit is associated with the “lower” portion of the y subunit, 
and the two subunits rotate together as a unit. 

Direct evidence for the rotation of the y subunit relative to the 
af subunits has been obtained in a variety of experiments. If “seeing 
is believing, then the most convincing demonstration came in 1997 
from the work of Masasuke Yoshida and colleagues at the Tokyo 
Institute of Technology and Keio University in Japan. These research- 
ers devised an ingenious system to watch individual enzyme mole- 
cules catalyze the reverse reaction from that normally operating in 
the cell. First, they prepared a genetically engineered version of the 
F, portion of the ATP synthase, consisting of three a subunits, three 
B subunits, and ay subunit (a3B37) (FIGURE 5.28a). This polypeptide 
complex was then fixed to a glass coverslip by its head, and a short, 
fluorescently labeled actin filament was attached to the end of the 
y subunit that projected into the medium (Figure 5.28a). When the 
preparation was incubated with ATP and observed under the micro- 
scope, the fluorescently labeled actin filament was seen to rotate like 
a microscopic propeller (Figure 5.28b), powered by the energy released 
as ATP molecules were bound and hydrolyzed by the catalytic sites of 
the B subunits. 

More recently, a similar type of F, complex was “forced” to per- 
form the more challenging activity that it normally carries out within 
the cell—the synthesis of ATP. In this study, a microscope magnetic 
bead was attached to the y subunit of a single F, molecule and then 
caused to rotate in a clockwise direction by subjecting the preparation 


(a) (b) 


FIGURE 5.26 The structural basis of catalytic site conformation. (a) A section through the F, head shows the spatial organization of three of its subunits. 
The y subunit is constructed of two extended and intertwined a helices. This helical stalk projects into the central cavity of the F,, and between the a and 

B subunits on each side. The conformation of the catalytic site of the B subunit (shown on the left) is determined by its contact with the y subunit. (b) A top 
view of the F, head showing the arrangement of the six a and B subunits (shown in red and yellow) around the asymmetric y subunit (shown in blue). The 

y subunit is in position to rotate relative to the surrounding subunits. It is also evident that the y subunit makes contact in a different way with each of the 
three B subunits, inducing each of them to adopt a different conformation. B_ corresponds to the O conformation, Brp to the L conformation, and Bpp to the 


T conformation. 
Source: From J. P. Abrahams, et al., Nature 370:624, 627, 1994, reprinted by Permission from Macmillan Publishers Ltd. Courtesy of John E. Walker. 


to a revolving magnetic field. When one of these F, molecules was _ from the model, three molecules of ATP were synthesized with each 
placed within a tiny transparent microchamber in the presence of 360° turn. When the magnetic field was switched off, the y subunit 
ADP and P, the forced rotation of the y subunit led to the synthesis of rotated spontaneously in the reverse direction, driven by the hydrolysis 
ATP, which built up to UM concentrations. As would be expected —_ of the recently synthesized ATP. Taken together, these innovative 


ADP + Pi-~ /o> AIP 


ADP + Pj ADP + Pj a . 
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FIGURE 5.27 The binding change mechanism for ATP synthesis. (a) Schematic drawing showing changes in a single catalytic site during a cycle of 
catalysis. At the beginning of the cycle, the site is in the open (O) conformation, and substrates ADP and P are entering the site. In step 1, the movement 

of protons through the membrane induces a shift to the loose (L) conformation in which the substrates are loosely bound. In step 2, the movement of 
additional protons induces a shift to the tight (T) conformation, in which the affinity for substrates increases, causing them to be tightly bound to the 
catalytic site. In step 3, the tightly bound ADP and P, spontaneously condense to form a tightly bound ATP; no change in conformation is required for 

this step. In step 4, the movement of additional protons induces a shift to the open (O) conformation, in which the affinity for ATP is greatly decreased, 
allowing the product to be released from the site. Once the ATP has dissociated, the catalytic site is available for substrate binding, and the cycle is 
repeated. (b) Schematic drawing showing changes at all three catalytic sites of the enzyme simultaneously. The movement of protons through the F, part of 
the enzyme causes the rotation of the asymmetric y subunit, which displays three different faces to the catalytic subunits. As the y subunit rotates, it induces 
changes in the conformation of the catalytic site of the B subunits, causing each catalytic site to pass successively through the T, O, and L conformations. 
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Coverslip coated with Ni-NTA 
(a) 


FIGURE 5.28 Direct observation of rotational catalysis. (a) To carry out 
the experiment, a modified version of a portion of the ATP synthase 
consisting of a3 B3y was prepared. Each 8 subunit was modified to contain 
10 histidine (His) residues at its N-terminus, a site located on the outer 
(matrix) face of the F, head. The side chains of histidine have a high affinity 
for a substance (Ni-NTA), which was used to coat the coverslip. The 

y subunit was modified by replacing one of the serine residues near the 
end of the stalk with a cysteine residue, which provided a means to attach 
the fluorescently labeled actin filament. In the presence of ATP, the actin 
filament was observed to rotate counterclockwise (when viewed from the 
membrane side). At low ATP concentrations, the actin filaments could be 
seen to rotate in steps of 120°. (b) A sequence of four frames from a video 
of a rotating actin filament. 

Source: From H. Noji, et al., Nature 386: 299, 1997, reprinted by permission 
from Macmillan Publishers Ltd; Courtesy of Masasuke Yoshida. 


bioengineering experiments demonstrate unequivocally that ATP 
synthase operates as a rotary motor. 

Rotary machines are common in our industrialized society; we 
use rotary turbines, rotary drills, rotary wheels, and propellers, to 
name just a few. But rotary devices are extremely rare in living organ- 
isms. There are, for example, no known rotary organelles in eukary- 
otic cells, rotary joints in animals, or rotary feeding structures in the 
biological world. In fact, only two types of biological structures are 
known that contain rotating parts: ATP synthases (and related pro- 
teins that act as ionic pumps) and bacterial flagella (which are pic- 
tured in Figure 1.14a, inset), both of which can be described as 
rotary “nanomachines” because their size is measured in nanome- 
ters. Engineers have begun to invent nanoscale devices made of 
inorganic materials that may one day carry out various types of sub- 
microscopic mechanical activities. The construction of nanosized 
motors poses a particular challenge, and attempts have already been 
made to use the ATP synthase to power simple, inorganic devices. 
Someday humans may be using ATP instead of electricity to power 
some of their most delicate instruments. 


REVIEW | 
1. Describe the steps in ATP synthesis according to the 
binding change mechanism. 


2. Describe some of the evidence that supports the 
binding change mechanism. 


. — 


5.9 Using the Proton Gradient 


By 1997, a detailed understanding of the working machinery of the 
F, complex was in hand, but major questions about the structure and 
function of the membrane-bound F, portion of the enzyme remained 
to be answered. Most important among them were: What is the path 
taken by protons as they move through the F, complex, and how 
does this movement lead to the synthesis of ATP? It had been 
postulated that: 


1. The c subunits of the F, base were assembled into a ring that 
resides within the lipid bilayer (as in Figure 5.24a). 


2. Thec ring is physically bound to the Y subunit of the stalk. 


3. The “downhill” movement of protons through the membrane 
drives the rotation of the ring of c subunits. 


4. The rotation of thec ring of F, provides the twisting force (torque) 
that drives the rotation of the attached Y subunit, leading to the 
synthesis and release of ATP by catalytic subunits of the F, ring. 


All of these presumptions have been confirmed. We will look at each 
of these aspects in more detail. 


The Role of the F, Portion of ATP 
Synthase in ATP Synthesis 


A body of evidence, including X-ray crystallography and atomic 
force microscopy, has shown that the c subunits are indeed organ- 
ized into a circle to form a ring-shaped complex (Figure 5.25). High- 
resolution electron micrographs indicate that the two b subunits and 
the single a subunit of the F, complex reside outside the ring of 
c subunits, as shown in Figure 5.24. The b subunits are thought to be 
primarily structural components of the ATP synthase. The two elon- 
gated b subunits form a peripheral stalk that connects the F, and F, 
portions of the enzyme (Figure 5.24b) and, along with the d subunit 
of F, are thought to hold the a,8, subunits in a fixed position while 
the Y subunit rotates within the center of the complex. 

The rotation of the c ring of F, during ATP synthesis has been 
demonstrated in numerous experiments, confirming that both the 
c ring and Y subunit act as rotors during enzyme activity. How are 
these two “moving parts” connected? Each c subunit is built like a 
hairpin: it contains two transmembrane helices connected by a 
hydrophilic loop that projects toward the F, head. The hydrophilic 
loops situated on the tops of the c subunits are thought to form a bind- 
ing site for the bases of the Y and € subunits, which together act like a 
“foot” that is firmly attached to the c ring (Figures 5.24 and 5.29). As 
a result of this attachment, rotation of the c ring drives rotation of 
the attached Y subunit, which can occur at rates of more than 100 
revolutions per second. 

The mechanism by which H* movements drive the rotation of the 
c ring is more complex and less well understood. FIGURE 5.29 pro- 
vides a model as to how H* ions might flow through the F, complex. 
Keep in mind in the following discussion of this model (1) that the 
subunits of the c ring move successively past a stationary a subunit 
and (2) that protons are picked up from the intermembrane space 
one at a time by each c subunit and carried completely around a circle 
before they are released into the matrix. In this model, each a subunit 
has two half-channels that are physically separated (offset) from one 
another. One half-channel leads from the intermembrane (cytosolic) 


Ring of 
c subunits 


FIGURE 5.29 A model in which proton diffusion is coupled to the 
rotation of the c ring of the F, complex. As discussed in the text, the 
number of subunits in the c ring is variable. For the sake of simplicity, this 

c ring consists of 12 subunits. It is proposed in this model that each proton 
from the intermembrane space enters a half-channel within the a subunit 
and then binds to an acidic residue (Aspé611 in E. coli) that is accessible on 
one of the c subunits. Proton binding induces a conformational change that 
causes the ring to move by approximately 30°. The bound proton is carried 
in a full circle by the rotation of the c ring and is then released into a 
second half-channel that opens into the matrix. A succession of protons 
that engage in this activity causes the c ring to rotate in a counterclockwise 
direction as shown. 


space into the middle of the a subunit, and the other leads from the 
middle of the a subunit into the matrix. It is proposed that each pro- 
ton moves from the intermembrane space through the half-channel 
and binds to a negatively charged acidic residue situated at the sur- 
face of the adjoining c subunit. This acidic residue is apparently an 
aspartic acid in E. coli (Figure 5.29), but is a glutamic acid residue in 
animal mitochondria. Binding of the proton to the carboxyl group of 
that residue causes that subunit to rotate approximately 308 in a 
counterclockwise direction. The movement of the recently proto- 
nated c subunit brings the adjoining subunit in the ring (which was 
protonated at an earlier step) into alignment with the second half- 
channel of the a subunit. Once there, the acidic residue releases its 
associated proton, which diffuses into the matrix. Following disso- 
ciation of the proton, the c subunit returns to its original conforma- 
tion and is ready to accept another proton from the intermembrane 
space and repeat the cycle. 

According to this model, the proton-binding residue of each 
c subunit acts like a revolving proton carrier. A proton hops onto the 
carrier at a selected pick-up site and is then carried around a circle 
before being released at a selected drop-off site. Movement of the ring 
is driven by the conformational changes associated with the sequential 
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unit. Consequently, the number of protons translocated for each rota- 
tion of thec ring is generally considered to be equal to the number of 
subunits that make up the ring, although this is not always experi- 
mentally validated. The c ring has been shown to contain between 8 
and 15 subunits, depending on the source. For the sake of simplicity, 
we will describe a c ring composed of 12 subunits, as illustrated in 
Figure 5.29. In this case, the association/dissociation of four protons 
in the manner described would move the ring 120°. Movement of 
the c ring 120° would drive a corresponding rotation of the attached 
Y subunit 120°, which would lead to the release of one newly synthe- 
sized molecule of ATP by the F, complex. According to this stoichi- 
ometry, the translocation of 12 protons would lead to the full 360° 
rotation of the c ring and Y subunit, and the synthesis and release of 
three molecules of ATP. If the c ring were to contain greater or fewer 
than 12 subunits, the Ht/ATP ratio would change, but this can easily 
be accommodated within the basic model of proton-driven rotation 
shown in Figure 5.29. 


Other Roles for the Proton-Motive 
Force in Addition to ATP Synthesis 


Although the production of ATP may be the most important activity 
of mitochondria, these organelles are engaged in numerous other 
processes that require the input of energy. Unlike most organelles 
that rely primarily on ATP hydrolysis to power their activities, mito- 
chondria rely on an alternate source of energy—the proton-motive 
force. For example, the proton-motive force drives the uptake of 
ADP and P, into the mitochondria in exchange for ATP and H+, 
respectively. Exchange of ATP for ADP across the membrane is 
accomplished by the adenine nucleotide translocase (ANT), which 
makes the ATP produced within the mitochondria available to meet 
the energy needs of the remainder of the cell. These and other activi- 
ties that take place during aerobic respiration are summarized in 
FIGURE 5.30. In other examples, the proton-motive force may be 
used as the source of energy to “pull” calcium ions into the mito- 
chondrion; to drive the events of mitochondrial fusion; and to cause 
specifically targeted polypeptides to enter the mitochondrion from 
the matrix (Figure 8.47). 


REVIEW | 


1. Describe a proposed mechanism whereby proton 
diffusion from the intermembrane space into the 
matrix drives the phosphorylation of ADP. 


oo 


5.10 Peroxisomes 


Peroxisomes are simple, membrane-bound vesicles (Figure 5.31a) 
with a diameter of 0.1 to 10 [1m that may contain a dense, crystalline 
core of oxidative enzymes. Peroxisomes are multifunctional orga- 
nelles, containing more than 50 enzymes involved in such diverse 
activities as the oxidation of very-long-chain fatty acids (VLCFAs, 
ones whose chain typically contains 24 to 26 carbons) and the syn- 
thesis of plasmalogens, which are an unusual class of phospholipids 


SOWOSIXOIOg © QL'S 


194 


USLPUOYSOI|A| a4 PUe UONeIdsey DIGOleY © G YALdVHO 


WI) DZ N]IQOIC tht! 
; PPO RRE RY 
N\. 


Pyruvic acid 
(c3) 


Intermembrane P; H+ 
space 


in which one of the fatty acids is linked to glycerol by an ether link- 
age rather than an ester linkage. Plasmalogens are very abundant in 
the myelin sheaths that insulate axons in the brain (Figure 4.5). 
Abnormalities in the synthesis of plasmalogens can lead to severe 
neurological dysfunction. The enzyme luciferase, which generates 
the light emitted by fireflies, is also a peroxisomal enzyme. Peroxisomes 
are considered in the present chapter because they share several 
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FIGURE 5.30 Summary of the major activities during 
aerobic respiration in a mitochondrion. 


ATP 
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properties with mitochondria: both mitochondria and 
peroxisomes form by splitting from preexisting orga- 
nelles, using some of the same proteins to accomplish 
the feat; both types of organelles import preformed 
proteins from the cytosol (Section 8.21); and both 
engage in similar types of oxidative metabolism. In 
fact, at least one enzyme, alanine/glyoxylate ami- 
notransferase, is found in the mitochondria of some 
mammals (e.g., cats and dogs) and in the peroxisomes 
of other mammals (e.g., rabbits and humans). The two 
types of organelles have become even more closely 
linked with recent reports that mitochondria form vesi- 
cles that carry cargo destined for peroxisomes. 

These organelles were named “peroxisomes” because 
they are the site of synthesis and degradation of hydro- 
gen peroxide (H,O,), a highly reactive and toxic oxidiz- 
ing agent. Hydrogen peroxide is produced by a number 

of peroxisomal enzymes, including urate oxidase, glyco- 
late oxidase, and amino acid oxidases, that utilize molecular oxygen 
to oxidize their respective substrates (FIGURE 5.31b). The H,O, 
generated in these reactions is rapidly broken down by the enzyme 
catalase, which is present in high concentrations in these organelles. 
The importance of peroxisomes in human metabolism is evident 
from the discussion in the accompanying Human Perspective. 


Oxidases 


FIGURE 5.31 The structure and function of peroxisomes. 
(a) Electron micrograph of a section of a rat liver cell, which 
had been stained for catalase, an enzyme localized in 
peroxisomes. Note the close association between the darkly 
stained peroxisomes and mitochondria. An electron micro- 
graph of a peroxisome within a plant cell is shown in Figure 
6.23. Bar equals 250 nm. (b) Peroxisomes contain enzymes that 
carry out the two-step reduction of molecular oxygen to water. 
In the first step, an oxidase removes electrons from a variety of 
substrates (RH), such as uric acid or amino acids. In the 
second step, the enzyme catalase converts the hydrogen 
peroxide formed in the first step to water. 

Source: From Michael Schrader and Yisang Yoon, Bioess. 
29:1106, 2007. © 2007, John Wiley & Sons. 


Peroxisomes are also present in plants. Plant seedlings containa 195 
specialized type of peroxisome, called a glyoxysome (FIGURE 5.32). 
Plant seedlings rely on stored fatty acids to provide the energy and 
material to form a new plant. One of the primary metabolic activi- 
ties in these germinating seedlings is the conversion of stored fatty 
acids to carbohydrate. Disassembly of stored fatty acids generates 
acetyl CoA, which condenses with oxaloacetate (OAA) to form 
citrate, which is then converted into glucose by a series of enzymes 
of the glyoxylate cycle localized in the glyoxysome. The role of 
peroxisomes in the metabolism of leaf cells is discussed in 
Section 6.9. 
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1. What are some of the major activities of peroxisomes? 
What is the role of catalase in these activities? 


FIGURE 5.32 Glyoxysome localization within plant seedlings. Light 
micrograph of a section through cotyledons from imbibed cotton seeds. 
The glyoxysomes, which are seen as small dark structures (arrow), have 


been made visible by a cytochemical stain for the enzyme catalase 2. In what ways are peroxisomes similar to mitochondria? 
Source: From Richard N. Trelease and Kent D. Chapman, J. Cell Biol. In what ways are they unique? 
115:998, 1991, Fig. 1.© 1991 Rockefeller University Press. oo 


5.11 THE HUMAN PERSPECTIVE 


Diseases that Result from Abnormal Mitochondrial or Peroxisomal Function 


Mitochondria maternally. In addition, the mitochondria of a cell can contain a 
In 1962, Rolf Luft of the University of Stockholm in Sweden mixture of normal (i.e., wild-type) and mutant mtDNA, a condition 
reported on a study of mitochondria isolated from a woman who known as heteroplasmy. The level of mutant mtDNA can vary 
had been suffering from long-standing fatigue and muscle weak- from one organ to another within a single individual, and it is only 
ness and who exhibited an elevated metabolic rate and body when a preponderance of the mitochondria of a particular tissue 
temperature. The mitochondria from this patient were somehow contains defective genetic information that clinical symptoms 


released from their normal respiratory control. Normally, when usually appear. Because of this variability, members of a family 
ADP levels are low, isolated mitochondria stop oxidizing sub- carrying the same mtDNA mutation can exhibit strikingly different 
strate. In contrast, the mitochondria from this patient continued — symptoms. 

to oxidize substrate at a high rate, even in the absence of ADP Flow of electrons along the mitochondrial electron transport 
to phosphorylate, producing heat rather than mechanical work. chain has been shown to be a major source of mutagenic oxygen 
Since that initial report, a variety of disorders have been traced radicals—so-called reactive oxygen species (ROS) (page 34). 
to mitochondrial dysfunction. Most of these disorders are char- Because of its proximity to the inner mitochondrial membrane 
acterized by degeneration of muscle or brain tissue, both of — where electron transport occurs, mtDNA is thought to come under 
which utilize exceptionally large amounts of ATP. These condi- a much greater level of attack than nuclear DNA. In addition, 
tions range in severity from diseases that lead to death during nuclear DNA is better protected from persistent damage by the 
infancy; to disorders that produce seizures, blindness, deafness, presence of a greater variety of DNA repair systems (Section 13.8). 


and/or stroke-like episodes; to milder conditions characterized by For these reasons, mtDNA experiences more than 10 times the 
intolerance to exercise or nonmotile sperm. Patients with serious mutation rate of nuclear DNA. Mitochondrial mutations are par- 
conditions generally have abnormal skeletal muscle fibers, which __ ticularly likely to accumulate in cells that remain in the body for 
contain large peripheral aggregates of mitochondria (FIGURE 1a). long periods of time, such as those of nerve and muscle tissue. 
Closer examination of the mitochondria reveals large numbers of | The degenerative changes in mitochondrial function are thought 
abnormal inclusions (Figure 1b). The proteins of the respiratory to contribute to a number of common neurologic diseases with 


chain are split between the nuclear and mitochondrial genomes, adult onset, most notably Parkinson's disease (PD). This possi- 
and diseases that affect respiratory function can be caused by bility originally came to light in the early 1980s when a number of 
mutations in either nuclear or mitochondrial genes. young drug addicts reported to hospitals with the sudden onset 

The inheritance of mitochondrial disorders contrasts in sev- of severe muscular tremors that are characteristic of advanced 
eral ways with the Mendelian inheritance of nuclear genes. The Parkinson's disease in elderly adults. It was discovered that these 
mitochondria present in the cells of ahuman embryo are derived individuals had injected themselves intravenously with a synthetic 
exclusively from mitochondria that were present in the egg at heroin that was contaminated with a compound called MPTP. 
the time of conception without any contribution from the fertil- Subsequent studies revealed that MPTP caused damage to com- 


izing sperm. Consequently, mitochondrial disorders are inherited plex | of the mitochondrial respiratory chain, leading to the death 
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(a) 


(b) 


FIGURE 1. Mitochondrial abnormalities in skeletal muscle. (a) Ragged-red fibers. These degenerating muscle fibers are from a biopsy of a patient 
and show accumulations of red-staining “blotches” just beneath the cells’ plasma membrane, which are due to abnormal proliferation of mitochondria. 
(b) Electron micrograph showing crystalline structures within the mitochondrial matrix from cells of a patient with abnormal mitochondria. 


Source: (a) Courtesy of Donald R. Johns; (6) From John A. Morgan-Hughes and D. N. Landon, in Myology, 2d ed., A. G. Engel and C. Franzini- 


Armstrong, eds., reproduced with permission of Mcgraw-hill, © 1994. 


of nerve cells in the same region of the brain (the substantia nigra) 
that is affected in patients with PD. When cells from the substantia 
nigra of PD patients were then studied, they too were found to 
have a marked and selective decrease in complex | activity. This 
complex | deficiency is probably due to a documented increase 
in the mutation rate in the DNA of these cells compared to other 
cells. Studies have also implicated exposure to certain pesticides, 
particularly rotenone, a known complex | inhibitor, as an envi- 
ronmental risk factor for development of PD. Administration of 
rotenone to rats causes the destruction of dopamine-producing 
neurons, which is characteristic of the human disease. The pos- 
sible link between rotenone and Parkinson's disease is being 
investigated. 

It has long been speculated that the gradual accumulation 
of mutations in mtDNA is a major causal factor in human aging. 
This hypothesis is fueled by the fact that cells taken from older 
persons have an increased number of mtDNA mutations com- 
pared to the same cells from younger individuals. But are these 
mutations a cause of aging or simply a consequence of a more 
basic underlying aging process that affects many physiological 
properties? The experiments that may be the most relevant to 
this question have been performed on mice that are homozygous 
for a mutant gene (called Polg) that encodes the enzyme that rep- 
licates mtDNA. The mtDNA of these mice accumulates a much 
higher level of mutations than that of their normal littermates. 
These “mutator” mice appear normal for the first 6 to 9 months of 
age, but then rapidly develop signs of premature aging, such as 
hearing loss, graying hair, and osteoporosis (FIGURE 2). Whereas 
the control mice have a life span of more than two years, mem- 
bers of the experimental group live, on average, only about one 
year. Interestingly, the genetically modified animals show no evi- 
dence of increased oxidative (free radical) damage, so it is not 
clear how the increased mtDNA mutation rate causes this pheno- 
type. It is also noteworthy that mice that are heterozygous for the 
Polg mutation have an elevated mtDNA mutation rate (although 
far less than the homozygotes), yet they do not exhibit a short- 
ened life span. If mtDNA mutations were a major contributor 
to the normal aging process, these heterozygotes should have 


FIGURE 2 A premature-aging phenotype caused by increased 
mutations in mtDNA. The photograph shows a 13-month-old normal 
mouse and its “aged” littermate whose mitochondrial DNA harbors an 
abnormally high number of mutations. This premature-aging phenotype 
was caused by a mutation in the nuclear gene that encodes the DNA 
polymerase responsible for replication of mtDNA. 


Source: Courtesy of Jeff Miller, University of Wisconsin-Madison, Image 
provided by G. C. Kujoth. 


a significantly shorter life. Taken together, these results suggest 
that mutations in mtDNA may be sufficient to cause an animal to 
age prematurely, but they are not necessarily required as part of 
the normal aging process. Whether these studies on mice bear 
directly on human aging remains unclear. 


Peroxisomes 

Zellweger syndrome (ZS) is a rare inherited disease characterized 
by a variety of neurologic, visual, and liver abnormalities leading 
to death during early infancy. In 1973, Sidney Goldfischer and 
his colleagues at Albert Einstein College of Medicine reported 
that liver and renal cells from these patients were lacking per- 
oxisomes. Subsequent investigations revealed that peroxisomes 
were not entirely absent from the cells of these individuals, but, 
rather, they were present as empty membranous “ghosts”—that 


is, organelles lacking the enzymes that were normally found in 
peroxisomes. It is not that these individuals are unable to synthe- 
size peroxisomal enzymes, but rather that the enzymes fail to be 
imported into the peroxisomes and remain largely in the cytosol 
where they are unable to carry out their normal functions. Genetic 
studies of cells from ZS patients showed that the disorder can 
arise from mutations in at least 12 different genes, all encoding 
proteins involved in the uptake of peroxisomal enzymes from the 
cytosol. 

Unlike Zellweger syndrome, which affects a wide variety of 
peroxisomal functions, a number of inherited diseases are char- 
acterized by the absence of a single peroxisomal enzyme. One 
of these single-enzyme deficiency diseases is X-linked adrenoleu- 
kodystrophy (X-ALD), which was the subject of the 1993 movie 
Lorenzo's Oil. Boys with the disease are typically unaffected until 
mid-childhood, when symptoms of adrenal insufficiency and neu- 
rological dysfunction begin. The disease results from a defect ina 
membrane protein (ABCD1) that transports very-long-chain fatty 
acids (VLCFAs) into the peroxisomes, where they are normally 
metabolized. In the absence of this protein, VLCFAs accumulate 
in the brain and destroy the myelin sheaths that insulate nerve 


cells. In Lorenzo's Oil, the parents of a boy stricken with X-ALD 
discover that a diet rich in certain fatty acids is able to retard the 
progress of the disease. Subsequent studies have generated con- 
tradictory results regarding the value of this diet. A number of 
X-ALD patients have been successfully treated by bone marrow 
transplantation, which provides normal cells capable of metab- 
olizing VLCFAs, and by administration of drugs (e.g., lovastatin) 
that may lower VLCFA levels. The most exciting prospects for the 
treatment of X-ALD came in 2009 with the report that two boys 
in France had been successfully treated by gene therapy. In this 
clinical trial, stem cells were isolated from blood, a normal copy 
of the ABCD1 gene was introduced into the cells, which were 
then infused back into the patients from which they were derived. 
Some of these genetically engineered stem cells gave rise to dif- 
ferentiated cells that took up residence in the brain. These cells 
were able to produce the missing protein, leading to the arrest of 
further neurodegeneration. Years after the therapy, the boys had 
still not shown further progression of the disease, but so far the 
entire evaluation of this strategy rests on this single report of two 
individuals. A large-scale clinical trial of this treatment was started 
in 2013. 
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Analytic Questions —-———— 


1. Consider the reaction A: H+B =B:H+A. If, at equilibrium, the 
ratio of [B:H]/[B] is equal to 2.0, one can conclude that (1) B:H is 
the strongest reducing agent of the four compounds; (2) the 
(A:H-A) couple has a more negative redox potential than the 7 
(B:H -B) couple; (3) none of the four compounds are cytochromes; 
(4) the electrons associated with B are higher energy than those 
associated with A. Which of the previous statements are true? 
Draw one of the half-reactions for this redox reaction. 


6. Look at the drop in energy during electron transport shown in 
Figure 5.14. How would this profile be different if the original 
electron donor was FADH, rather than NADH? 


. Would you expect the import of P. to lead to a decrease in the 
proton-motive force? Why? 


8. Inthe chart of standard redox potentials in Table 5.1, the oxaloac- 
etate-malate couple is less negative than the NAD*-NADH cou- 
ple. How are these values compatible with the transfer of 


2. Mitochondrial DNA is replicated by the enzyme DNA polymerase electrons from malate to NAD* in the TCA cycle? 


Y. The “mutator” mice discussed on page 196 had this phenotype 
because they possessed a polymerase Y that tended to make 
mistakes as it copied its mtDNA template. Several limitations in 
interpreting these studies in terms of the normal aging process 
were pointed out. What do you think could be learned by gener- 
ating mice that had a super-accurate polymerase Y, that is, one 
that made fewer mistakes than the normal (wild-type) version? 


9. How many high-energy triphosphates are formed by substrate- 
level phosphorylation during each turn of the TCA cycle (con- 
sider the TCA cycle reactions only)? How many are formed as the 
result of oxidative phosphorylation? How many molecules of 
CO, are released? How many molecules of FAD are reduced? 
How many pairs of electrons are stripped from substrate? 


10. The assembly of Fe-S clusters, which takes place in the mito- 
chondrial matrix, is highly vulnerable to the presence of Op. 
Given the importance of mitochondria in aerobic metabolism, 
does this seem to be an unlikely place for this process to occur? 


b. Give the reaction that would occur if one were to mix reduced 11. The drop in AG” of a pair of electrons as they pass from NADH 

A molecules and oxidized B molecules. to O» is -52.6kcal/mol, and the AG” of ATP formation is 
+7.3 kcal/mol. If three ATPs are formed for each pair of electrons 
removed from substrate, can you conclude that oxidative 
phosphorylation is only 21.9/52.6 or 42 percent efficient? Why or 
why not? 


3. Protein A is a flavoprotein with a redox potential of —0.2V. 
Protein B is a cytochrome with a redox potential of +0.1V. 


a. Draw the half-reactions for each of these two electron carriers. 


c. Which two compounds in the reaction of part b would be pre- 
sent at the highest concentration when the reaction reached 
equilibrium? 

4. Of the following substances—ubiquinone, cytochrome c, NAD*, 

NADH, O2, H2O—which is the strongest reducing agent? Which 12. Would you expect metabolically active, isolated mitochondria to 

is the strongest oxidizing agent? Which has the greatest affinity acidity or alkalize the medium in which they were suspended? Why? 


for electrons? (See Quantitative Tutorial Video). 13. Suppose the movement of three protons were required for the 


synthesis of one ATP molecule. Calculate the energy released by 
the passage of three moles of protons into the matrix when the 
proton-motive force is measured at 220mV (see page 187 for 
information) (See Quantitative Tutorial Video). 


5. If it were determined that the inner mitochondrial membrane 
was freely permeable to chloride ions, what effect might this 
have on the proton-motive force across the inner mitochondrial 
membrane? 


198 14. Would you expect isolated mitochondria to be able to carry out 
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15. 


16. 


the oxidation of glucose with the accompanying production of 
ATP? Why or why not? What might be a good compound to add to 
a preparation of isolated mitochondria to achieve ATP synthesis? 


Calculate the free energy released when FADH, is oxidized by 
molecular O2 under standard conditions. 
Using your understanding of the organization of the mitochon- 
drion and how it generates ATP, answer the following questions. 
Give a brief explanation of each answer. 


a. What would be the effect on ATP production of adding a sub- 
stance (protonophore) that makes the mitochondrial mem- 
brane leaky to protons? 

b. What would be the effect on the pH of the mitochondrial 
matrix of reducing the oxygen supply? 


c. Suppose that the glucose supply to the cell is greatly reduced. 
What would be the effect on CO, production by the 
mitochondria? 


Ve: 


18. 


19: 


Suppose that you are able to manipulate the potential of the 
inner membrane of an isolated mitochondrion. You measure 
the pH of the mitochondrial matrix and find it to be 8.0. You 
measure the bathing solution and find its pH to be 7.0. You clamp 
the inner membrane potential at +59 mV—that is, you force the 
matrix to be 59 mV positive with respect to the bathing medium. 
Under these circumstances, can the mitochondrion use the pro- 
ton gradient to drive the synthesis of ATP? Explain your answer in 
terms of the proton-motive force (see Section 5.6 to review) 


Suppose you were able to synthesize an ATP synthase that 
was devoid of the Y subunit. How would the catalytic sites of the 
Y subunits of such an enzyme compare to one another. Why? 


In functional terms, the ATP synthase can be divided into two 
parts: a “stator” made up of subunits that do not move during 
catalysis, and a “rotor” made up of parts that do move. Which 
subunits of the enzyme make up each of these two parts? How 
are the immovable parts of the stator in F, related structurally to 
the immovable parts of the stator in F,? 


BIOFUEL: RUNNING YOUR CAR 
BY PHOTOSYNTHESIS 


W. use fossil fuels (oil, coal, and gas) for the vast 
majority of our energy needs. But the supply of fossil 
fuel is finite and running out. New mining and extraction 
technologies such as fracking are allowing more fuel to 
be extracted from the ground, but there's still a limit to 
what is available. Meanwhile, the sun is bathing our 
entire planet in energy. We currently harvest a minus- 
cule fraction of this light using solar panels, but these 
are expensive and inefficient. Meanwhile, plants, algae, 
and bacteria have been converting light to chemical 
energy for over a billion years, and it is their leftover 
biomass buried underground that gave rise to the fossil 


Photosynthesis and 
the Chloroplast 


fuels we burn today. What if we could take advantage of 


these organisms to produce fuel? Nobody has yet found 


a way to produce biofuel (fuel derived from living 
organisms) at a price that can compete with fossil fuels. 
Nevertheless, the promise of such biofuels has sparked 
an intense research and development effort. In this 
chapter we will see how plants collect light from the sun 
and convert it into chemical energy to support their own 
growth and metabolism. 


Source: Pierre Marchal / LookatSciences / Phototake 
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The Earth’s earliest life forms must have obtained their raw materials 
and energy from simple organic molecules dissolved in their aque- 
ous environment. These organic molecules had to have formed abi- 
otically, that is, as a result of nonbiological chemical reactions 
occurring in the primeval seas. Thus, just as we survive on nutrients 
taken in from our environment, so too must have the original life 
forms. Organisms that depend on an external source of organic 
compounds are called heterotrophs. 

The number of heterotrophic organisms living on the primitive 
Earth must have been severely restricted because the spontaneous pro- 
duction of organic molecules occurs very slowly. The evolution of life 
on Earth received a tremendous boost with the appearance of organ- 
isms that employed a new metabolic strategy. Unlike their predeces- 
sors, these organisms could manufacture their own organic nutrients 
from the simplest types of inorganic molecules, such as carbon dioxide 
(CO,) and hydrogen sulfide (H,S). Organisms capable of surviving on 
CO, as their principal carbon source are called autotrophs. 

The manufacture of complex organic molecules from CO, 
requires the input of large amounts of energy. Over the course of 
evolution, two main types of autotrophs have evolved that can be 
distinguished by their source of energy. Chemoautotrophs utilize 
the chemical energy stored in inorganic molecules (such as ammo- 
nia, hydrogen sulfide, or nitrites) to convert CO, into organic com- 
pounds, while photoautotrophs utilize the radiant energy of the 
sun to achieve this result. Because all chemoautotrophs are prokary- 
otes, and their relative contribution to the formation of biomass on 
Earth is small, we will not consider their metabolic activities further. 
Photoautotrophs, on the other hand, are responsible for capturing 
the energy that fuels the activities of most organisms on Earth. 
Photoautotrophs include plants and eukaryotic algae, various flagel- 
lated protists, and members of several groups of prokaryotes. All of 
these organisms carry out photosynthesis, a process in which 
energy from sunlight is transformed into chemical energy that is 
stored in carbohydrates and other organic molecules. 

During photosynthesis, relatively low-energy electrons are 
removed from a donor compound and converted into high-energy 
electrons using the energy absorbed from light.’ These high-energy 
electrons are then employed in the synthesis of reduced biological 
molecules, such as starch and oils. It is likely that the first groups of 
photoautotrophs, which may have dominated Earth for two billion 
years, utilized hydrogen sulfide as their source of electrons for pho- 
tosynthesis, carrying out the overall reaction 


CO, +2 H,S —#* » (CH,0) +2 


where (CH,0) represents a unit of carbohydrate. There are numer- 
ous bacteria living today that carry out this type of photosynthesis; 
an example is shown in FIGURE 6.1. But, today, hydrogen sulfide is 
neither abundant nor widespread; consequently, organisms that 
depend on this compound as an electron source are restricted to 
habitats such as sulfur springs and deep-sea vents. 

Approximately 2.7 to 2.4 billion years ago, a new type of photo- 
synthetic prokaryote appeared on Earth that was able to utilize a 
much more abundant source of electrons, namely, water. Not only did 
the use of water allow these organisms—the cyanobacteria—to exploit 
a much more diverse array of habitats on Earth (see Figure 1.15), it 
produced a waste product of enormous consequence for all forms of 


1See the footnote on page 173 regarding use of the term high-energy electrons. 
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FIGURE 6.1 Photosynthetic green sulfur bacteria are present as a ring 
of peripheral cells that are living in a symbiotic relationship with a single 
anaerobic, heterotrophic bacterium in the center of the “colony.” The 
heterotrophic bacterium receives organic matter produced by the photo- 
synthetic symbionts. Photosynthetic vesicles containing the light-capturing 
machinery are visible in the green sulfur bacteria. 

Source: From Tom Fenchel, Science 296:1070, 2002, Electron micrograph 
by Ken Clarke, Ceh-Windermere, U.K. © 2002, Reprinted with permission 
from AAAS. 


life. The waste product was molecular oxygen (O,), which is formed 
in the overall reaction 


CO, +H,O —2* > (CH,0)+0. 


The switch from H,S to H,O as the substrate for photosynthesis is 
more difficult than switching one letter in the alphabet for another. 
The redox potential of the S-H,S couple is —0.25V as compared to 
+0.816V for the O,-H,O couple (page 179). In other words, the 
sulfur atom in a molecule of H,S has much less affinity for its elec- 
trons (and so is easier to oxidize) than does the oxygen atom in a 
molecule of H,O. Thus, if an organism is going to carry out oxygenic 
(oxygen-releasing) photosynthesis, it has to generate a very strong 
oxidizing agent as part of its photosynthetic metabolism in order to 
pull tightly held electrons from water. The switch from H,S (or other 
reduced substrates) to H,O as the electron source for photosynthesis 
required an overhaul of the photosynthetic machinery. 

At some point in time, one of these ancient, O,-producing 
cyanobacteria took up residence inside a mitochondria-containing, 
nonphotosynthetic, primitive eukaryotic cell (page 27). Over a long 
period of evolution, the symbiotic cyanobacterium was transformed 
from a separate organism living within a host cell into a cytoplasmic 
organelle, the chloroplast. As the chloroplast evolved, most of the 
genes that were originally present in the symbiotic cyanobacterium 
were either lost or transferred to the plant cell nucleus. As a result, the 
polypeptides found within modern-day chloroplasts are encoded by 
both the nuclear and chloroplast genomes. Extensive genetic analyses 
of chloroplast genomes suggest that all modern chloroplasts have 
arisen from a single, ancient symbiotic relationship. As a result of 


their common ancestry, chloroplasts and cyanobacteria share many 
basic characteristics, including a similar photosynthetic machinery, 
which will be discussed in detail in the following sections. 


REVIEW | 


1. Describe the effect that the evolution of cyanobacteria 
is thought to have had on the metabolism of 
organisms. 
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6.2 Chloroplast Structure 


Chloroplasts are located predominantly in the mesophyll cells of 
leaves. The structure of a leaf and the arrangement of chloroplasts 
around the central vacuole of a mesophyll cell are shown in FIGURE 6.2. 
Chloroplasts of higher plants are generally lens-shaped (FIGURE 6.3a), 
approximately 2 to 4 um wide and 5 to 10 um long, and typically 
numbering 20 to 40 per cell. Their dimensions make chloroplasts 
giants among organelles—as large as an entire mammalian red blood 
cell. As illustrated in the chapter-opening micrograph, chloroplasts 
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FIGURE 6.2 The functional organization of a leaf. The section of the 
leaf shows several layers of cells that contain chloroplasts distributed within 
their cytoplasm. These chloroplasts carry out photosynthesis, providing raw 
materials and chemical energy for the entire plant. 
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FIGURE 6.3 The internal structure of a chloroplast. (a) Transmission 
electron micrograph through a single chloroplast. The internal membrane is 
arranged in stacks (grana) of disk-like thylakoids that are physically separate 
from the outer double membrane that forms the envelope. (b) Schematic 
diagram of a chloroplast showing the outer double membrane and the 
thylakoid membranes. 

Source: Biology Pics/ Photo Researchers, Inc. 


arise by fission from preexisting chloroplasts (or their nonpigmented 
precursors, which are called proplastids). 

Chloroplasts were identified as the site of photosynthesis in 1881 
in an ingenious experiment by the German biologist T. Engelmann. 
Engelmann illuminated cells of the green alga Spirogyra and found 
that actively moving bacteria would collect outside the cell near the 
site of the large ribbon-like chloroplast (see Figure 1.5). The bacteria 
were utilizing the minute quantities of oxygen released during photo- 
synthesis in the chloroplast for their aerobic respiration. 

The outer covering of a chloroplast consists of an envelope com- 
posed of two membranes separated by a narrow space (Figure 6.30). 
Like the outer membrane of a mitochondrion, the outer membrane 
of a chloroplast envelope contains several different porins (page 
172). Although these proteins have relatively large channels, they 
exhibit some selectivity toward various solutes and thus may not be 
as freely permeable to key metabolites as is often described. The 
inner membrane of the envelope is highly impermeable; substances 
moving through this membrane do so only with the aid of a variety 
of transporters. 
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Much of the photosynthetic machinery of the chloroplast— 
including light-absorbing pigments, a complex chain of electron carriers, 
and an ATP-synthesizing apparatus—is part of an internal membrane 
system that is physically separate from the double-layered envelope. The 
internal membrane of the chloroplast is organized into flattened mem- 
branous sacs, called thylakoids. Thylakoids are arranged in orderly 
stacks called grana (Figures 6.3b and 6.4). The space inside a thylakoid 
sac is the lumen, and the space outside the thylakoid and within the 
chloroplast envelope is the stroma, which contains the enzymes 
responsible for carbohydrate synthesis. The thylakoid membrane con- 
tains the chlorophyll molecules and protein complexes that comprise 
the energy-transducing machinery of the chloroplast. 

Like the matrix of a mitochondrion, the stroma of a chloroplast 
contains small, double-stranded, circular DNA molecules and 
prokaryotic-like ribosomes. As discussed previously, the DNA of the 
chloroplast is a relic of the genome of an ancient bacterial endosym- 
biont. Depending on the organism, chloroplast DNA contains between 
about 60 and 200 genes involved in either gene expression (e.g., 
tRNAs, rRNAs, ribosomal proteins) or photosynthesis. The vast 
majority of the estimated 2000-3500 polypeptides of a plant chloro- 
plast are encoded by the DNA of the nucleus and synthesized in the 
cytosol. These proteins must be imported into the chloroplast by a 
specialized transport machinery (Section 8.21). 

Thylakoid membranes have a high protein content and are 
unusual in having relatively little phospholipid. Instead, these 
membranes have a high percentage of galactose-containing glycolipids 
such as the following: 
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FIGURE 6.4 Thylakoid membranes. Electron micrograph of a section 

through a portion of a chloroplast showing the stacked grana thylakoids, 
which are connected to one another by unstacked stroma thylakoids (or 

stroma lamellae). The dark spheres are osmium-stained lipid granules. 


Source: Dr. Kenneth R. Miller/Photo Researchers, Inc. 


Both fatty acids of these lipids contain several double bonds, 
which makes the lipid bilayer of thylakoid membranes highly fluid. 
The fluidity of the lipid bilayer facilitates lateral diffusion of protein 
complexes through the membrane during photosynthesis. 


REVIEW | 


1. Describe the organization of the membranes of a 
chloroplast. How does this organization differ from 
that of mitochondria? 

2. Distinguish between stroma and lumen, envelope 
membrane and thylakoid membrane, autotrophs and 
heterotrophs. 


el 


6.3 An Overview of 
Photosynthetic Metabolism 


A major advance in our understanding of the chemical reactions of 
photosynthesis came in the early 1930s with a proposal by C. B. van 
Niel, who was then a graduate student at Stanford University. Consider 
the overall equation of photosynthesis as given earlier: 
CO, +H,0 —#°5 (CH,0)+0, 

The prevailing belief in 1930 was that the energy of light was used to 
split CO, releasing molecular oxygen (O,) and transferring the car- 
bon atom to a molecule of water to form a unit of carbohydrate 
(CH,0). In 1931, van Niel proposed an alternate mechanism based 
on his work with sulfur bacteria. It had been demonstrated that these 
organisms were able to reduce CO, to carbohydrate using the energy 
of light without the simultaneous production of molecular O. The 
proposed reaction for sulfur bacteria was 


CO, +2 H,S —t#» (CH,O)+H,0+25 


Postulating a basic similarity in the photosynthetic processes of all 
organisms, van Niel proposed a general reaction to include all these 
activities: 


CO, +2H,A —» (CH,O)+H,0+2 A 


For the production of a hexose, such as glucose, the reaction would be 


6 CO, +12H,A 8» C,H,,0, +6 H,0+12 A 


Van Niel recognized that photosynthesis was essentially a process of 
oxidation-reduction. In the preceding reaction, H,A is an electron 
donor (reducing agent) and can be represented by H,O, H,S, or 
other reduced substrates utilized by various types of bacteria. Carbon 
dioxide, however, is an oxidizing agent, which, in a plant cell, is 
reduced to form hexose in the following reaction: 


6CO, +12 3,0 #5 C_.H.,0,+6H,0+60 
2 2 677126 2 2 


In this scheme, each molecule of oxygen is derived not from CO, 
but from the breakdown of two molecules of H,O, a process driven 
by the absorption of light. The role of water in the formation of 
molecular oxygen was clearly established in 1941 by Samuel Ruben 


and Martin Kamen of the University of California, Berkeley. These 
investigators carried out experiments on suspensions of green algae 
using a specially labeled isotope of oxygen, '*O, as a replacement for 
the common isotope, '°O. One sample of algae was exposed to labeled 
C['80,] and unlabeled water, while another sample was exposed to 
unlabeled carbon dioxide and labeled H,['%O]. The investigators 
asked a simple question: Which of these two samples of photosyn- 
thetic organisms released labeled '*O,? The algae given labeled water 
produced labeled oxygen, demonstrating that O, produced during 
photosynthesis derived from H,O. The algae given labeled carbon 
dioxide produced unlabeled oxygen, confirming that O, was not 
being produced by a chemical splitting of CO,. Contrary to popular 
belief, it wasn’t carbon dioxide that was being split into its two atomic 
components, but water. Van Niel’s hypothesis had been confirmed. 

The van Niel proposal placed photosynthesis in a different per- 
spective; it became, in essence, the reverse of mitochondrial respira- 
tion. Whereas respiration in mitochondria reduces oxygen to water, 
photosynthesis in chloroplasts oxidizes water to oxygen. The former 
process releases energy, so the latter process must require energy. An 
overview of the thermodynamics of photosynthesis and aerobic res- 
piration is indicated in FIGURE 6.5. Many similarities between these 
two metabolic activities will be apparent in the following pages. 

The events of photosynthesis can be divided into two series of 
reactions. During the first stage, the light-dependent reactions, 
energy from sunlight is absorbed and stored as chemical energy in 
two key biological molecules: ATP and NADPH. As discussed in 
Chapter 3, ATP is the cell’s primary source of chemical energy, and 
NADPH is its primary source of reducing power. During the second 
stage, the light-independent reactions (or “dark reactions” as 
they are often called), carbohydrates are synthesized from carbon 
dioxide using the energy stored in the ATP and NADPH molecules 
produced in the light-dependent reactions. In actual fact, these “dark 
reactions” take place much more rapidly when the light-dependent 
reactions are also occurring (see page 216). It is estimated that 
plant life on Earth converts approximately 500 trillion kg of CO, to 
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carbohydrate each year, an amount roughly 10,000 times the world’s 203 


yearly beef production. 
We will begin with the light-dependent reactions, which are 
complex and remain incompletely understood. 


REVIEW | 


1. In what ways is photosynthesis the reverse of 
respiration? 

2. In what ways is nonoxygenic photosynthesis that uses 
HS as an electron source similar to oxygenic 
photosynthesis that uses HO as an electron source? 
In what ways are they different? 


3. ln general terms, how do the light-independent reactions 
differ from the light-dependent reactions? What are the 
primary products of the two types of reactions? 


oe _! 


6.4 The Absorption of Light 


Light travels in packets (or quanta) of energy called photons, which 
can be thought of as “particles” of light.” The energy content of a pho- 
ton depends on the wavelength of the light according to the equation 


E=hcla 


where h is Planck’s constant (1.58 x 10“ calsec), c is the speed of 
light in a vacuum, and ( is the wavelength of light. The shorter the 
wavelength, the higher the energy content. One mole (6.02 x 10”*) of 
photons of 680 nm wavelength, which is an important wavelength in 
photosynthesis, contains approximately 42 kcal of energy, which is 


°The idea that electromagnetic radiation (e.g., light) has both wave-like and particle- 
like properties emerged at the start of the twentieth century from the work of Planck, de 
Broglie, and Einstein, and marked the beginning of the study of quantum mechanics. 
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FIGURE 6.5 An overview of the energetics of photosynthesis and aerobic respiration. 
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(calculated by dividing 42 kcal by the Faraday constant of 23.06 kcal/V). 

The absorption of light is the first step in any photochemical 
process. When a photon is absorbed by a molecule, an electron 
becomes sufficiently energetic to be pushed from an inner to an 
outer orbital. The molecule is said to have shifted from the ground 
state to an excited state. Because the number of orbitals in which 
an electron can exist is limited and each orbital has a specific energy 
level, it follows that any given atom or molecule can absorb only light 
of certain specific wavelengths. 

The excited state of a molecule is unstable and can be expected 
to last only about 10~’ second. Several consequences can befall an 
excited electron, depending on the circumstances. Consider a mol- 
ecule of chlorophyll, which is the most important light-absorbing 
photosynthetic pigment. If the electron of an excited chlorophyll 
molecule drops back to a lower orbital, the energy it had absorbed 
must be released. If the energy is released in the form of light (fluo- 
rescence or phosphorescence) or heat, the chlorophyll has returned 
to the original ground state and the energy of the absorbed photon 
has not been utilized. This is precisely what is observed when a prep- 
aration of isolated chlorophyll in solution is illuminated: The solution 
becomes strongly fluorescent because the absorbed energy is reemit- 
ted at a longer (i.e., lower energy) wavelength. However, if the same 
experiment is performed on a preparation of isolated chloroplasts, 
only a faint fluorescence is observed, indicating that very little of the 
absorbed energy is dissipated. Instead, the excited electrons of chlo- 
rophyll molecules are transferred to electron acceptors within the 
chloroplast membranes before they have a chance to drop back to 
lower energy orbitals. Thus, chloroplasts are able to harness the 
absorbed energy before it dissipates. 

Pigments are compounds that appear colored because they 
only absorb light of particular wavelength(s) within the visible spec- 
trum. Leaves are green because their chloroplasts contain large quan- 
tities of the pigment chlorophyll, which absorbs most strongly in the 
blue and red, leaving the intermediate green wavelengths to be 
reflected to our eyes. The structure of chlorophyll is shown in 
FIGURE 6.6. Each molecule consists of two parts: (1) a porphyrin ring 
that functions in light absorption and (2) a hydrophobic phytol chain 
that keeps the chlorophyll embedded in the photosynthetic mem- 
brane. Unlike the red, iron-containing porphyrins (heme groups) of 
hemoglobin and myoglobin, the porphyrin in a chlorophyll molecule 
contains a magnesium atom. The alternating single and double bonds 
along the edge of the porphyrin ring delocalize electrons, which form 
a cloud around the ring (Figure 6.6). Systems of this type are said to 
be conjugated and are strong absorbers of visible light. The absorbed 
energy causes a redistribution of the electron density of the molecule, 
which in turn favors loss of an electron to a suitable acceptor. The 
conjugated bond system also broadens the absorption peaks, ena- 
bling individual molecules to absorb energy of a range of wavelengths. 
These features are evident in an absorption spectrum of purified 
chlorophyll molecules (FIGURE 6.7). An absorption spectrum is a 
plot of the intensity of light absorbed relative to its wavelength. The 
range of wavelengths absorbed by photosynthetic pigments situated 
within the thylakoids is further increased because the pigments are 
noncovalently associated with a variety of different polypeptides. 

Several classes of chlorophyll, differing from one another in the 
side groups attached to the porphyrin ring, occur among photosyn- 
thetic organisms. The structures of these pigments are indicated in 
Figure 6.6. Chlorophylls are the primary light-absorbing photosyn- 
thetic pigments, but terrestrial plants also contain orange and red 
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FIGURE 6.6 The structure of chlorophyll a. The molecule consists of a 
porphyrin ring (which in turn is constructed of four smaller pyrrole rings) 
with a magnesium ion at its center and a long hydrocarbon tail. The green 
shading around the edge of the porphyrin indicates the delocalization of 
electrons that form a cloud. The structure of the magnesium-containing 
porphyrin of chlorophyll can be compared to the iron-containing porphyrin 
of a heme shown in Figure 5.12. Chlorophyll b and bacteriochlorophyll 

a contain specific substitutions as indicated. For example, the —CH3 group 
on ring Il is replaced by a——CHO group in chlorophyll b. Chlorophyll a is 
present in all oxygen-producing photosynthetic organisms, but it is absent 
in the various sulfur bacteria. In addition to chlorophyll a, chlorophyll b is 
present in all higher plants and green algae. Others not shown are 
chlorophyll c, present in brown algae, diatoms, and certain protozoa, and 
chlorophyll d, found in red algae. Bacteriochlorophyll is found only in green 
and purple bacteria, organisms that do not produce O2 during photosynthesis. 


accessory pigments called carotenoids, including B-carotene, which 
contain a linear system of conjugated double bonds: 
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FIGURE 6.7 Absorption spectrum for several photosynthetic pigments 
of higher plants. The background shows the colors that we perceive for 
the wavelengths of the visible spectrum. Chlorophylls absorb most strongly 
in the violet, blue, and red regions of the spectrum, while carotenoids (e.g., 
B-carotene) also absorb into the green region. Red algae and cyanobacteria 
contain additional pigments (phycobilins) that absorb in the middle bands 
of the spectrum. 


Carotenoids absorb light primarily in the blue and green region of the 
spectrum (Figure 6.7), while reflecting those of the yellow, orange, and 
red regions. Carotenoids produce the characteristic colors of carrots 
and oranges and the leaves of some plants during the fall. Carotenoids 
have multiple functions: They act as secondary light collectors during 
photosynthesis, and they draw excess energy away from excited chlo- 
rophyll molecules and dissipate it as heat. If this excess energy were 
not absorbed by carotenoids, it could be transferred to oxygen, pro- 
ducing an ultrareactive form of the molecule called singlet oxygen 
('O,) that can destroy biological molecules and cause cell death. 
Because the light falling on a leaf is composed of a variety of 
wavelengths, the presence of pigments with varying absorption 
properties ensures that a greater percentage of incoming photons 
will stimulate photosynthesis. This can be seen by examining an 
action spectrum (FIGURE 6.8), which is a plot of the relative rate 


Chlorophyll a 


Action spectrum 


Absorption 
spectrum 


‘"-— B-carotene 


Chlorophyll b —! 


Relative light absorption (——) 
Relative photochemical efficiency (——) 


Wavelength, nm 


FIGURE 6.8 Action spectrum for photosynthesis. The action spectrum 
(red-colored line) indicates the relative efficiency with which light of various 
wavelengths is able to promote photosynthesis in the leaves of a plant. An 
action spectrum can be generated by measuring the Oz produced by the 
leaves following exposure to various wavelengths. The black lines indicate 
the absorption spectra of each of the major photosynthetic pigments. The 
green line shows the combined absorption spectrum of all pigments. 


(or efficiency) of photosynthesis produced by light of various 205 


wavelengths. Unlike an absorption spectrum, which simply measures 
the wavelengths of light that are absorbed by particular pigments, an 
action spectrum identifies the wavelengths that are effective in 
bringing about a given physiologic response. The action spectrum 
for photosynthesis follows the absorption spectrum of chlorophylls 
and carotenoids fairly closely, reflecting the participation of these 
pigments in the photosynthetic process. 


REVIEW | 
1. What is the difference between an absorption 
spectrum and an action spectrum? 


2. Compare the structure, absorption, and function of 
chlorophylls and carotenoids. 
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6.5 Coordinating the Action 
of Two Different 
Photosynthetic Systems 


In 1932, Robert Emerson and William Arnold of the California 
Institute of Technology carried out an experiment suggesting that 
not all of the chlorophyll molecules in a chloroplast were directly 
involved in conversion of light energy into chemical energy. Using 
suspensions of the green alga Chlorella and flashing lights of 
extremely short duration at saturating intensity, they determined the 
minimum amount of light needed to produce maximal O, produc- 
tion during photosynthesis. Based on the number of chlorophyll 
molecules present in the preparation, they calculated that one mol- 
ecule of oxygen was being released during a brief flash of light for 
every 2500 molecules of chlorophyll present. Later, Emerson showed 
that a minimum of eight photons must be absorbed to produce one 
molecule of Oj, which meant that chloroplasts contain approxi- 
mately 300 times as many chlorophyll molecules as would appear to 
be necessary to oxidize water and generate Op. 

One possible interpretation of this finding is that only a very 
small percentage of chlorophyll molecules is involved in photosyn- 
thesis. This is not the case, however. Rather, several hundred chloro- 
phyll molecules act together as one photosynthetic unit in which 
only one member of the group—the reaction-center chlorophyll— 
actually transfers electrons to an electron acceptor. Even though the 
bulk of the pigment molecules do not participate directly in the con- 
version of light energy into chemical energy, they are responsible for 
light absorption. These pigment molecules form a light-harvesting 
antenna that absorbs photons of varying wavelength and transfers 
that energy (called excitation energy) very rapidly to the pigment 
molecule at the reaction center. 

The transfer of excitation energy from one pigment molecule to 
another is very sensitive to the distance between the molecules. The 
chlorophyll molecules of an antenna are kept in close proximity (less 
than 1.5 nm separation) and proper orientation by noncovalent link- 
age to integral membrane polypeptides. A “rule” that operates among 
antenna pigments is that energy can only be transferred to an equal 
or less energy-requiring molecule. In other words, energy can only 
be passed to a pigment molecule that absorbs light of equal or longer 
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FIGURE 6.9 The transfer of excitation energy. Energy is transferred 
randomly through a network of pigment molecules that absorb light of 
increasingly longer wavelength until the energy reaches a reaction-center 
chlorophyll, which transfers an excited electron to a primary acceptor, as 
described later in the chapter. 


wavelength (lower energy) than that absorbed by the donor mole- 
cule. As the energy “wanders” through a photosynthetic unit 
(FIGURE 6.9), it is repeatedly transferred to a pigment molecule that 
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absorbs at a longer wavelength. The energy is ultimately transferred 
to a chlorophyll of the reaction center, which absorbs light of longer 
wavelength than any of its neighbors. Once the energy is received by 
the reaction center, the electron excited by light absorption can be 
transferred to its acceptor. 

The evolution of organisms capable of using H,O as an electron 
source was accompanied by major changes in the photosynthetic 
machinery. The reason for these changes is revealed by considering 
the energetics of oxygenic (O-releasing) photosynthesis. The O,- 
H,0 couple has a standard redox potential of +0.82 V, whereas that 
of the NADP*-NADPH couple is —0.32 V (Table 5.1). The difference 
between the redox potentials of these two couples (1.14 V) provides a 
measure of the minimal energy that must be absorbed by the system 
to remove an electron from H,O and pass it to NADP* under standard 
conditions. However, cells don’t operate under standard conditions, 
and the transfer of electrons from HO to NADP* requires more than 
the minimal input of energy. It is estimated that, during actual opera- 
tions in the chloroplast, more than 2 V of energy are utilized to carry 
out this oxidation-reduction reaction. (This value of 2 V is estimated 
from the left-hand scale of FIGURE 6.10, which extends from below 
+1 V to above —1 V.) It was noted on page 204 that one mole of photons 
of 680 nm wavelength (red light) is equivalent to a change in redox 
potential of 1.8 V. Thus, while it is theoretically possible for one 
photon of red light to boost an electron to an energy level required to 
reduce NADP* under standard conditions (i.e., 1.14 V), the process is 
accomplished in the cell through the combined action of two different 
light-absorbing reactions. 

The light-absorbing reactions of photosynthesis occur in large 
pigment-protein complexes called photosystems. Two types of 
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FIGURE 6.10 An overview of the flow of electrons 
during the light-dependent reactions of photosynthesis. 
The events depicted in this schematic drawing are described 
Light in detail in the following pages. The energy content of 
electrons is given in volts. To convert these values to 
calories, multiply by the Faraday constant, 23.06 kcal/V. For 
example, a difference of 2.0 V corresponds to an energy 
difference of 46 kcal/mol of electrons. This can be com- 
pared to the energy of red light (680 nm), which contains 


about 42 kcal/mol of photons. 


photosystems are required to catalyze the two light-absorbing 
reactions utilized in oxygenic photosynthesis. One photosystem, 
photosystem II (PSII), boosts electrons from an energy level below 
that of water to a midway point (Figure 6.10). The other photosys- 
tem, photosystem | (PSI), raises electrons from a midway point to 
an energy level well above that of NADP*. The two photosystems 
act in series, that is, one after the other. Even though they mediate 
distinctly different photochemical reactions, the two types of photo- 
systems in plants, as well as those of photosynthetic bacterial cells, 
exhibit marked similarities in protein composition and overall archi- 
tecture. These shared properties suggest that all photosynthetic reac- 
tion centers have evolved from a common ancestral structure that 
has been conserved for more than three billion years. 

The reaction center of photosystem II is a chlorophyll dimer 
referred to as P680, “P” standing for “pigment” and “680” standing 
for the wavelength of light that this particular pair of chlorophylls 
absorbs most strongly. The reaction center of photosystem I is also 
a chlorophyll dimer and is referred to as P700 for comparable rea- 
sons. When sunlight strikes a thylakoid membrane, energy is 
absorbed by antenna pigments of both PSH and PSI and passed to the 
reaction centers of both photosystems. Electrons of both reaction- 
center pigments are boosted to an outer orbital, and each electron is 
transferred to a primary electron acceptor. After losing their 
electrons, the reaction-center chlorophylls of PSII and PSI become 
positively charged pigments referred to as P680* and P700", respec- 
tively. The electron acceptors, in turn, become negatively charged. 
In essence, this separation of charge within the photosystems is the 
light reaction—the conversion of light energy into chemical energy. 
Positively charged reaction centers act as attractants for electrons, 
and negatively charged acceptors act as electron donors. 
Consequently, the separation of charge within each photosystem 
sets the stage for the flow of electrons along a chain of specific 
carriers. 


REVIEW | 


1. What is the relationship between the energy content 
of a photon and the wavelength of light? How does 
the wavelength of light determine whether it will 
stimulate photosynthesis? How do the absorbance 
properties of photosynthetic pigments determine the 
direction in which energy is transferred within a 
photosynthetic unit? 


2. What is the role in photosynthesis of the light- 
harvesting antenna pigments? 


3. How does energy from photons absorbed by antenna 
pigments make it to the reaction center? 


4. Photosystems | and II boost electrons to higher 
energy in two different steps. Which of the two 
boosts electrons from the lowest energy to a mid- 
point, and which then boosts electrons from the mid- 
point to the highest energy level? If we look at the 
names of the photosystems (I and I!) does their 
numerical order correspond to the two successive 
increases in electron energy, or is the order 
backwards? 


6.6 The Operations of 
Photosystem II and Photosystem | 


In oxygenic photosynthesis, where two photosystems act in series, elec- 
tron flow occurs along three legs—from water to PSII, from PSII to PSI, 
and from PSI to NADP* —an arrangement described as the Z scheme. 
The broad outline of the Z scheme is illustrated in Figure 6.10; we will 
fill in the names of some of the specific components as we examine each 
of the major parts of the pathway. Like the members of the respiratory 
chain of mitochondria (Chapter 5), most of the electron carriers of the 
Z scheme are found as part of large membrane protein complexes (see 
Figure 6.16). Over the years, researchers have generated increasingly 
detailed portraits of the structures of these complexes. These efforts 
have culminated in the past few years with the publication of X-ray 
crystallographic structures of both PSI and PSII by several laboratories 
at increasingly higher resolution. These studies form the basis for the 
structures of the photosystems shown in FIGURES 6.11, 6.15, and 6.16. 

As in mitochondria, electron transfer releases energy, which is 
used to establish a proton gradient, which in turn drives the synthe- 
sis of ATP. As discussed in section 6.9, ATP produced in the chloro- 
plast is used primarily within the organelle in the synthesis of 
carbohydrates; ATP utilized outside the chloroplast is derived largely 
from that produced in plant cell mitochondria. 


PSII Operations: Obtaining Electrons 
by Splitting Water 


Photosystem II uses absorbed light energy to generate a proton gradi- 
ent across the thylakoid membrane. The PSII of plant cells is a complex 
of more than 20 different polypeptides, most of which are embedded in 
the thylakoid membrane. Two of these proteins, designated D1 and 
D2, are particularly important because together they bind the P680 
reaction-center chlorophyll dimer and all of the cofactors involved in 
electron transport through the photosystem (Figure 6.11). 

The first step in PSII activation is the absorption of light by an 
antenna pigment. Most of the antenna pigments that collect solar 
energy for PSII reside within a separate pigment protein complex, 
called light-harvesting complex II or, simply, LHCII. LHCII pro- 
teins bind both chlorophylls and carotenoids and are situated out- 
side the core of the photosystem (Figure 6.11a). 


THE FLOW OF ELECTRONS FROM PSII TO PLASTO- 
QUINONE. Excitation energy is passed from the outer-antenna 
pigments of LHCII to a small number of inner-antenna chlorophyll 
molecules situated within the core of PSII. From there, the energy is 
ultimately passed to the PSII reaction center. The excited reaction- 
center pigment (P680*) responds by transferring a single photo- 
excited electron to a closely associated chlorophyll-like pheophytin 
molecule (“Pheo” for short, see step 1, Figure 6.11a), which is the 
first electron acceptor in a whole series of sequential electron trans- 
fer steps. This first electron transfer generates a separation of charge 
in PSII between a positively charged donor (P680* ) and a negatively 
charged acceptor (Pheo ). 

The importance of the formation of two oppositely charged spe- 
cies, P680* and Pheo , becomes more apparent when we consider 
the oxidation-reduction capabilities of these two species. P680° is 
electron-deficient and can accept electrons, making it an oxidizing 
agent. In contrast, Pheo” has an extra electron that it will readily lose, 
making it a reducing agent. This event—the light-driven formation 
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FIGURE 6.11 The functional organization of photosystem Il. (a) A schematic model of the huge protein-pigment complex, which catalyzes the light- 
driven oxidation of water and reduction of plastoquinone. The path taken by electrons through PSII is indicated by the yellow arrows. Events begin with 

the absorption of light by an antenna pigment in the outer light-harvesting complex (LHCII). Energy is transferred from LHCII through an inner antenna 
pigment-protein complex to a P680 reaction-center chlorophyll a, which is one of four closely spaced chlorophyll a molecules (the P680 dimer and two 
accessory chlorophyll a molecules). Absorption of this energy by P680 excites an electron, which is transferred to pheophytin (Pheo) (step 1), the primary 
electron acceptor of PSII. (Pheophytin is a chlorophyll molecule that lacks the Mg** ion). The electron subsequently passes to a plastoquinone PQy (step 2) 
and then through a nonheme Fe?* to POs (step 3) to form a negatively charged free radical PO3. Absorption of a second photon sends a second electron 
along the same pathway, converting the acceptor to PQ¢ (step 4). Two protons then enter from the stroma (step 5) generating POH2, which is released into 
the lipid bilayer and replaced by a new oxidized POs molecule (step 6). As the above events are occurring, electrons are moving from H20 by way of Tyr, to 
the positively charged reaction-center pigment (steps B and A). Thus, overall, PSII catalyzes the transfer of electrons from water to plastoquinone. The 
oxidation of two molecules of HzO to release one molecule of Oz generates two molecules of POH2. Because the oxidation of water releases protons into 
the thylakoid lumen and the reduction of PQS removes protons from the stroma, the operation of PSII makes a major contribution to formation of a H* 
gradient. The figure shows one monomer of a dimeric PSII complex. (6) The oxygen-evolving complex contains a MngCaOs cluster whose structure has been 
determined to a resolution of 1.9A by X-ray crystallography. Three Mn atoms (#1-3), a Ca atom, and four O atoms (#1, 2, 3, 5) are arranged as part of an 
asymmetric cube-like cluster with a bridge to a fourth Mn and fifth O atom at nearby sites. Two water molecules are bound to the Ca and two more are 
bound to Mn4. It is likely that reaction between the oxygen atoms of two of these bound water molecules leads to formation of an O==O bond. 


Source: (b) From Yasufumi Umena, et al., Nature 473:57, 2011; Courtesy Nobuo Kamiya. © 2011, Reprinted with permission from Macmillan Publishers Ltd. 


of an oxidizing agent and a reducing agent—takes less than one 
billionth of a second and is the essential first step in photosynthesis. 

Because they are oppositely charged, P680* and Pheo’ exhibit 
an obvious reactivity with one another. Interaction between the 
oppositely charged species is prevented by moving the separated 
charges farther apart, ultimately to opposite sides of the membrane, 
by passage through several different sites. 

Pheo transfers its electron (step 2, Figure 6.11a) to a molecule of 
plastoquinone (designated PQ, in Figure 6.11a) bound near the outer 
(stromal) side of the membrane. Plastoquinone (PQ) is a lipid-soluble 
molecule (FIGURE 6.12) similar in structure to ubiquinone (see Figure 
5.12c). The electron from PQ, is transferred (step 3, Figure 6.11a) to 
a second plastoquinone (designated PQ, in Figure 6.11) to produce 
a semireduced form of the molecule (PQ, ) that remains firmly 
bound to the D1 protein of the reaction center. With each of these 
transfers, the electron moves closer to the stromal side of the 
membrane. 

The positively charged pigment (P680") is reduced back to P680 
(as described below), which primes the reaction center for the absorp- 
tion of another photon, which sends a second energized electron 
along the path from P680 to pheophytin to PQ, to (PQ, ), forming 
(PQ; ) (step 4, Figure 6.11), which combines with two protons to 
form plastoquinol, PQH), (step 5, Figure 6.11la; Figure 6.12). The 


protons utilized in the formation of PQH, are derived from the 
stroma, causing a decrease in H* concentration of the stroma, which 
contributes to formation of the proton gradient. The reduced PQH, 
molecule dissociates from the D1 protein and diffuses into the lipid 
bilayer. The displaced PQH), is replaced by a fully oxidized PQ 
molecule derived from a small “pool” of plastoquinone molecules in 
the bilayer (step 6, Figure 6.11a). We can summarize this portion of 
the PSH reaction with the equation 


2e°+PQ+2H* epee 


stroma 


PQH, 


As we will see shortly, the oxidation of a molecule of water by PSII 
requires four photons, so we can better describe this portion of the 
PSII reaction with the equation 


4e+2PQ+4Ht = AOS 


stroma 


2 PQH, 


We will follow the fate of the electrons (and protons) carried by 
PQH, in a following section. 


THE FLOW OF ELECTRONS FROM WATER TO PSII 
All these electrons come from water and are released when water is 
split into protons and molecular oxygen. But water is a very stable 
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FIGURE 6.12 Plastoquinone. The acceptance of two electrons and two 
protons reduces PQ (plastoquinone) to POH) (plastoquinol). The intermedi- 
ates are similar to those shown in Figure 5.12c for ubiquinone of the 
mitochondrion. 


molecule made up of tightly held hydrogen and oxygen atoms. In 
fact, the splitting of water is the most thermodynamically challenging 
(endergonic) reaction known to occur in living organisms. Splitting 
water in a laboratory requires the use of a strong electric current or 
temperatures approaching 2000°C. Yet a plant cell can accomplish this 
feat on a snowy mountainside using only the energy of visible light. 

We saw in the previous section how the absorption of light by 
PSII leads to the formation of two charged molecules, P680* and 
Pheo™. We have followed the route taken by the excited electron 
associated with Pheo™; now we turn to the other species, P680", 
which is the most powerful oxidizing agent yet to be discovered in a 
biological system. The redox potential of the oxidized form of P680 
is sufficiently strong to pull tightly held (low-energy) electrons from 
water (redox potential of +0.82 V), thus splitting the molecule. The 
splitting of water during photosynthesis is called photolysis. The 
formation of one molecule of oxygen during photolysis is thought to 
require the simultaneous loss of four electrons from two molecules of 
water according to the reaction 


4 photons 4 Ht 


2 H,0 lumen +O, pee” 


Yet one PSII reaction center can only generate one positive charge 
(P680~), or oxidizing equivalent, at a time. A solution to this prob- 
lem was proposed around 1970 by Pierre Joliot and Bessel Kok as the 
S-state hypothesis, which allows the photosystem to accumulate the 
four oxidizing equivalents needed to oxidize water. Closely associated 
with the D1 protein of PSII at its luminal surface is a cluster of five 
metal atoms—four manganese (Mn) atoms and one calcium (Ca) 
atom—that is stabilized and protected by a number of peripheral 
proteins that form the oxygen-evolving complex (Figure 6.11a). The 
geometric organization of the Mn-Ca cluster is shown in Figure 6.11b 
and discussed in the accompanying legend. The Mn-Ca cluster 


accumulates four oxidizing equivalents by transferring four electrons, 209 


one at a time, to the nearby P680°. The transfer of each electron 
from the Mn-Ca cluster to P680* (steps B and A of Figure 6.11) is 
accomplished by passage through an intermediate electron carrier, a 
tyrosine residue on the D1 protein, termed Tyr,. After each electron 
is transferred to P680*, regenerating P680, the pigment becomes 
reoxidized (back to P680°) following the absorption of another photon 
by the photosystem. Thus, the stepwise accumulation of four oxidiz- 
ing equivalents by the Mn-Ca cluster is driven by the successive 
absorption of four photons of light by the PSII photosystem. Once 
this has occurred, the system can now catalyze the removal of 4e” 
from two closely bound H,O molecules (Figure 6.110), as indicated 
in the following manner: 


4Ht++0O, 2H,0 
4e< cer 


So WSS, WSS, WSS, ss, 


where the subscript on the S indicates the number of oxidizing 
equivalents stored by the Mn-Ca cluster. Evidence for the accumula- 
tion of successive oxidizing equivalents was first obtained by expo- 
sure of algal cells to very brief (1 sec) flashes of light (FIGURE 6.13). 
It can be seen in this plot that O, production peaks after every fourth 
flash of light, indicating that the effect of four individual photoreac- 
tions must accumulate before O, can be released. 

The protons produced in the photolysis reaction are retained in 
the thylakoid lumen (Figure 6.11), where they contribute to the pro- 
ton gradient. The four electrons produced in the photolysis reaction 
serve to regenerate the fully reduced Mn-Ca cluster (Sp state), while 
the O, is released as a waste product into the environment. 

Before leaving the subject of PSII, it can be noted that the activ- 
ity and integrity of this photosystem can be negatively affected by 
high light intensity. This phenomenon is termed photoinhibition. 
The formation of a very strong oxidizing agent and the ever-present 
danger of forming highly toxic oxygen species give PSII the potential 
for its own self-destruction as the result of overexcitation of the sys- 
tem. Most of the damage appears to be targeted to the polypeptide 
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FIGURE 6.13 Measuring the kinetics of O2 release. The plot shows the 
response by isolated chloroplasts that have been kept in the dark to a 
succession of light flashes of very short duration. The amount of oxygen 
released peaks with every fourth flash of light. The first peak occurs 

after three flashes (rather than four) because most of the manganese 
cluster is present in the S, state (one oxidizing equivalent) when kept in 
the dark. The oscillations become damped as the number of flashes 
increases. 
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photosystem. Chloroplasts contain a mechanism for the selective 
proteolytic degradation of D1 and its replacement by a newly syn- 
thesized polypeptide molecule. 


FROM PSII TO PSI _ It was described earlier how the successive 
absorption of two photons by the reaction center of PSII leads to the 
formation of a fully reduced PQH, molecule. Consequently, the pro- 
duction of a single O, molecule, which requires the absorption of 
four photons by PSII, leads to the formation of two molecules of 
PQH). PQH)j is a mobile electron carrier that diffuses through the 
lipid bilayer of the thylakoid membrane and binds to a large multi- 
protein complex called cytochrome b,f (Figure 6.14). Each PQH) 
molecule donates its two electrons to cytochrome b, f, while its two 
protons are released into the lumen. Cytochrome b, f is related in 
structure and function to cytochrome bc, of the electron transport 
chain of mitochondria (page 185). Both complexes have quinols as 
substrates and share similar redox groups, both can be poisoned by 
some of the same inhibitors, and both engage in a Q cycle that trans- 
locates 4 H* for each pair of electrons similar to complex III of the 
mitochondrion (see Figure 5.17). In this case, two H* are donated by 
PQH,, and two additional H™ are translocated through the complex 
from the stroma. Because all of these protons had originally been 
derived from the stroma, their release into the lumen constitutes a 
movement of protons across the thylakoid membrane (see Figure 
6.16b). The electrons from cytochrome b,f are passed to another 
mobile electron carrier, a water-soluble, copper-containing periph- 
eral membrane protein called plastocyanin, situated on the luminal 
side of the thylakoid membrane (FIGURE 6.14). Plastocyanin carries 
electrons to the luminal side of PSI, where they are transferred to 
P700*, the positively charged reaction-center pigment of PSI, thus 
completing the link between PSII and PSI. 

Keep in mind that all of the electron transfers described in this 
discussion are exergonic and occur as electrons are passed to carriers 
with increasing affinity for electrons (more positive redox potentials, 
page 179). The need for mobile electron carriers, such as PQH, and 
plastocyanin, became evident when it was discovered that the two 
types of photosystems (PSII and PSI) are not situated in close prox- 
imity to one another in the membrane but rather are separated by 
distances on the order of 0.1 um. 


2H+ 


FIGURE 6.14 Electron transport between PSII and PSI. The flow of a 
pair of electrons is indicated by the yellow arrow. Cytochrome bef operates 
in amanner very similar to that of cytochrome bc, in the mitochondria and 
engages in a O cycle (not discussed in the text) that translocates four 
protons for every pair of electrons that moves through the complex. PQH2 
and PC (plastocyanin) are mobile carriers that can transport electrons 
between distantly separated photosystems. 


PSI Operations: The Production 
of NADPH 


The PSI of higher plants consists of a reaction-center core made up of 
12-14 polypeptide subunits and a peripheral complex of protein- 
bound pigments called LHCI. Light energy is absorbed by the antenna 
pigments of LHCI and passed to the PSI reaction-center pigment 
P700, which is a chlorophylla dimer (FIGURE 6.15). Following energy 
absorption, an excited reaction-center pigment (P700*) transfers an 
electron to a separate monomeric chlorophyll a molecule (designated 
Ag), which acts as the primary electron acceptor (step 1, Figure 6.14). 
As in PSII, absorption of light leads to the production of two charged 
species, in this case P700 and Ap. A is avery strong reducing agent 
with a redox potential of approximately -1.0V, which is well above 
that needed to reduce NADP" (redox potential of —0.32 V). The posi- 
tive charge of P700* pigment is neutralized by an incoming electron 
donated by plastocyanin, as noted earlier. 

The initial separation of charge in PSI is stabilized by passage of 
the electron from Aj through several cofactors beginning with a type 
of quinone called phylloquinone (designated A,) and then three iron- 
sulfur clusters (designated F,, F,, and F,) (steps 2-4, Figure 6.15). 


H* + NADP* 


NADPH 
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FIGURE 6.15 The functional organization of photosystem I. The path 
taken by electrons is indicated by the yellow arrow. Events begin with the 
absorption of light by an antenna pigment and transfer of the energy to a 
P700 chlorophyll at the PSI reaction center. Absorption of energy by P700 
causes the excitation of an electron and its transfer (step 1) to Ao, which is 
the primary electron acceptor of PSI. The electron subsequently passes to 
A, (step 2) and then to an iron-sulfur center named Fy (step 3). From Fy the 
electron is transferred (step 4) through two more iron-sulfur centers (F, and 
Fz), which are bound by a peripheral protein at the stromal side of the 
membrane. The electron is finally transferred to ferredoxin, a small iron- 
sulfur protein (step 5) that is external to the PSI complex. When two 
different ferredoxin molecules have accepted an electron, they act together 
to reduce a molecule of NADP* to NADPH (step 6). The electron-deficient 
reaction-center pigment (P700") is reduced by an electron donated by 
plastocyanin (step A). 


The oxidation of P700 to P700* occurs at the luminal side of the mem- 
brane. As indicated in Figure 6.15, the electron that is lost to the pri- 
mary acceptor passes through PSI to the iron-sulfur centers bound at 
the stromal side of the membrane. The electron is subsequently trans- 
ferred out of PSI to a small, water-soluble, iron-sulfur protein called 
ferredoxin (step 5, Figure 6.15) associated with the stromal surface of 
the membrane. The reduction of NADP* to form NADPH (step 6, 
Figure 6.15) is catalyzed by a large enzyme called ferredoxin-NADP* 
reductase, which contains an FAD prosthetic group capable of accept- 
ing and transferring two electrons (page 180). An individual ferre- 
doxin molecule can donate only one electron, so that two ferredoxins 
act together in the reduction: 


Ferredoxin—NADP* Reductase 


2 Ferredoxin,,,; + H* + NADP* 
2 Ferredoxin,,. + NADPH 


The removal of a proton from the stroma also adds to the proton 
gradient across the thylakoid membrane. We can write the overall 
reaction for PSI, based on the absorption of four photons as was done 
for PSII, as 


4 photons 


4e°+2H. +2 NADP* 2 NADPH 


stroma 


Not all electrons passed to ferredoxin inevitably end up in 
NADPH; alternate routes can be taken depending on the particular 
organism and conditions. For example, electrons from PSI can be 
used to reduce various inorganic acceptors. These paths for electrons 
can lead to the eventual reduction of nitrate (NO;) to ammonia 
(NH;), or of sulfate (SO7-) to sulfhydryl (—SH), which are key 
ingredients of biological molecules. Thus, the energy of sunlight is 
used not only to reduce the most oxidized carbon atoms (those in CO3), 
but also to reduce highly oxidized forms of nitrogen and sulfur atoms. 


REVIEW | 


1. Describe the sequence of events following the 
absorption of a photon of light by the reaction-center 
pigment of photosystem II. Describe the comparable 
events in photosystem |. How are the two 
photosystems linked to one another? 

2. Describe the difference in the redox potentials of the 
reaction-center pigments of the two photosystems. 

3. Describe the process by which water is split during 
photolysis. How many photons must be absorbed by 
PSII for this to occur? 


eo | 


6.7 An Overview of 
Photosynthetic Electron 
Transport 


FIGURE 6.16a shows the molecular structure of the major compo- 
nents involved in the light-dependent reactions of the thylakoid 
membrane. Knowledge of these structures has proven invaluable in 
unlocking many of the mysteries of photosynthesis at the molecular 
level. If we look back over the entire process of electron transport that 


takes place during oxygenic photosynthesis (summarized in 211 


Figure 6.16b), we can see that electrons travel from water to NADP* 

by the action of two light-absorbing photosystems. Events occurring 
in PSII generate a strong oxidizing agent capable of producing O, 
from water, whereas events in PSI generate a strong reducing agent 
capable of producing NADPH from NADP”. These two events lie at 
the opposite ends of redox chemistry in living organisms. As noted 
on page 209, the production of one molecule of O, requires the 
removal of four electrons from two molecules of water. The removal 
of four electrons from water requires the absorption of four photons, 
one for each electron. At the same time, the reduction of one mole- 
cule of NADP* requires the transfer of two electrons. Thus, hypo- 
thetically, if only one photosystem were able to transfer electrons 
from H,O to NADP’, four photons would be sufficient to produce 
two molecules of NADPH. Because two photosystems are utilized in 
the cell, that number is doubled to eight photons, four being utilized 
in PSH and four in PSL In other words, a total of eight moles of pho- 
tons must be absorbed by the cell to generate one mole of molecular 
oxygen and two moles of NADPH. Thus, if we add the reactions of 
PSII and PSI, ignoring the protons for a moment, we arrive at an over- 
all equation for the light reactions of 


2 H,O+2 NADP* —*P9"5 5109, 4+ 2NADPH 


(overall light reaction) 


In addition, the light reactions of photosynthesis establish a proton 
gradient across the thylakoid membrane that leads to the formation 
of ATP. The proton gradient forms as the result of the removal of H* 
from the stroma and the addition of H* to the thylakoid lumen. 
Contributions to the proton gradient (see Figure 6.16b) arise from 
(1) the splitting of water in the lumen; (2) oxidation of plastoquinol 
(PQH,) by cytochrome bf, releasing protons into the lumen; 
and (3) reductions of NADP* and PQ, which remove protons from 
the stroma. 

The light reactions of photosynthesis employ a considerable 
number of electron carriers, which serve as targets for a variety of 
different plant-killing chemicals (herbicides). A number of common 
herbicides, including diuron, atrazine, and terbutryn, act by binding 
to a core protein of PSII. We saw on page 208 how absorption of light 
by PSII leads to production of a PQH, molecule that is subsequently 
released from the Qs site of PSII and replaced by a PQ from the pool. 
The herbicides listed above act by binding to the open Qs site after 
release of PQH), blocking electron transport through PSII. The her- 
bicide paraquat has received attention in the news media because it 
is used to kill marijuana plants and because its residues are highly 
toxic to humans. Paraquat interferes with PSI function by competing 
with ferredoxin for electrons from the PSI reaction center. Electrons 
that are diverted to paraquat are subsequently transferred to O,, gen- 
erating highly reactive oxygen radicals (page 34) that damage the 
chloroplasts and kill the plant. Paraquat destroys human tissue by 
generating oxygen radicals using electrons diverted from complex I 
of the respiratory chain (page 184). 


REVIEW | 


1. Which steps in the light-dependent reactions are 
responsible for generating an electrochemical proton 
gradient across the thylakoid membrane? 
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6.8 Photophosphorylation 
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FIGURE 6.16 Summary of the light-dependent reactions. (a) Three- 
dimensional structures of the proteins of the thylakoid membrane that carry out the 
light-dependent reactions of photosynthesis. Of the four major protein complexes, 
PSII and cytochrome bef are present in the membrane as dimers, whereas PSI and 
the ATP synthase (shown in greater detail in Figure 5.24) are present as monomers. 
(b) Summary of the flow of electrons from H2O to NADPH through the three 
transmembrane complexes. This figure shows the estimated number of protons 
translocated through the membrane as the result of the oxidation of two molecules 
of water, yielding two pairs of electrons. The ATP synthase of the thylakoid 
membranes is also shown (see Section 5.7 for discussion of the ATP-synthesizing 
enzyme). Approximately four protons are required for the synthesis of each 
molecule of ATP (page 193). 

Source: (a) From Nature Revs. Mol. Cell Biol. 5:973, 2004, Fig. 1B. Image provided 
courtesy of Nathan Nelson. © 2004, Macmillan Publishers Ltd. 


aerobic bacterium. As in those cases, the ATP synthase (Figure 6.16) 
consists of a head (called CF, in chloroplasts), which contains the 
catalytic site of the enzyme, and a base (CF,), which spans the mem- 


The light-dependent reactions discussed in previous sections pro- 
vide the energy required to reduce CO, to carbohydrate. The con- 
version of one mole of CO, to one mole of carbohydrate (CH,O) 
requires the input of three moles of ATP and two moles of NADPH 
(see Figure 6.19). We have seen how plant cells produce the NADPH 
required for manufacture of carbohydrate; let us now examine how 
these same cells produce the required ATP. 

The machinery for ATP synthesis in a chloroplast is virtually 
identical to that of a mitochondrion or a plasma membrane of an 


brane and mediates proton movement. The two parts are connected 
by a rotary stalk. The CF, heads project outward into the stroma in 
keeping with the orientation of the proton gradient, which has its 
higher concentration within the thylakoid lumen (Figure 6.16). 
Thus, protons move from higher concentration in the lumen through 
the CF, base of the ATP synthase and into the stroma, thereby 
driving phosphorylation of ADP, as described in Chapter 5 for the 
mitochondrion. 


Measurements made during periods of maximal ATP synthesis 
suggest that differences in H* concentrations of 1000- to 2000-fold 
exist across the thylakoid membranes, corresponding to a pH gradi- 
ent (ApH) of more than three units. The movement of protons into 
the lumen during electron transport is neutralized by movement of 
other ions, so that a significant membrane potential is not built up. 
Thus, unlike in mitochondria where the proton-motive force is 
expressed primarily as an electrochemical potential (page 187), the 
proton-motive force (Ap) acting in the chloroplasts is largely, if not 
exclusively, due to a gradient of pH. 

The formation of ATP during the process of oxygenic photosyn- 
thesis is called noncyclic photophosphorylation because elec- 
trons move in a linear (i.e., noncyclic) path from H,O to NADP* 
(Figure 6.16). During the 1950s, Daniel Arnon of the University of 
California, Berkeley, discovered that isolated chloroplasts were not 
only capable of synthesizing ATP from ADP but could do so even in 
the absence of added CO, or NADP". These experiments indicated 
that chloroplasts had a means for ATP formation that did not require 
most of the photosynthetic reactions that would have led to oxygen 
production, CO, fixation, or NADP* reduction. All that was neces- 
sary was illumination, chloroplasts, ADP, and P,. The process Arnon 
had discovered was later called cyclic photophosphorylation 
and is a process that is carried out by PSI independent of PSII. 
Despite the fact that it was discovered over 50 years ago, cyclic pho- 
tophosphorylation is not well understood. Recent studies suggest 
that there are two overlapping pathways for cyclic electron transport, 
one of which is outlined in FIGURE 6.17. Cyclic electron transport 
begins with the absorption of a quantum of light by PSI and transfer 
of a high-energy electron to the primary acceptor. In the pathway 
depicted in Figure 6.17, the electron is passed along to ferredoxin, as 
is always the case, but rather than being transferred to NADP*, the 
electron is routed back to the electron-deficient reaction center (as 
shown in Figure 6.17) to complete the cycle. During the flow of an 
electron around this course, sufficient free energy is released to 
translocate protons (estimated at two H*/e’) across the membrane 
by the cytochrome b,f complex and to build a proton gradient 
capable of driving ATP synthesis. Cyclic photophosphorylation is 
thought to provide additional ATP required for carbohydrate 
synthesis (see Figure 6.19) and also for other ATP-requiring activities 
in the chloroplast (e.g., molecular chaperone involvement in protein 
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FIGURE 6.17 Simplified scheme for cyclic photophosphorylation. 
Absorption of light by PSI excites an electron, which is transferred to 
ferredoxin (step 1) and on to cytochrome bgf (step 2), plastocyanin (step 3), 
and back to P700* (step 4). In the process, protons are translocated by 
cytochrome b,f to form a gradient utilized for ATP synthesis (step 5). 
Another cyclic pathway for electron transport that involves movement of 
electrons from PSI through NADPH to cytochrome b,f is not shown. 
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development and growth of higher plants. 

Now that we have seen how the light reactions of photosynthesis 
lead to the production of ATP and NADPH, which are the energy 
stores required for the synthesis of carbohydrates, we can turn to the 
reactions that lead to carbohydrate formation. 


REVIEW | 


1. To what degree is the electrochemical proton gradient 
across the thylakoid membrane reflected in a pH 
gradient versus a voltage? 


2. How does the proton gradient lead to the formation 
of ATP? 


e____—_— 


6.9 Carbohydrate Synthesis 
in C3 Plants 


After World War II, Melvin Calvin of the University of California, 
Berkeley, along with colleagues Andrew Benson and James Bassham, 
began what was to be a decade-long study of the enzymatic reactions 
by which carbon dioxide was assimilated into the organic molecules 
of the cell. Armed with the newly available, long-lived radioactive 
isotope of carbon ('4C) and a new technique, two-dimensional paper 
chromatography, they began the task of identifying all the labeled 
molecules produced when cells took up ['*C]O}. The studies began 
with plant leaves but soon shifted to a simpler system, the green alga 
Chlorella. Algal cultures were grown in closed chambers in the pres- 
ence of unlabeled CO , after which radioactive CO, was introduced 
by injection into the culture medium. After the desired period of 
incubation with the labeled CO,, the algal suspension was drained 
into a container of hot alcohol, which had the combined effects of 
killing the cells immediately, stopping enzyme activity, and extract- 
ing soluble molecules. Extracts of the cells were then placed as a spot 
on chromatographic paper and subjected to two-dimensional chro- 
matography. To locate the radioactive compounds at the end of the 
procedure, a piece of X-ray film was pressed against the chromato- 
gram, and the plates were kept in the dark to expose the film. After 
photographic development, identification of radiolabeled com- 
pounds on the autoradiogram was made by comparison to known 
standards and by chemical analysis of the original spots. Let us con- 
sider some of their findings. 

Labeled CO, was converted to reduced organic compounds 
very rapidly. If the incubation period was very short (up to a few 
seconds), one radioactive spot on the chromatogram predominated. 
(FIGURE 6.18). The compound that formed the spot was determined 
to be 3-phosphoglycerate (PGA), one of the intermediates of glyco- 
lysis. Calvin's group initially suspected that CO, was being covalently 
linked (or fixed) to a two-carbon compound to form the three- 
carbon PGA molecule. Because the first intermediate to be identified 
was a three-carbon molecule, plants that utilize this pathway to fix 
atmospheric CO, are referred to as C3 plants. 

After considerable investigation, it became apparent that the 
initial acceptor was not a two-carbon compound, but a five-carbon 
compound, ribulose 1,5-bisphosphate (RuBP), which condensed 
with CO), to produce a six-carbon molecule. This six-carbon com- 
pound rapidly fragmented into two molecules of PGA, one of which 
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FIGURE 6.18 From the Archives: Chromatogram showing the results of 
an experiment in which algal cells were incubated for 5 seconds in ["4C]O2 
prior to immersion in alcohol. One spot, which corresponds to 
3-phosphoglycerate (labeled PGA), contains most of the radioactivity. 


Source: Courtesy of James Bassham and Melvin Calvin. 


contained the recently added carbon atom. Both the condensation of 
RuBP and the splitting of the six-carbon product (FIGURE 6.19a) are 


carried out in the stroma by a large multisubunit enzyme, ribulose 
bisphosphate carboxylase oxygenase, which is known familiarly as 
Rubisco. As the enzyme responsible for converting inorganic carbon 
into useful biological molecules, Rubisco is one of the key proteins in 
the biosphere. As it happens, Rubisco is capable of fixing only about 
three molecules of CO, per second, which may be the worst turno- 
ver number of any known enzyme (see Table 3.3). To compensate for 
this inefficiency, as much as half of the protein of leaves is present as 
Rubisco. In fact, Rubisco constitutes the most abundant protein on 
Earth, accounting for 5-10 kg for every human. 

As the structure of the various intermediates was determined, 
together with the positions of the labeled carbon atoms, it became 
apparent that the pathway for conversion of CO, into carbohydrate 
was cyclic and complex. This pathway has become known as the 
Calvin cycle (or Calvin-Benson cycle), and it occurs in cyano- 
bacteria and all eukaryotic photosynthetic cells. A simplified version 
of the Calvin cycle is shown in Figure 6.19b. The cycle comprises 
three main parts: (1) carboxylation of RuBP to form PGA; (2) reduction 
of PGA to the level of a sugar (CHO) by formation of glyceraldehyde 
3-phosphate (GAP) using the NADPH and ATP produced in the 
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version of the Calvin cycle showing the 
fate of six molecules of CO, that are fixed 
by combination with six molecules of 
RuBP. (Numerous reactions have been 
deleted.) The fixation of CO; is indicated 
in step 1. In step 2, the 12 PGA molecules 
are phosphorylated via ATP hydrolysis to 
form 12 1,3-bisphosphoglycerate (BPG) 
molecules, which are reduced in step 3 by 
electrons provided by NADPH to form 12 
molecules of glyceraldehyde 3-phosphate 
(GAP). Here, two of the GAPs are drained 
away (step 4) to be used in the synthesis 
of sucrose in the cytosol, which can be 
considered the product of the light- 
independent reactions. The other 10 
molecules are converted into six 
molecules of RuBP (step 5), which can act 
as the acceptor for six more molecules of 
CO>. The regeneration of six RuBPs 
requires the hydrolysis of 6 molecules of 
ATP. The NADPH and ATP used in the 
Calvin cycle represent the two high- 
energy products of the light-dependent 
reactions. 


POS 


light-dependent reactions; and (3) the regeneration of RuBP, which 
also requires ATP. If you look back at Figure 3.24, you will see that 
the formation of GAP from PGA in the Calvin cycle is the reverse of 
steps 5-7 of the glycolytic pathway. This simple comparison of a few 
reactions serves as another reminder that respiration and photosyn- 
thesis are biochemical pathways running in opposite directions. 
In this reaction series, respiration generates NADH and ATP, whereas 
the reverse reactions of photosynthesis expend NADPH and ATP. 

It can be seen in Figure 6.19b that for every six molecules of 
CO, fixed, 12 molecules of GAP are produced. (GAP is the spot 
marked triose phosphate on the chromatogram of Figure 6.18.) The 
atoms in 10 of these three-carbon GAP molecules are rearranged to 
regenerate six molecules of the five-carbon CO, acceptor, RuBP 
(Figure 6.19b). The other two GAP molecules can be considered as 
product. These GAP molecules can be exported to the cytosol in 
exchange for phosphate ions (see FIGURE 6.20) and used to synthe- 
size the disaccharide sucrose. Alternatively, GAP can remain in the 
chloroplast where it is converted to starch. An overview of the entire 
process of photosynthesis, including the light reactions (light 
absorption, oxidation of water, reduction of NADP‘, translocation 
of protons), phosphorylation of ADP, the Calvin cycle, and the syn- 
thesis of starch or sucrose, is shown in Figure 6.20. 
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FIGURE 6.20 An overview of the various stages of photosynthesis. 
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of the Calvin cycle are transported out of the leaf cells and into the 
phloem, where they are carried to the various nonphotosynthetic 
organs of the plant. Just as glucose serves as the source of energy and 
organic building blocks in most animals, sucrose serves an analo- 
gous role in most plants. Starch, on the other hand, is stored within 
chloroplasts as granules (see Figure 2.17b). Just as stored glycogen 
provides animals with readily available glucose in times of need, the 
starch stored in a plant's leaves provides it with sugars at night when 
the light reactions are not possible. It is evident from the reactions of 
Figure 6.19b that the synthesis of carbohydrate is an expensive activ- 
ity. The conversion of six molecules of CO, to one six-carbon sugar 
molecule and the regeneration of RuBP requires 12 molecules of 
NADPH and 18 molecules of ATP. This large energy expenditure 
reflects the fact that CO, is the most highly oxidized form in which 
carbon can occur. 


Redox Control 


Through the course of this book, we will discuss a diverse array of 
mechanisms used by cells to control the activity of proteins. One of 
these mechanisms, known as redox control, is turning up more and 
more often as a regulator of basic cellular processes, including pro- 
tein folding, transcription, and translation. We will consider it here 
because it is best understood as a regulator of chloroplast metabo- 
lism. Several key enzymes of the Calvin cycle are only active in the 
light when ATP and NADPH are being produced by photosynthe- 
sis. This light-dependent control of chloroplast enzymes is exercised 
by a small protein called thioredoxin that can occur in either a 
reduced or oxidized form. As noted on page 211, not all of the elec- 
trons that pass through ferredoxin are used to reduce NADP*. In 
fact, some of these electrons are transferred to thioredoxin. Once it 
has accepted a pair of electrons, thioredoxin reduces certain disulfide 
bridges (—S-S—) into sulfhydryl groups (—SH) in selected 
Calvin cycle enzymes (FIGURE 6.21). This covalent change in protein 
structure activates these enzymes, promoting the synthesis of 
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FIGURE 6.21 Redox control of the Calvin cycle. In the light, ferredoxin 
is reduced, and a fraction of these electrons are transferred to the small 
protein thioredoxin, which reduces the disulfide groups of certain Calvin 
cycle enzymes, maintaining them in an active state. In the dark, electron 
flow to thioredoxin ceases, the sulfhydryl groups of the regulated enzymes 
become oxidized to the disulfide state, and the enzymes are inactivated. 
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216 carbohydrates in the chloroplast. In the dark, when photosynthesis 
has ceased, thioredoxin is no longer reduced by ferredoxin, and the 
Calvin cycle enzymes revert to an oxidized (—S§-S—) state in which 
they are inactive. It can also be noted that several Calvin cycle 
enzymes are much more active at alkaline pH than at acid pH. This 
pH rise only occurs in the stroma when it is light and protons are 
being translocated into the thylakoid lumen as the result of photo- 
synthetic electron transport (Figure 6.16b). It follows from these 
findings that reference to the reactions of the Calvin cycle as “dark 
reactions” is a misnomer. 


Photorespiration 


One of the spots that appeared on the chromatograms from Calvin's 
early work on algal cells was identified as the compound glycolate, 
which was ignored (and correctly so) in formulating the Calvin cycle 
of Figure 6.19. While glycolate is not part of the Calvin cycle, it is a 
product of a reaction catalyzed by Rubisco. Approximately 20 years 
after it was found that Rubisco catalyzed the attachment of CO, to 
RuBP, it was discovered that Rubisco also catalyzed a second reaction 
in which O, is attached to RuBP to produce 2-phosphoglycolate 
(along with PGA) (FIGURE 6.22). Phosphoglycolate is subsequently 
converted to glycolate by an enzyme in the stroma. Glycolate pro- 
duced in the chloroplast is transferred to the peroxisome and even- 
tually leads to the release of CO, as described later (see FIGURE 6.23). 

Because it involves the uptake of O, and the release of CO,, this 
series of reactions is called photorespiration. Because photorespi- 
ration leads to the release of previously fixed CO, molecules, it is a 
waste of the plant’s energy. In fact, photorespiration can account for 
the loss of up to 50 percent of newly fixed carbon dioxide by crop 
plants growing under hot, dry conditions. To understand the reasons 
for this phenomenon, we need to consider an important aspect of 
plant physiology. The most serious problem faced by plants living in 
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a hot, dry climate is loss of water through openings, called stomata, 
at the surfaces of their leaves (Figure 6.2). Although open stomata 
lead to loss of water, they also provide the channel through which 
CO, enters the leaves. When it is hot and dry, C; plants close their 
stomata, which prevents dehydration. But stomata closure also pre- 
vents gas exchange so that the CO, concentration inside the leaf 
declines as it is used in the Calvin cycle and the O, concentration 
increases, as water is split by photolysis. When the concentration of 
O, is relatively high compared to the CO, concentration, photores- 
piration becomes a serious problem. As you might expect, a con- 
certed effort has been underway for decades to breed plants that are 
less likely to engage in photorespiration. So far, these efforts have 
met with virtually no success. 

Studies of the enzymatic activity of purified Rubisco show that 
the enzyme exhibits only a modest preference for CO, as a substrate 
over O,. The reason for this relative lack of specificity is that neither 
CO), nor O, binds to the active site of the enzyme (see Figure 3.11a). 
Rather, the enzyme binds RuBP, which adopts the enediol form shown 
in Figure 6.22. This form of RuBP can then be attacked by either CO, 
or O>. Viewed in this way, photorespiration appears to be an unavoid- 
able consequence of the catalytic properties of Rubisco, an enzyme 
that is presumed to have evolved at a time when O, levels in the 
atmosphere were virtually nonexistent. Under today’s atmospheric 
conditions, O, and CO, compete with one another, and the predomi- 
nant direction of the Rubisco-catalyzed reaction is determined by the 
ratio of CO,/O, that is available to the enzyme. When plants are 
grown in closed environments containing elevated levels of CO, they 
are capable of much more rapid growth by virtue of their elevated 
rates of CO, fixation. It is suggested that the rise in atmospheric CO, 
levels over the past century (from about 270 parts per million in 1870 
to 397 ppm in 2014) is responsible for as much as 10 percent of the 
increased crop yield that has occurred during this period. This rise in 
atmospheric CO, concentration is also thought to be responsible for 
global warming. Even a small rise in Earth’s temperature could have 
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FIGURE 6.22 The reactions of photorespiration. Rubisco can catalyze two different reactions with RuBP as a substrate (shown in the enediol state within 
the plane; see Figure 6.19a). If the RuBP reacts with Oz (step 1b), the reaction produces an oxygenase intermediate (step 2b) that breaks down into 3-PGA 
and 2-phosphoglycolate (step 3b). The subsequent reactions of phosphoglycolate are shown in Figure 6.23. The eventual outcome of these reactions is the 
release of CO2, a molecule that the cell had previously expended energy to fix. In contrast, if the RUBP molecule reacts with CO, (step 1a), the reaction 
produces a carboxylase intermediate (step 2a) that breaks down into two molecules of PGA (step 3a), which continue through the Calvin cycle. 
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FIGURE 6.23 The cellular basis of photorespiration. Electron micrograph of a portion of a leaf mesophyll cell of a tobacco plant showing a 
peroxisome (identified by its crystalline core) pressed against a pair of chloroplasts and close to a mitochondrion. The reactions of photorespiration 
that occur in each of these organelles are described in the text and shown superimposed on the organelles in which they occur. This series of reactions 
is referred to as the C2 cycle. The last steps of the cycle—the conversion of serine to glycerate in the peroxisome and then to 3-PGA in the chloro- 


plast—are not shown. 


Source: From Sue Ellen Frederick and Eldon H. Newcomb, Journal of Cell Biology, 43: 343-353, fig. 6, 1969. © Rockefeller University Press. 


major effects on global conditions, such as a rise in the level of the seas 
and a spread of the planet’s deserts see Section 6.11. 


Peroxisomes and Photorespiration 


Peroxisomes are cytoplasmic organelles whose role in oxidative 
metabolism was discussed in Section 5.10 of the last chapter. Studies 
on the peroxisomes of leaf cells have revealed a striking example of 
interdependence among different organelles, a feature that is often 
lost in studies with isolated cellular structures. The electron micro- 
graph in Figure 6.23 shows a peroxisome ofa leaf cell closely apposed 
to the surfaces of two adjacent chloroplasts and in close proximity to 
a nearby mitochondrion. This arrangement is not coincidental but 
reflects an underlying biochemical relationship whereby the prod- 
ucts of one organelle serve as substrates in another organelle. The 
reactions that take place in the different organelles are superimposed 
on the micrograph in Figure 6.23 and summarized below. 

It was noted above that photorespiration begins when RuBP reacts 
with O, to form 3-PGA and a two-carbon compound, phosphogly- 
colate (Figure 6.22). Once formed, phosphoglycolate is converted to 
glycolate, which is shuttled out of the chloroplast into a peroxisome. 
In the peroxisome, the enzyme glycolate oxidase converts glycolate 
to glyoxylate, which may then be converted to glycine and transferred 


to a mitochondrion. In the mitochondrion, two molecules of glycine 
(a two-carbon molecule) are converted to one molecule of serine 
(a three-carbon molecule), with the accompanying release of one 
molecule of CO,. Thus, for every two molecules of phosphoglycolate 
produced by Rubisco, one carbon atom that had previously been fixed 
is lost back to the atmosphere. The serine produced in the mitochon- 
drion may be shuttled back to the peroxisome and converted to 
glycerate, which can be transported to the chloroplast and utilized in 
carbohydrate synthesis by means of the formation of 3-PGA. 

Two groups of plants, called Cy and CAM plants, have overcome 
the negative effects of photorespiration by evolving metabolic mech- 
anisms that increase the CO,/O, ratio to which the Rubisco enzyme 
molecules are exposed. 


REVIEW | 


1. Describe the basic plan of the Calvin cycle, indicating 
the reactions that require energy input. Why is it 
described as a cycle? Why does energy have to be 
expended in this type of pathway? What are the 
eventual products of the pathway? 


————— 
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in C, and CAM Plants 


In 1965, Hugo Kortschak reported that when sugarcane was supplied 
with ['*C]O,, radioactivity first appeared in organic compounds 
containing four-carbon skeletons rather than the three-carbon PGA 
molecule found in other types of plants. Further analysis revealed 
that these four-carbon compounds (predominantly oxaloacetate and 
malate) resulted from combination of CO, with phosphoenolpyru- 
vate (PEP) and thus represented a second mechanism of fixation of 
atmospheric carbon dioxide (FIGURE 6.24). The enzyme responsible 
for linkage of CO, to PEP was named phosphoenolpyruvate carboxy- 
lase, and it catalyzes the first step of the Cy pathway. Plants that 
utilize this pathway are referred to as Cy plants and are represented 
primarily by tropical grasses. Before considering the fate of this 
newly fixed carbon atom, it is useful to examine the likely reasons an 
alternate pathway of CO, fixation has evolved. 

When a C; plant is placed in a closed chamber and its photosyn- 
thetic activity monitored, it is found that once the plant reduces CO, 
levels in the chamber to approximately 50 parts per million (ppm), the 
rate of CO, release by photorespiration equals the rate of CO, fixation 
by photosynthesis, so that net production of carbohydrate stops. In 
contrast, a plant that utilizes the C, pathway continues the net synthesis 
of carbohydrate until CO, levels have dropped to 1 to 2ppm. C, plants 
have this ability because PEP carboxylase continues to operate at much 
lower CO, levels than does Rubisco and is not inhibited by O5. But what 
is the value to a plant of being able to fix CO, at such low levels when 
the atmosphere invariably contains CO, at levels well above 300 ppm? 
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The value of the C, pathway becomes apparent when C, plants 
are placed in a dry, hot environment similar to that in which many of 
them live. C, plants are adapted to hot, arid environments because 
they are capable of closing their stomata to prevent water loss, yet they 
are still capable of maintaining sufficient CO, uptake to fuel their 
photosynthetic activity at a maximal rate. This is the reason that 
crabgrass, a C, plant, tends to take over a lawn, displacing the domes- 
tic C3 grasses originally planted. Sugarcane, corn, and sorghum are 
the most important crop plants that utilize the C, pathway. There is 
a disadvantage to C, metabolism; it requires more ATP and NADPH 
to produce carbohydrates than does C; metabolism. Consequently, 
under cooler, well-watered conditions, C3 plants can outcompete Cy 
plants because C3 photosynthesis is more efficient than C, photo- 
synthesis when photorespiration is not a problem. As a result, most 
C, plants do not do nearly as well in cooler temperatures or higher 
latitudes. 

When the fate of the CO, fixed by means of the C, pathway is 
followed, it is found that the CO, group is soon released, only to be 
recaptured by Rubisco and converted to metabolic intermediates of 
the C; pathway (Figure 6.24). The basis for this seemingly paradoxi- 
cal metabolism becomes evident when the leaf anatomy of a C, plant 
is examined. Unlike those of C; plants, leaves of C, plants contain 
two concentric cylinders of cells. An outer cylinder is made of meso- 
phyll cells and an inner cylinder of bundle-sheath cells (Figures 6.24). 
The fixation of CO, to PEP occurs in the outer mesophyll cells. PEP 
carboxylase activity can continue, even when the leaf stomata are 
almost entirely closed and the CO, level in the cells is very low. Once 
formed, the C, products are transported through plasmodesmata in 
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FIGURE 6.24 Structure and function in C, plants. Electron micrograph of a transverse section through the leaf of a Cy plant showing the spatial 
relationship between the mesophyll and bundle-sheath cells. Superimposed on the micrograph are the reactions of CO, fixation that occur in each type of 
cell. In step 1, COz is joined to PEP by the enzyme PEP carboxylase in a mesophyll cell that is located close to the leaf exterior. The four-carbon malate 
formed by the reaction is transported to the more centrally located bundle-sheath cell (step 2), where the COz is released. The CO becomes highly 
concentrated in the bundle-sheath cell, favoring the fixation of CO2 by Rubisco to form 3-PGA (step 3), which can be circulated through the Calvin cycle. 
The pyruvate formed when CO; is released is shipped back to the mesophyll cell (step 4), where it is converted to PEP. Although the process requires the 
hydrolysis of ATP (step 5), the high CO2/Oz in the bundle-sheath cell minimizes the rate of photorespiration. 


Source: Courtesy of S. Craig, CSIRO Plant Industry. 


the adjoining cell wall (see Figure 7.32) and into the thick-walled 
bundle-sheath cells, which are sealed off from atmospheric gases. 
Once in the bundle-sheath cells, the previously fixed CO, can be 
split from the C, carrier, producing a high CO, level in these interior 
cells—a level suitable for fixation by Rubisco. Carbon dioxide levels 
in the bundle-sheath cells may be 100 times that of the mesophyll. 
Thus, the C, pathway provides a mechanism to drive CO, fixation by 
the less efficient C, pathway by “pumping” CO), into the bundle 
sheath. Once CO, is split from the four-carbon compound, the 
pyruvate formed returns to the mesophyll cells to be recharged as 
PEP (Figure 6.24). In addition to saving water, plants that utilize the 
C, pathway are able to generate high CO,/O, ratios in the local 
Rubisco environment, thereby favoring the process of CO, fixation 
over that of photorespiration. In fact, attempts to demonstrate pho- 
torespiration in the intact leaves of C, plants usually fail. 

Many major events in the course of biological evolution, such as 
the uptake of the bacterium that gave rise to mitochondria (page 26), 
are thought to have occurred on only one occasion. It is remarkable 
that C, plants have evolved independently from C3; ancestors more 
than 45 times in 19 different families. Plant scientists are attempting 
to mimic this evolutionary process by introducing parts of the Cy 
photosynthetic machinery into C; plants with the goal of boosting 
crop productivity. For example, the PEP carboxylase gene from 
maize (a C, plant) has been introduced into rice (a C3 plant) in hopes 
of creating a CO,-concentrating mechanism within the rice meso- 
phyll cells where Rubisco is housed. Some of the field studies on 
these genetically engineered plants have been encouraging, but it is 
too soon to tell whether any observed increases in photosynthetic 
efficiency are due to increased levels of CO, assimilation, decreased 
levels of photorespiration, increased resistance to stress, or some 
other mechanism. 

Many desert plants, such as cacti, have another biochemical 
adaptation that allows them to survive in very hot, dry habitats. 


These plants are called CAM plants, and they utilize PEP carboxylase 219 


in fixing atmospheric CO), just like C, plants.’ Unlike C, plants, 
however, CAM species carry out light-dependent reactions and CO, 
fixation at different times of the day, rather than in different cells of 
the leaf. In other words, evolution has followed two different path- 
ways in overcoming the problem that photorespiration poses as it 
decreases photosynthetic efficiency. Whereas C3 and C, plants open 
their leaf stomata and fix CO, during the daytime, CAM plants keep 
their stomata closed during the hot, dry daylight hours. Then, at 
night, when the rate of water vapor loss is greatly reduced, they open 
their stomata and fix CO, by means of PEP carboxylase. As more 
and more carbon dioxide is fixed in the mesophyll cells during the 
night, the malate that is generated is transported into the cell’s cen- 
tral vacuole. The presence of malate (in the form of malic acid within 
the acidic vacuole) is evident by the sour “morning taste” of the 
plants. During the daylight hours, the stomata close, and the malic 
acid is moved into the cytoplasm. There it gives up its CO,, which 
can be fixed by Rubisco under conditions of low O, concentration 
that exist when the stomata are closed. Carbohydrates are then syn- 
thesized using energy from ATP and NADPH generated by the 
light-dependent reactions. 


CAM is an acronym for crassulacean acid metabolism, named for plants of the family 
Crassulaceae in which it was first discovered. 


REVIEW | 


1. Describe the major structural and biochemical 
differences between C3, Cy, and CAM plants. How do 
these differences affect the ability of plants to grow in 
hot, dry climates? 
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6.11 THE HUMAN PERSPECTIVE 


Global Warming and Carbon Sequestration 


The Earth’s surface, both land and water, has been heating up 
steadily since 1900. For example, all 10 of the top 10 hottest years 
in recorded history have happened after 1997. This trend of overall 
heating is known as “global warming,” and it poses a serious 
threat to both the natural environment and human welfare. What 
is causing this temperature increase? Atmospheric CO, has been 
steadily rising for the past century, such that the total quantity of 
CO, in the air has risen by 30 percent since pre-industrial times. 
This additional CO, is generated by burning fossil fuels such 
as coal, oil, or gas. For example, complete combustion of one 
molecule of octane (a major ingredient in gasoline), produces 
eight molecules of CO. COz is a so-called “greenhouse gas,” 
the name arising from the fact that CO, can absorb and re-emit 
infrared light. When sunlight hits the Earth, some of the visible 
light energy is re-radiated from the Earth’s surface as infrared 
light, much of which radiates out into space. But greenhouse 
gases such as CO, capture this infrared energy and re-emit it back 
to the Earth’s surface, thus heating up the atmosphere, land, and 
oceans. This capture of heat is similar to the way the glass panes 


ona greenhouse hold heat in, allowing plants to be grown during 
winter months in cold climates, which is why the effect of COz is 
called the greenhouse effect. 

There are several major greenhouse gases, including water 
vapor, COz, and methane. Although greenhouse gases occur nat- 
urally, the advent of large-scale industrialization has led to these 
gases being present in the atmosphere at levels never before 
seen. We know this is true based on measurements of these 
gases in ice core samples dating back hundreds of thousands 
of years. So, given that greenhouse gases lead to a warming of 
the earth's surface and given that the quantity of these gases has 
been steadily increasing as a result of our use of fossil fuels, it has 
been inferred that our own industrial activities have contributed 
in a large part to the observed trend of global warming. Could 
we halt global warming by producing less greenhouse gas? In the 
United States alone, annual production of COz is around 6 billion 
metric tons, of which half is generated from stationary sources 
like electrical power plants and cement factories. The remainder 
is generated by burning so-called transportation fuels, namely 
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gasoline, diesel, and jet fuel. Switching our transportation system 
to use electric cars would cut CO2 emissions by the vehicles them- 
selves, but in the end this is only a partial solution because the 
electricity to run those cars still needs to be produced by some 
sort of power plant, most likely one that burns fossil fuels. The 
only reason such a switch would even be partially useful is that 
large stationary power plants can be made more efficient than 
small gasoline engines in cars, because of the engineering trade- 
offs necessary to make car engines small and lightweight. 

Alternative sources of energy such as nuclear, solar, and wind 
could potentially have a large effect in reducing CO, emissions, 
but only if done on an extremely large scale. Nuclear energy, 
always a controversial solution, has become significantly less 
popular after the Fukushima Daiishi nuclear disaster in 2011, 
which raised serious concerns about safety. Biofuels (see chapter 
opener) are sometimes discussed in the context of global warming 
but in this case one has to be extremely careful depending on the 
type of fuel being produced. Some efforts to develop biofuel are 
directed toward production of hydrogen, which would be a highly 
desirable fuel from a greenhouse emission perspective because 
combustion of hydrogen does not produce any COz. On the other 
hand, it does not seem very likely that our entire industrial and 
transportation infrastructure can be switched over to hydrogen in 
the near term. Most biofuel work is directed at production of long 
hydrocarbon chains that would be in every respect equivalent to 
existing fossil fuels such as diesel or gasoline. When bio-diesel is 
burned, it produces just as much CO2 as would diesel fuel derived 
from crude oil. 

Fortunately, production of CO; is only half the story. As with 
any chemical equilibrium, the steady-state concentration of CO 
in the atmosphere depends not only on how fast it is being pro- 
duced, but also on how fast it gets consumed. Photosynthesis rep- 
resents the main way that COz is removed from the atmosphere. 
As we've seen in this chapter, photons of sunlight are absorbed 
by photosystems | and Il and used to move protons and elec- 
trons across the thylakoid membrane. Then the proton gradient 
thus formed is used to produce ATP by way of the CF;,CF, ATP 
synthase, and this ATP is used to convert COz into 3-phospho- 
glycerate (PGA) in the Calvin cycle. The PGA can then be used 
to make other carbohydrates. Carbohydrates made in plants by 
photosynthesis are stored inside the plant, and some are trans- 


. Which of the two photosystems operates at the most negative 
redox potential? Which generates the strongest reducing agent? 
Which must absorb four photons during each round of noncyclic 
photophosphorylation? 


2. Which type of plant (C3, Cy, or CAM) might you expect to do best 


if exposed to continuous daylight under hot, dry conditions? Why? 


3. Of the following substances, POQH>, reduced cytochrome bg, 


reduced ferredoxin, NADP*, NADPH, O5, HzO, which is the 
strongest reducing agent? Which is the strongest oxidizing 
agent? Which has the greatest affinity for electrons? Which has 
the most energetic electrons? 


4. Suppose you were to add the uncoupler dinitrophenol (DNP, 


page 187) to a preparation of chloroplasts carrying out photo- 


ported down into the roots, where they can leak out into the soil. 
The net result of these processes is a steady flow of carbon from 
CO, in the atmosphere to carbohydrates in the soil. This transfer 
of CO, from the atmosphere, where it acts as a greenhouse gas, 
to carbohydrates in the soil where they don't contribute to global 
warming, is a form of “carbon sequestration,” meaning seques- 
tration of carbon out of the atmosphere to a more innocuous loca- 
tion. Given how slowly plants grow, one might think that this route 
for carbon sequestration must be quite insignificant compared 
to the massive blast of CO» production from power generators 
around the world, but one has to remember that plants cover 
much of the Earth's land. Even if a given plant only sequesters a 
small quantity of carbon per year, the total mass of plants in all 
the world’s forests can have a substantial impact. In fact, when 
one considers how much carbon is stored in the atmosphere 
versus the soil, there is about twice as much stored in the soil. 
But as human populations increase, forests are being decimated, 
and this means that the flux of carbon out of the atmosphere, 
driven by photosynthesis, has been steadily declining, even as 
our production of COz has been increasing due to expanded fuel 
consumption. Indeed, deforestation is now viewed as a major con- 
tributor to global warming on a par with fossil fuel combustion. 
Thus, discussions about how to combat global warming need to 
consider geopolitical approaches to prevent and reverse defores- 
tation and not just how to burn oil more efficiently. 

Biofuels may have a part to play in carbon sequestration, 
because although they produce fuels that generate CO when 
burned, the production of the fuel will consume CO>. For example, 
methods are being developed to take the CO, produced by 
stationary electric power plants and feed it into algal ponds that 
would then fix the CO2 into carbohydrate. Ultimately, as dis- 
cussed previously, the biofuels would themselves produce CO> 
when they are burned, but it would be CO, that had already been 
collected from the burning of another fuel. In this sense, algal 
biofuels fed by industrial CO outflow are considered “carbon 
neutral.” If nothing else, this Human Perspective should convince 
you that reducing greenhouse gases is a complex problem that 
will require not only engineering solutions to reduce CO2 pro- 
duction, but also agricultural and ecological solutions to increase 
CO, sequestration by forest soil. This will pose a major challenge 
for the next generation of scientists and policy makers. 


Analytic Questions KR 


synthesis. Which of the following activities would you expect to be 
affected? (1) absorption of light, (2) cyclic photophosphorylation, 
(3) electron transport between PSII and PSI, (4) noncyclic pho- 
tophosphorylation, (5) synthesis of PGA, (6) NADP* reduction. 


5. Calculate the proton-motive force that would be formed across a 
thylakoid membrane that maintained a 10,000-fold difference in 
[H*] and no electric potential difference. (The equation for proton- 
motive force is on page 186.) Given what you know about photo- 
synthesis, which side of the thylakoid membrane (lumen or stroma) 
is the one with the higher proton concentration? (See Quantitative 
Tutorial Video). 


6. Under what conditions would you expect a plant to be engaged 
most heavily in cyclic photophosphorylation? 


10. 


Ws 


12. 


. Contrast the change in pH of the medium that occurs when 


isolated chloroplasts carry out photosynthesis compared to iso- 
lated mitochondria carrying out aerobic respiration. 


. It was noted in the previous chapter that the majority of the 


proton-motive force in mitochondria is expressed as a voltage. 
In contrast, the proton-motive force generated during photosyn- 
thesis is almost exclusively expressed as a PH gradient. How can 
you explain these differences? 


. Contrast the roles of PSI and PSII in generating the electrochem- 


ical gradient across the thylakoid membrane. 


Would you agree that a C3 plant has to expend more energy per 
molecule of CO converted to carbohydrate than a Cy plant? Why 
or why not? 


In photosynthesis, the capture of light energy results in the 
release and subsequent transfer of electrons. From what mole- 
cules are the electrons originally derived? In what molecules do 
these electrons eventually reside? 


How many ATP and NADPH molecules are required in the C3 
pathway to make one six-carbon sugar? If the synthesis of a mol- 
ecule of ATP were to require four protons, would you expect that 
these relative requirements for ATP and NADPH could be met 
by noncyclic photophosphorylation in the absence of cyclic 
photophosphorylation? 


13. 


14. 


15: 


16. 


12. 


18. 


19. 


If pheophytin and Ag (a chlorophyll a molecule) are the primary 221 


electron acceptors of PSII and PSI, respectively, what are the primary 
electron donors of each photosystem? 


Contrast the roles of three different metal atoms in the light- 
dependent reactions of photosynthesis. 


Would you expect the greenhouse effect (an increase in the CO 
content of the atmosphere) to have a greater effect on Cy plants 
or C3 plants? Why? 


Would you expect the water content of the atmosphere to be an 
important factor in the success of a C3 plant? Why? 


It has been suggested that low CO, levels play a key role in 
keeping Oz levels stable at 21 percent. How is it possible that 
CO, levels could affect Op» levels in the atmosphere? 


Suppose CO, levels were to rise to 600 ppm, which in fact they 
probably were about 300 million years ago. What effect would 
you suppose this might have regarding competition between C3 
and C, plants? 


Suppose you were to place a C3 plant under hot, dry conditions 
and provide it with radioactively labeled '®O. In what com- 
pounds would you find this label incorporated? 
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CHAPTER 


Interactions between Cells 
and their Environment 


Scanning electron microscopy images of a choanocyte chamber, the 
specialized feeding apparatus of some sponges. The morphological 
similarity between the non-metazoan choanoflagellate, and the choanocyte 
cells of the sponge (a metazoan) led researchers to hypothesize that 
choanoflagellates may be close relatives of metazoans and could provide 
insight into how multicellularity in animals came about. (Scott A. Nicols 
http://nicholslab.wikidot.com/research) 


Source: Courtesy of Scott Nichols. 


CHAPTER OUTLINE 


THE EVOLUTION OF 
MULTICELLULAR ORGANISMS 


Pike earliest life on Earth is thought to have originated 
three billion years ago in the form of simple, single-celled 
organisms. For over a billion years, single-celled life pros- 
pered, with some lineages, such as the eukaryotes, evolving 
increasing molecular and cellular complexity. Fossil evidence 
suggests that the earliest multicellular animal life— 
metazoans—evolved around 600 million years ago. 


What triggered the rise of metazoans? Environmental changes, 
such as an increase in atmospheric oxygen, are credited for 
the sudden burst of evolutionary diversity starting around 
1500 million years ago. It is also thought that predation played 
a key role in the evolution of multicellularity, because larger 
multicellular organisms would be able to engulf and ingest 
their smaller, single-celled neighbors. No matter the cause, 
it is clear that a number of innovations were needed in order 
to make the transition from unicellular to multicellular life. 
Some of the most basic of these requirements were the 
ability for cells to recognize, adhere, and communicate with 
one another. How did these behaviors arise in early metazoans? 


Genome sequencing has given us insight into this evolutionary 
puzzle. Studies of primitive single-celled organisms, such as 
choanoflagellates, widely considered to be one of the 
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closest living non-metazoan relative of animals, have been 
particularly illuminative. Genome sequencing of the choano- 
flagellate Monosiga brevicollis revealed that numerous 
molecules involved in cell adhesion, signaling, and the 
formation of the extracellular matrix were all present prior to 
the rise of the metazoan lineage. It is thought that these 


7.1 Overview of Extracellular 
Interactions 


Although the plasma membrane constitutes the boundary between a 
living cell and its nonliving environment, materials present outside 
the plasma membrane play an important role in the life of a cell. 
Most cells in a multicellular plant or animal are organized into 
clearly defined tissues in which the component cells maintain a 
defined relationship with one another and with the extracellular 
materials that lie between the cells. Even those cells that have no 
fixed relationship within a solid tissue, such as white blood cells that 
patrol the body, must interact in highly specific ways with other cells 
and with extracellular materials with which they come in contact. 
These interactions regulate such diverse activities as cell migration, 
cell growth, and cell differentiation, and they determine the three- 
dimensional organization of tissues and organs that emerges during 
embryonic development. 

We will focus in this chapter on the extracellular environment 
of cells and the types of interactions in which cells engage. FIGURE 7.1 
shows a diagrammatic section of human skin and provides an 
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molecules may have been used by single-celled pre-meta- 
zoan organisms to sense and attach to prey cells or to an 
immobile substrate. Over time, genome duplication events 
allowed for the diversification of genes, leading to molecules 
that could take on a variety of novel roles in the newly 
evolving metazoan phyla. 


overview of several of the subjects to be discussed in the chapter. The 
outer layer of skin (the epidermis) is a type of epithelial tissue. 
Like other epithelia, which line the spaces and free surfaces of the 
body, the epidermis consists largely of closely packed cells attached 
to one another and to an underlying noncellular layer by specialized 
contacts (depicted in the top inset of Figure 7.1). These contacts pro- 
vide a mechanism for cells to adhere to and communicate with one 
another. In contrast, the deeper layer of the skin (the dermis) is a 
type of connective tissue. Like other connective tissues, such as 
those that form a tendon or cartilage, the dermis consists largely of 
extracellular material, including a variety of distinct fibers that inter- 
act with each other in specific ways. A closer look at one of the scat- 
tered cells (fibroblasts) of the dermis shows that the outer surface of 
the plasma membrane contains receptors that mediate interactions 
between the cell and the components of its environment (depicted in 
the lower inset of Figure 7.1). These cell-surface receptors interact 
not only with the external surroundings, but are connected at their 
internal end to various cytoplasmic proteins. Receptors with this 
type of dual attachment are well suited to transmit messages between 
a cell and its environment. 
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FIGURE 7.1 An overview of how cells are organized into tissues and how they interact with one another and with their extracellular environment. 
In this schematic diagram of a section through human skin, the cells of the epidermis are seen to adhere to one another by specialized contacts. The basal 
layer of epidermal cells also adheres to an underlying, noncellular layer (the basement membrane). The dermis consists largely of extracellular elements that 
interact with each other and with the surfaces of scattered cells (primarily fibroblasts). The cells contain receptors that interact with extracellular materials and 


transmit signals to the cell interior. 
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If we begin at the plasma membrane and move outward, we can 
examine the types of extracellular elements that surround various 
types of cells. It was noted in Chapter 4 that virtually all integral 
membrane proteins, as well as certain membrane lipids, bear chains 
of sugars (oligosaccharides) of variable length that project outward 
from the plasma membrane (see Figure 4.4a). These carbohydrate 
projections form part of the glycocalyx (or cell coat) on the outer 
surface of the plasma membrane (FIGURE 7.2a). This layer is very 
prominent in some types of cells, such as the epithelial cells that line 
the mammalian digestive tract (Figure 7.2b). The glycocalyx is 
thought to mediate cell-cell and cell-substratum interactions, pro- 
vide mechanical protection to cells, serve as a barrier to particles 
moving toward the plasma membrane, and bind important regula- 
tory factors that act on the cell surface (Figure 7.2c). 

Many types of animal cells produce an extracellular matrix 
(ECM)—an organized network of secreted molecules that provides 
scaffolding to the cells and tissues it surrounds (FIGURE 7.3). The 
ECM is more than an inert packing material or a nonspecific glue that 
holds cells together; it provides physical, biochemical, and mechani- 
cal signals that can play key regulatory roles in determining the shape 
and activities of the cell. For example, enzymatic digestion of the 
ECM that surrounds cultured cartilage cells or mammary gland epi- 
thelial cells causes a marked decrease in the synthetic and secretory 
activities of the cells. Addition of extracellular matrix materials back 
to the culture dish can restore the differentiated state of the cells and 
their ability to manufacture their usual cell products. 

One of the best defined extracellular matrices is the basement 
membrane (or basal lamina), a continuous sheet 50 to 200 nm thick 
that (1) surrounds nerve fibers, muscles, and fat cells; (2) underlies 
the basal surface of epithelial tissues, such as the epidermis of the skin 
(Figures 7.1 and 7.4a-b), or the lining of the digestive and respiratory 
tracts; and (3) underlies the inner endothelial lining of blood vessels. 
Basement membranes provide mechanical support for the attached 
cells, generate signals that maintain cell survival, serve as a substra- 
tum for cell migration, separate adjacent tissues within an organ, and 
act as a barrier to the passage of macromolecules. In this latter capac- 
ity, basement membranes of the capillaries play an important role in 
preventing the passage of proteins out of the blood and into the tis- 
sues. This is particularly important in the kidney, where the blood is 
filtered under high pressure through a double-layered basement 
membrane that separates the capillaries of the glomerulus from the 
wall of the kidney tubules. Kidney failure in long-term diabetics may 
result from an abnormal thickening of the basement membranes sur- 
rounding the glomeruli. Basement membranes also serve as a barrier 
to invasion of tissues by cancer cells. The molecular organization of 
basement membranes is discussed later (see Figure 7.12). Those cells 
that do not lie on a basement membrane sheet, such as the fibroblasts 
of a connective tissue, are typically surrounded by a less organized 
ECM consisting largely of thread-like fibrils (FIGURE 7.4c). 

Even though the extracellular matrix may take diverse forms in dif- 
ferent tissues and organisms, it tends to be composed of similar macro- 
molecules. Unlike most proteins present inside of cells, which are 
compact, globular molecules, those of the extracellular space are typi- 
cally extended, fibrous species. These proteins are secreted into the 
extracellular space where they are capable of self-assembling into an 
interconnected three-dimensional network, which is depicted in 
FIGURE 7.5 and discussed in the following sections. Among their diverse 
functions, the proteins of the ECM serve as trail markers, scaffolds, gird- 


glycocalyx BM 


| PP 
—— = 


<f _stell 
2 


FIGURE 7.2 The glycocalyx. (a) The basal surface of an ectodermal cell 
of an early chick embryo. Two distinct structures closely applied to the 
outer cell surface can be distinguished: an inner glycocalyx (GC) and an 
outer basement membrane (BM). (b) Electron micrograph of the endothelial 
glycocalyx in a coronary capillary, showing extensive glycocalyx. 

(c) Molecular model of the glycocalyx. Sugar chains are shown in green, 
bound to transmembrane protein in red. 

Source: (a) From A. Martinez-Palomo, Int. Rev. Cytol. 29:64, 1970. © 1970, 
with permission from Elsevier; (b) By courtesy of B.M. van den Berg, The 
Einthoven laboratory for vascular medicine, Division of Nephrology, Leiden 
University Medical Center, Leiden, the Netherlands; (c) Courtesy of Igor Pivkin 
et al., Universita della Svizzera italiana. Biophys. Journal 106(1):232-234, 
2014, with permission from Elsevier. 


ers, wire, and glue. As noted throughout the following discussion, altera- 
tions in the amino acid sequence of extracellular proteins can lead to 
serious disorders. We will begin with one of the most important, and 
ubiquitous, molecules of the ECM, the glycoprotein collagen. 
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FIGURE 7.3 The extracellular matrix (ECM) of cartilage cells. (a) Scanning electron micrograph of a portion of a colony of cartilage cells (chondrocytes) 
showing the extracellular materials secreted by the cells. (6) The ECM of a single chondrocyte has been made visible by adding a suspension of red blood 
cells (RBCs). The thickness of the ECM is evident by the clear space (arrowhead) that is not penetrated by the RBCs. The bar represents 10 um. 

Source: (a) Courtesy Michael Solursh and Gerald Karp; (b) Courtesy of Greta M. Lee, Brian Johnston, Ken Jacobson, and Bruce Caterson. 
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FIGURE 7.4 The basement membrane (basal lamina). (a) Scanning 
electron micrograph of human skin. The epidermis has pulled away from 
part of the basement membrane, which is visible beneath the epidermal 
cells. (b) The dense network of collagen of the basement membrane is 
revealed by scanning EM after enzymatic separation of the epidermis. (c) 
The much looser collagen network of the dermis is visible here, with an 
adherent fibroblast labeled with an asterisk. 

Source: (a) Courtesy of Karen Holbrook.; (b) Reprinted by permission from 
Macmillan Publishers Ltd: Stoitzner, P., Pfaller, K. Stossel, H. A Close-Up 
View of Migrating Langerhans Cells in the Skin. Journal of Investigative 
Dermatology 118: 117-125, 2002.; (c) Reprinted by permission from 
Macmillan Publishers Ltd: Stoitzner, P., Pfaller, K. Stossel, H. A Close-Up 
View of Migrating Langerhans Cells in the Skin. Journal of Investigative 
Dermatology 118: 117-125, 2002. 
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Fibronectin 


Collagen 


FIGURE 7.5 An overview of the macromolecular organization of the extracellular matrix. The proteins and polysaccharides shown in this illustra- 
tion will be discussed in the following sections. The proteins depicted (fibronectin, collagen, and laminin) contain binding sites for one another, as well as 
binding sites for receptors (integrins) that are located at the cell surface. The proteoglycans are huge protein complexes that occupy much of the 


volume of the extracellular space. 


REVIEW | 


1. Distinguish between the glycocalyx, a basement 
membrane, and the extracellular matrix of cartilage 
tissue. 


eo | 


7.3 Components of the 
Extracellular Matrix 


The major components of extracellular matrices include collagens, 
proteoglycans, and a variety of proteins, such as fibronectin, and 
laminin. Each of the proteins of the extracellular matrix contains 
binding sites for one another and for receptors on the cell surface. As 
a result, these various extracellular materials interact to form an 
interconnected network that is bound to the cell surface. 


Collagen 


Collagens comprise a family of fibrous glycoproteins that are present 
only in extracellular matrices. Collagens are found throughout the 
animal kingdom and are noted for their high tensile strength, that is, 
their resistance to pulling forces. It is estimated that a collagen fiber 
1 mm in diameter is capable of suspending a weight of 10 kg (22 Ib) 
without breaking. Collagen is the single most abundant protein in the 
human body—constituting more than 25 percent of all protein—a fact 
that reflects the widespread occurrence of extracellular materials. 
Collagen is produced primarily by fibroblasts, the cells found in 
various types of connective tissues, but also by smooth muscle cells 
and epithelial cells. To date, 28 distinct types of human collagen have 
been identified. Each collagen type is restricted to particular locations 
within the body, but two or more different types are often present 


together in the same ECM. Additional functional complexity is 
provided by mixing several collagen types within the same fiber. 
These “heterotypic” fibers are the biological equivalent of a metal 
alloy. It is likely that different structural and mechanical properties 
result from different mixtures of collagens in fibers. Although there 
are many differences among the members of the collagen family, 
all share at least two important structural features. (1) All collagen 
molecules are trimers consisting of three polypeptide chains, called 
a chains. (2) Along at least part of their length, the three polypeptide 
chains of a collagen molecule are wound around each other to form a 
rod-like triple helix (FIGURE 7.6a). The a chains of collagen mole- 
cules contain large amounts of proline, and many of the proline (and 
lysine) residues are hydroxylated following synthesis of the polypep- 
tide. The hydroxylated amino acids are important in maintaining the 
stability of the triple helix by forming hydrogen bonds between com- 
ponent chains. Failure to hydroxylate collagen chains has serious 
consequences for the structure and function of connective tissues. 
This is evident from the symptoms of scurvy, a disease that results 
from a deficiency of vitamin C (ascorbic acid) and is characterized by 
inflamed gums and tooth loss, poor wound healing, brittle bones, and 
the weakening of the lining of blood vessels, causing internal bleed- 
ing. Ascorbic acid is required as a coenzyme by the enzymes that add 
hydroxyl groups to the lysine and proline amino acids of collagen. 

A number of the collagens, including types I, II, and III, are 
described as fibrillar collagens because they assemble into rigid, 
cable-like fibrils, which in turn become packaged into thicker fibers 
that are typically large enough to be seen in the light microscope. 
The side-by-side packaging of the rows of collagen I molecules 
within a collagen fibril is depicted in Figure 7.6b,c. The fibrils are 
strengthened by covalent cross-links between lysine and hydroxyly- 
sine residues on adjacent collagen molecules. This cross-linking pro- 
cess continues through life and may contribute to the decreased 
elasticity of skin and increased brittleness of bones among the 


(d) 250nm 


FIGURE 7.6 The structure of collagen I. This figure depicts several levels 
of organization of a fibrillar collagen. (a) The collagen molecule (or monomer) 
is a triple helix composed of three helical « chains. Some types of collagen 
contain three identical a chains and thus are homotrimers, while others are 
heterotrimers containing two or three different chains. Each collagen | 
molecule is 295 nm in length. (6) Collagen | molecules become aligned in 
rows in which the molecules in one row are staggered relative to those in the 
neighboring row. A bundle of collagen | molecules, such as that shown here, 
forms a collagen fibril. The staggered arrangement of the molecules produces 
bands (horizontal black lines in the illustration) across the fibril that repeat with 
a periodicity of 67 nm (which is equal to the length of the gap between 
molecules plus the overlap). (c) An electron micrograph of human collagen 
fibrils observed after metal shadowing (see Figure 18.17). The banding pattern 
of the fibrils is evident. (a) An atomic force micrograph showing the surface of 
a collagen fibril that suggests that its subcomponents are twisted spirally 
around the fibril axis like a rope. The banding pattern remains evident. The 
arrows point to places where the fibril is slightly unwound, as would occur if 
you were to twist a rope in the opposite direction it is wound. 

Source: (c) Courtesy Jerome Gross and Francis O. Schmitt; From Laurent 
Bozec et al., Biophys. J. 92:71, 2007. © 2007, with permission from Elsevier. 
(d) From Laurent Bozec et al., Biophys. J. 92:71, 2007. © 2007, with 
permission from Elsevier. 


elderly. Recent studies that probe the surface of collagen fibrils sug- 
gest that they are composed of strands that are twisted around the 
axis of the fibril to form a structure that is more like a nanosized rope 
than a cable (Figure 7.6d ). 

Among the various components of the ECM, collagen molecules 
provide the insoluble framework that determines many of the mechani- 
cal properties of the matrix. In fact, the properties of a particular tissue 
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collagen molecules. For example, tendons, which connect muscles to 
bones, must resist tremendous pulling forces during times of muscular 
contraction. Tendons contain an ECM in which the collagen fibrils are 
aligned parallel to the long axis of the tendon and thus parallel to the 
direction of the pulling forces. The cornea is also a remarkable tissue; it 
must serve as a durable, protective layer at the surface of the eyeball but 
must also be transparent so that light can pass through the lens to the 
retina. The thick, middle layer of the cornea is the stroma, which contains 
relatively short collagen fibrils that are organized into distinct layers. The 
layered structure of the stroma is similar to that of plywood: the fibrils of 
each layer are parallel to other fibrils in the layer but nearly perpendicular 
to the fibrils in the layer on either side (FIGURE 7.7). This plywood-like 
structure provides strength to this delicate tissue, while the uniformity of 
size and ordered packing of the fibrils minimize the scattering of incom- 
ing light rays, thereby promoting the tissue’s transparency. 

Given their abundance and widespread distribution, it is not 
surprising that abnormalities in fibrillar collagen formation can lead 
to serious disorders. Burns or traumatic injuries to internal organs | 
can lead to an accumulation of scar tissue, which consists largely of | 
fibrillar collagen. A common condition described as fibrosis results | 
from excessive production of collagen-containing connective tissue. | 
Fibrosis occurs most often in the lungs (pulmonary fibrosis) and 
liver (cirrhosis) where the collagen-containing scar tissue gradually | 
replaces the normal tissue of the organ. Mutations in genes encoding 
type I collagen can produce osteogenesis imperfecta, a potentially 
lethal condition characterized by extremely fragile bones, thin skin, 
and weak tendons. Mutations in genes encoding type II collagen 
alter the properties of cartilage tissue, causing dwarfism and skeletal 
deformities. Mutations in a number of other collagen genes can lead | 
to a variety of distinct but related defects in collagen matrix structure 
(called Ehler-Danlos syndromes). Persons with one of these syn- 
dromes have hyperflexible joints and highly extensible skin. 

Not all collagens form fibrils. One of the nonfibrillar collagens | 
is type IV, whose distribution is restricted to basement membranes. | 


200 nm 


FIGURE 7.7 The corneal stroma consists largely of layers of collagen fibrils 
of uniform diameter and spacing. The molecules of alternate layers are 
arranged at right angles to one another, resembling the structure of plywood. 
Source: From Nigel J. Fullwood, Structure 12:169, 2004, Fig. 1; © 2004, 

with permission from Elsevier. 
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collagen molecules are organized into a network that provides 
mechanical support and serves as a lattice for the deposition of other 
extracellular materials (see Figure 7.12). Unlike type I collagen, 
which consists of a long, uninterrupted triple helix, the type 
IV collagen trimer contains nonhelical segments interspersed along 
the molecule and globular domains at each end. The nonhelical seg- 
ments make the molecule flexible, while the globular ends serve as 
sites of interaction between molecules that give the complex its lat- 
tice-like character (FIGURE 7.8). Mutations in type IV collagen genes 
have been identified in patients with Alport syndrome, an inherited 
kidney disease in which the glomerular basement membrane (Figure 
7 Ab) is disrupted. 


Proteoglycans 


In addition to collagen, basement membranes and other extracellu- 
lar matrices typically contain large amounts of a distinctive type of 
protein-polysaccharide complex called a proteoglycan. A proteo- 
glycan (FIGURE 7.9a) consists of a core protein molecule (shown in 
black in Figure 7.9a) to which chains of glycosaminoglycans (GAGs) 
are covalently attached (shown in red in the figure). Each glycosami- 
noglycan chain is composed of a repeating disaccharide; that is, it 
has the structure -A-B-A-B-, where A and B represent two different 
sugars. GAGs are highly acidic due to the presence of both sulfate 
and carboxyl groups attached to the sugar rings (Figure 7.9b). 
Proteoglycans of the extracellular matrix may be assembled into 
gigantic complexes by linkage of their core proteins to a molecule of 
hyaluronic acid, a nonsulfated GAG (Figure 7.9c). The microscopic 
appearance of one of these complexes, which can occupy a volume 
equivalent to that of a bacterial cell, is shown in Figure 7.9d. 


Chondroitin 
(C005 sulfate 
Keratan 
CHAOH sulfate 
a 
0) 
H 
H NHCOCH, 
Hyaluronic 
COO- acid CH,OH 
H 0 
H oe 
HO H 
H NHCOCH; 


(a) (b) 


50 nm 


FIGURE 7.8 The type IV collagen network of the basement 
membrane. Electron micrograph of a basement membrane from human 
amniotic tissue that had been extracted with a series of salt solutions to 
remove noncollagenous materials. The treatment leaves behind an 
extensive, branching, polygonal network of strands that form an irregular 
lattice. Evidence indicates that this lattice consists of type IV collagen 
molecules covalently linked to one another in a complex three-dimensional 
array. A model of the basement membrane scaffold is shown in Figure 7.12. 
Source: From Peter D. Yurchenco and George C. Ruben, J. Cell Biol. 
105:2561, 1987, Fig. 1d. © 1987, reproduced with permission of 
Rockefeller University Press. Image courtesy Peter D. Yurchenco, Robert 
Wood Johnson Medical School. 


FIGURE 7.9 The structure of a cartilage-type proteoglycan complex. (a) Schematic representation of a single proteoglycan consisting of a core protein 
to which a large number of glycosaminoglycan chains (GAGs, shown in red) are attached. A proteoglycan from cartilage matrix (e.g., aggrecan) may contain about 
30 keratan sulfate and 100 chondroitin sulfate chains. Proteoglycans found in basement membranes (e.g., perlecan and agrin) have only a few GAG chains attached 
to the core protein. (b) The structures of the repeating disaccharides that make up each of the GAGs shown in this figure. All GAGs bear large numbers of negative 
charges (indicated by the blue shading). (c) In the cartilage matrix, individual proteoglycans are linked to a nonsulfated GAG, called hyaluronic acid (or hyaluronan), 
to form a giant complex with a molecular mass of about 3 million daltons. The box indicates one of the proteoglycans of the type shown in part a. (d) Electron 
micrograph of a proteoglycan complex, comparable to that illustrated in part c, that was isolated from cartilage matrix. 


Source: (d) Courtesy Joseph A. Buckwalter. 


Because of the negative charges borne on the sulfated GAGs, 
proteoglycans bind huge numbers of cations, which in turn bind 
large numbers of water molecules. As a result, proteoglycans form a 
porous, hydrated gel that fills the extracellular space like packing 
material (Figure 7.5) and resists crushing (compression) forces. This 
property complements that of the adjacent collagen molecules, 
which resist pulling forces and provide a scaffold for the proteogly- 
cans. Together, collagens and proteoglycans give cartilage and other 
extracellular matrices strength and resistance to deformation. The 
extracellular matrix of bone is also composed of collagen and pro- 
teoglycans, but it becomes hardened by impregnation with calcium 
phosphate salts. Both sulfated and nonsulfated GAGs are taken 
widely as health supplements with the aim of improving the condi- 
tion of skin and joints. 

In addition to its structural role, proteoglycans are thought to 
also play an important role in cell-cell signaling. A number of growth 
factors have been found to bind to proteoglycans, including fibro- 
blast growth factor (FGF) and vascular endothelial growth factor 
(VEGF). It has been postulated that the ECM may act as a “sink” for 
growth factors and other signaling molecules, allowing for the for- 
mation of a stable signaling gradient. 


Fibronectin 


The term matrix implies a structure made up of a network of inter- 
acting components. The term has proven very apt for the extracel- 
lular matrix, which contains a number of proteins, in addition to 
collagen and proteoglycans, that interact with one another in highly 
specific ways (Figure 7.5). Fibronectin, like several other proteins to 
be discussed in this chapter, consists of a linear array of distinct 
“building blocks,’ or domains, that gives each polypeptide a modu- 
lar construction (Figure 7.10a). Each fibronectin polypeptide is 
constructed from a sequence of approximately 30 Fn domains, two 
of which are depicted in the top inset of FIGURE 7.10a. While Fn- 
type domains were first discovered in fibronectin, they are found as 
part of many other proteins, ranging from blood clotting factors to 
membrane receptors (see Figure 7.21). As discussed in Section 2.10, 
the presence of shared segments among diverse proteins strongly 
suggests that many present-day genes have arisen during evolution 
by the fusion of parts of separate ancestral genes. In fibronectin, the 
30 or so Fn domains combine to form five or six larger functional 
units, illustrated by the colored cylinders in Figure 7.10a. Each of 
the two polypeptide chains that make up a fibronectin molecule 
contains 


1. Binding sites for numerous components of the ECM, such as col- 
lagens, proteoglycans, and other fibronectin molecules. These 
binding sites facilitate interactions that link these diverse mole- 
cules into a stable, interconnected network (Figure 7.5). Binding 
activities of fibronectin molecules are enhanced by mechanical 
forces that pull on these fibrous proteins and unfold its compo- 
nent modules. Unfolding of these modules exposes binding sites 
that would otherwise have remained buried. 


2. Binding sites for receptors on the cell surface. These binding sites 
hold the ECM in a stable attachment to the cell (Figure 7.5). The 
importance of a fibronectin-containing substratum for cell 
attachment is illustrated in Figure 7.10: The cultured endothelial 
cell shown in this photo has adopted a shape unlike any it would 
have in the body because it has spread itself over the available 
surface provided by a square coat of fibronectin. 
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is particularly evident during embryonic development. Development 
is characterized by waves of cell migration during which different 
cells follow different routes from one part of the embryo to another 
(FIGURE 7.11a). Migrating cells are guided by proteins, such as 
fibronectin, that are contained within the molecular landscape 
through which they pass. For example, the cells of the neural crest, 
which migrate out of the developing nervous system into virtually all 
parts of the embryo, traverse pathways rich in fibrils composed of 
interconnected fibronectin molecules (Figure 7.10b). The ability of 
these fibronectin pathways to direct neural crest cell migration can 
be recapitulated on a culture dish as shown in Figure 7.10c. A num- 
ber of organs of the body, including the salivary gland, kidney, and 
lung are formed by a process of branching in which the epithelial 
layer becomes divided by a series of clefts (Figure 7.10d ). The 
importance of fibronectin in the formation of these clefts is illus- 
trated in Figure 7.10e, which shows a salivary gland that has been 
incubated with antibodies that bind to fibronectin. Cleft formation 
and branching is entirely abolished as a result of the inactivation of 
fibronectin molecules. 


Laminin 


Laminins are a family of extracellular glycoproteins that consist of 
three different polypeptide chains linked by disulfide bonds and 
organized into a molecule resembling a cross with three short arms 
and one long arm (Figure 7.5). At least 15 different laminins have 
been identified. Like fibronectin, extracellular laminins can greatly 
influence a cell’s potential for migration, growth, and differentiation. 
For example, laminins play a critical role in the migration of primor- 
dial germ cells (Figure 7.11a). These cells arise in the yolk sac, which 
is located outside the embryo itself, and then migrate by way of the 
bloodstream and embryonic tissues to the developing gonad, where 
they eventually give rise to sperm or eggs. During their migration, 
the primordial germ cells traverse surfaces that are particularly rich 
in laminin (Figure 7.11). Primordial germ cells possess a cell sur- 
face protein that adheres strongly to one of the subunits of the 
laminin molecule. The influence of laminin on nerve outgrowth is 
shown in Figure 9.48a. 

In addition to binding tightly to cell-surface receptors, laminins 
can bind to other laminin molecules, to proteoglycans, and to other 
components of basement membranes (Figure 7.5). The laminin and 
type IV collagen molecules of basement membranes are thought 
to form separate, but interconnected, networks, as depicted in 
FIGURE 7.12. These interwoven networks give basement membranes 
both strength and flexibility. In fact, basement membranes fail to 
form in mouse embryos that are unable to synthesize laminin, caus- 
ing the death of the embryo around the time of implantation. 


REVIEW | 


1. Contrast the roles of collagen and proteoglycans in 
the extracellular space. How do fibronectin and 
laminin contribute to embryonic development? 


2. List a few of the functions of the extracellular matrix in 
animal tissues. 
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FIGURE 7.10 Structure of fibronectin and its importance during 
embryonic development. (a) A human fibronectin molecule consists of two 
similar, but nonidentical, polypeptides joined by a pair of disulfide bonds 
located near the C-termini. Each polypeptide is composed of a linear series 
of distinct modules, which are organized into several larger functional units, 
illustrated by the colored cylinders in this figure. Each of these functional 
units contains one or more binding sites for either a specific component of 
the ECM or for the surface of cells. Some of these binding activities are 
indicated by the labels. The cell-binding site of the polypeptide that 
contains the sequence arg-gly-asp, or RGD, is indicated. As discussed later 
in the chapter, this sequence binds specifically to a particular class of integral 
plasma membrane proteins (integrins) that are involved in cell attachment 
and signal transduction. The inset shows two of the 30 or so repeating Fn 
modules that comprise the polypeptide; the RGD sequence forms a loop of 
the polypeptide, protruding from one module. (b) A section through a 
young chick embryo that has been treated with fluorescent antibodies 
against fibronectin. Fibronectin is present as fibrils in the basement 
membranes (dark red sites) that lie beneath the embryonic epithelia and 
provide a substratum over which cells migrate. (c) In this micrograph, neural 
crest cells are migrating out from a portion of the developing chick nervous 
system (beyond the edge of the photo) onto a glass culture dish that 
contains strips of fibronectin-coated surface alternating with strips of bare 
glass. The boundary of a fibronectin-coated region is indicated by the white 
lines. It is evident that the cells remain exclusively on the fibronectin-coated 
surface. Cells that reach the glass substratum (arrowheads) tend to round up 
and lose their migratory capabilities. The arrow indicates the direction of 
migration. (d, e) The role of fibronectin in the formation of the embryonic 
salivary gland. The micrograph in d shows a mouse embryonic salivary gland 


that has been grown for 10 hours in culture. The gland is seen to be divided into separate buds by a series of clefts (open triangles). The gland shown in e has 
been grown for the same period of time in the presence of an anti-fibronectin antibody, which has completely inhibited cleft formation. Scale bar equals 
100um. (f) Fluorescence micrograph of an endothelial cell—a type of cell that lines the inner surface of blood vessels. The cell is square because it has spread 
itself over a tiny square-shaped patch of an adhesive protein called fibronectin that was applied to a culture dish. The cell appears to be mounted in a green 
frame because it was treated with a green fluorescent antibody that binds to the cytoplasmic protein actin, a component of the cytoskeleton. 

Source: (b) Courtesy of James W. Lash; (c) From Giovanni Levi, Jean-Loup Duband, and Jean Paul Thiery, Int. Rev. Cytol. 123:213, 1990. © 1990, with 
permission of Elsevier; (d-e) From Takayoshi Sakai, Melinda Larsen, and Kenneth M. Yamada, Nature 423:877, 2003, Figs. 3a and 3b © 2003, reprinted by 
permission of Macmillan Publishers Ltd.; (f) From Christopher S. Chen, Clifford Brangwynne, and Donald E. Ingber, Trends Cell Biol. 9:283, 1999; © 1999, 
with permission of Elsevier. 


Neural crest 


Erythroid 
cells 


Lymphoid 
cells » 
Primordial 
germ cells 

(PGC) 


(a) (b) 


20 um 


FIGURE 7.11 The role of cell migration during embryonic development. (a) A summary of some of the cellular traffic occurring during mammalian develop- 
ment. The most extensive movements are conducted by the neural crest cells (shown in blue), which migrate out of the neural plate in the dorsal midline of the 
embryo and give rise to all the pigment cells of the skin (P), the sympathetic ganglia (SeG), adrenal medulla (AdM), and the cartilage of the embryonic skull (Mx, Md 
for maxillary and mandibular arches). The primordial germ cells (PGC) migrate from the yolk sac to the site of gonad (G) formation within the embryo. The 
progenitors of lymphoid cells are transported to the liver (L), bone marrow (Bm), thymus (Thy), lymph nodes (LN), and spleen (Sp). (Note: The “pathways” shown here 
connect the original sites of cells with their destinations; they do not accurately depict the actual routes of the cells.) (b) Micrograph of a section of a portion of the 


hind gut of a 10-day mouse embryo. The primordial germ cells (green) are seen to be migrating along the dorsal mesentery on their way to the developing gonad. 
The tissue has been stained with antibodies against the protein laminin (red), which is seen to be concentrated in the surface over which the cells are migrating. 
Source: (a) A. A. Moscona & R. E. Hausman, Cell and Tissue Interactions, J. W. Lash & M. M. Burger Eds. Raven Press, 1977 ISBN: 0890041806; (b) From 
Martin |. Garcia-Castro, Robert Anderson, Janet Heasman, and Christopher Wylie, J. Cell Biol. 138:475, 1997, Fig. 3. Reproduced with permission of the 


Rockefeller University Press. 


FIGURE 7.12 A model of the basement membrane scaffold. Basement 
membranes contain two network-forming molecules, collagen IV (pink), 
which was illustrated in Figure 7.8, and laminin (green), which is indicated 
by the thickened cross-shaped molecules. The collagen and laminin 
networks are connected by entactin molecules (purple). 

Source: Copyright 1992, Rockefeller University Press. Originally published in 
The Journal of Cell Biology 117:1132. 


7.4 Dynamic Properties of the 
Extracellular Matrix 


The micrographs and diagrams presented in the first section of this 
chapter portray the extracellular matrix (ECM) as a static structure. 
In actuality, the ECM can exhibit dynamic properties, both in space 
and time. Spatially, for example, ECM fibrils can be seen to stretch 
several times their normal length as they are pulled on by cells and 
contract when tension is relieved. Temporally, the components of an 


ECM are subject to continual degradation and reconstruction. These 
processes serve to renew the matrix and to allow it to be remodeled 
during embryonic development or following tissue injury. Even the 
calcified matrix of our bones, which we think of as a stable, inert 
structure, is subject to continual restoration. 

The degradation of extracellular materials, along with cell surface 
proteins, is accomplished largely by a family of zinc-containing enzymes 
called matrix metalloproteinases (MMPs) that are either secreted 
into the extracellular space or anchored to the plasma membrane. As a 
group, MMPs can digest nearly all of the diverse ECM components, 
although individual family members are limited as to the types of extra- 
cellular molecules they can attack. The physiological roles of MMPs are 
not well understood, but they are thought to be involved in tissue 
remodeling, embryonic cell migration, wound healing, and the forma- 
tion of blood vessels. As might be expected from enzymes whose nor- 
mal function is to destroy extracellular materials, the excessive or 
inappropriate activity of MMPs is likely to cause disease. In fact, MMPs 
have been implicated in a number of pathological conditions, including 
arthritis, tooth and gum disease, formation of blood clots and heart 
attack, and tumor progression (Section 7.8). At least three inherited 
skeletal disorders have been traced to mutations in MMP genes. 


7.5 Integrins 


It was noted in the previous discussion that components of the ECM, 
such as fibronectin, laminin, proteoglycans, and collagen, are capable 
of binding to receptors situated on the cell surface (as in Figure 7.5). 
The most important family of receptors that attach cells to their 
extracellular microenvironment is the integrins. 
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Integrins are a family of membrane proteins found only in 
animals that play a key role in integrating the extracellular and intra- 
cellular environments. On the outer side of the plasma membrane, 
integrins bind to a remarkably diverse array of molecules (ligands) 
that are present in the extracellular environment. On the intracellular 
side of the membrane, integrins interact either directly or indirectly 
with dozens of different proteins to influence the course of events 
within the cell. Integrins are composed of two membrane-spanning 
polypeptide chains, an o chain and a B chain, that are noncovalently 
linked. Eighteen different a subunits and eight different B subunits 
have been identified. Although more than a hundred possible pair- 
ings of a and B subunits could theoretically occur, only about two 
dozen different integrins have been identified on the surfaces of cells, 
each with a specific distribution within the body. Whereas many sub- 
units occur in only a single integrin heterodimer, the B, subunit is 
found in 12 different integrins and the a, subunit in 5 of them (see 
Table 7.1). Most cells possess a variety of different integrins, and con- 
versely, most integrins are present on a variety of different cell types. 

Electron microscopic observations of integrin molecules beginning 
in the late 1980s suggested that the two subunits are oriented so as to 
form a globular extracellular head connected to the membrane by a pair 
of elongated “legs” (as depicted in Figure 7.5). The legs of each subunit 
extend through the lipid bilayer as a single transmembrane helix and 
end ina small cytoplasmic domain of about 20 to 70 amino acids.' The first 
X-ray crystallographic structure of the extracellular portion of an inte- 
grin was published in 2001 and displayed a highly unexpected feature. 
Rather than “standing upright” as predicted, the integrin af, was 
bent dramatically at the “knees” so that the head faces the outer surface 
of the plasma membrane (FIGURE 7.13). To understand the signifi- 
cance of this bent structure, we need to explore the properties of integrins. 


‘One exception to this molecular architecture is the B , chain, which has an extra thousand 
or so amino acids as part of its cytoplasmic domain. This huge addition makes  , integrins 
capable of extending much more deeply into the cytoplasm (see Figure 7.19) 


TABLE 7.1 Classification of Integrin Receptors Based 
on Recognition of RGD Sequences 


RGD Recognition Non-RGD Recognition 


Integrin 
receptor 


Integrin 


receptor Key ligands Key ligands 


Fibronectin Collagen 
Collagen 
Laminin 
Collagen 
Laminin 
Fibronectin 


O38) 
OsBy 
OB 


Fibronectin 
Fibronectin 


Fibronectin 
von Willebrand 
factor 


oB3 


VCAM 

Laminin 
ICAM-1 
ICAM-2 


Vitronectin 
Fibrinogen 
Fibronectin 
von Willebrand 
factor 
Vitronectin 


Fibrinogen 
ICAM-1 


Vitronectin 
Fibronectin 


Source: S. E. D'Souza, M. H. Ginsberg, E. F. Plow, Trends Biochemical Sciences 16:249, 
1991. 

Trends in Biochemical Sciences by International Union of Biochemistry. 
Reproduced with permission of International Union of Biochemistry; Elsevier Publ. 
in the format Journal via Copyright Clearance Center. 


A body of evidence strongly suggests that the bent conforma- 
tion of an integrin shown in Figure 7.13a corresponds to the inactive 
state of the protein, one that is incapable of binding a ligand. In fact, 
when an af; integrin containing a bound ligand is analyzed, the 
integrin no longer exhibits the bent structure but is present instead 
in an upright conformation as illustrated in Figure 7.13b-c. The 
ligand is bound to the head of the integrin in a cleft where the a and 
B subunits come together If the bent and upright structures repre- 
sent the inactive and active states of an integrin, respectively, then it 
is important to consider what type of stimulus is capable of trigger- 
ing such a remarkable transformation in protein conformation. 

In the bent, inactive conformation, the transmembrane domains 
of the two subunits are in close proximity, apparently held together by 
noncovalent interactions between residues of the two helices. The 
cytoplasmic domains of integrins bind a wide array of proteins; one of 
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FIGURE 7.13 Integrin conformations. (a) Schematic drawing of a complete 
integrin in the bent, inactive conformation, and a corresponding electron 
micrograph of the extracellular portion of a similar molecule. (b) Schematic 
drawing of an integrin in an active conformation and bound to a ligand, in 
this case talin. Binding of talin to the small cytoplasmic domain of the B 
subunit induces a separation of the two subunits and conversion to the 
active conformation. Activated integrins typically become clustered as the 
result of interactions of their cytoplasmic domains with the underlying 
cytoskeleton as indicated in Figure 7.16c. The extracellular ligand, in 

this case a collagen fiber, binds to both of the subunits in the head region 
of the activated integrin dimer. Note that conformations that are 
intermediate between those shown here may also exist and exhibit reduced 
ligand-binding. 

Source: Electron micrographs from Junichi Takagi et al., Cell 110:601, 2002; 
with permission from Elsevier; Ribbon drawings from T.-L. Lau, et al, Embo 
J. 28:1359, 2009, Fig. 9; Courtesy of Tobias Ulmer; © 2009, reprinted by 
permission from Macmillan Publishers, Ltd. 


these proteins, called talin, causes separation of the & and g subunits, 
as shown in the structure of the activated integrin in Figure 7.13b 
Mutations in talin that block its interaction with B-integrin subunits 
also prevent activation of integrins and adhesion to the ECM. 
Separation of the cytoplasmic tails and transmembrane domains of an 
integrin that results when talin is bound is thought to send a change in 
conformation through the legs of the integrin, causing the molecule to 
assume an upright position in which the head of the protein is capable 
of interacting specifically with its appropriate ligand. An increase in 
affinity of individual integrins is often correlated with the clustering of 
the activated integrins, which greatly enhances the overall strength of 
the interaction between the cell surface and its extracellular ligands. 
This type of alteration in integrin affinity triggered by changes occur- 
ring inside the cell is referred to as “inside-out” signaling. 

Integrins have been implicated in two major types of activities: 
adhesion of cells to their substratum (or to other cells) and transmission 
of signals between the external environment and the cell interior. We 
have already discussed the transmission of signals in an inside-out direc- 
tion, which primarily affects the binding properties of the integrins. 
Signaling can also occur in the opposite direction, a phenomenon known 
as “outside-in’ signaling. The binding of the extracellular domain of an 
integrin to a ligand, such as fibronectin or collagen, can induce a confor- 
mational change at the opposite, cytoplasmic end of the integrin, espe- 
cially its 8 subunit. Changes at the cytoplasmic end can, in turn, alter the 
way the integrin interacts with a host of different cytoplasmic proteins, 
modifying their activity. For example, outside-in signals can induce a 
conformational change in talin on the inside of the membrane, initiating 
a cascade of events leading to the polymerization of actin filaments of the 
cytoskeleton. Binding of integrins to an extracellular ligand can also 
trigger the activation of cytoplasmic protein kinases, such as FAK and 
Src (see Figure 7.16c). These kinases can then phosphorylate other pro- 
teins, initiating a chain reaction. In some cases, the chain reaction leads 
all the way to the nucleus, where a specific group of genes may be activated. 

Outside-in signals transmitted by integrins (and other cell surface 
molecules) can influence many aspects of cell behavior, including dif- 
ferentiation, motility, growth, and even the survival of the cell. The 
influence of integrins on cell survival is best illustrated by comparing 
normal and malignant cells. Most malignant cells are capable of grow- 
ing while suspended in a liquid culture medium. Normal cells, in con- 
trast, can only grow and divide if they are cultured on a solid 
substratum; if they are placed in suspension cultures, they die. Normal 
cells are thought to die in suspension culture because their integrins 
are not able to interact with extracellular substrates and, as a result, are 
not able to transmit life-saving signals to the interior of the cell. When 
cells become malignant, their survival no longer depends on integrin 
binding. The role of integrins in transmitting signals is a very active 
area of research and will be explored further in Section 7.10. 

Table 7.1 lists a number of known integrins and the key extracel- 
lular ligands to which they bind. Because individual cells may express 
a variety of different integrins on their cell surface, such cells are capa- 
ble of binding to a variety of different extracellular components (Figure 
7.5). Despite the apparent overlap seen in Table 7.1, most integrins do 
appear to have unique functions, as knockout mice (Section 18.17) 
that lack different integrin subunits exhibit distinct phenotypes. For 
example, a, knockouts show kidney defects, ~, knockouts exhibit 
heart defects, and a knockouts display vascular defects. 

Many of the extracellular proteins that bind to integrins do so 
because they contain the amino acid sequence arginine-glycine-aspartic 
acid (or, in abbreviated amino acid nomenclature, RGD). This tri- 
peptide is present in the cell-binding sites of proteoglycans, fibronec- 
tin, laminin, and various other extracellular proteins. The cell-binding 


domain of fibronectin, with its extended RGD-containing loop, is 233 


shown in Figure 7.10a. 

The discovery of the importance of the RGD sequence has 
opened the door to new avenues for treatment of medical conditions 
that involve receptor-ligand interactions. When the wall of a blood 
vessel is injured, the damaged region is sealed by the controlled aggre- 
gation of blood platelets, which are nonnucleated cell fragments that 
circulate in the blood. When it occurs at an inappropriate time or 
place, the aggregation of platelets can form a potentially dangerous 
blood clot (thrombus) that can block the flow of blood to major organs 
and is one of the leading causes of heart attack and stroke. The aggre- 
gation of platelets requires the interaction of a platelet-specific integ- 
rin (a,;,83) with soluble RGD-containing blood proteins, such as 
fibrinogen and von Willebrand factor, which act as linkers that hold 
the platelets together (FIGURE 7.14, top). Experiments with animals 
had indicated that RGD-containing peptides can inhibit blood clot 
formation by preventing the platelet integrin from binding to blood 
proteins (Figure 7.14, bottom). These findings led to the design of a 
new class of antithrombotic agents (Aggrastat and Integrelin) that 
resemble the RGD structure but bind selectively to the platelet integ- 
rin. Both of these drugs are based on the structures of compounds 
present in snake venom. A specific antibody (ReoPro) directed against 
the RGD binding site of «,,8, integrins can also prevent blood clots 
in certain patients undergoing high-risk vascular surgeries. A differ- 
ent integrin-targeting antibody (Tysabri) is prescribed for treatment 
of multiple sclerosis (see The Human Perspective, Chapter 17). 
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FIGURE 7.14 The role of integrins in platelet aggregation. (a) Blood 
clots form when platelets adhere to one another through fibrinogen 
bridges that bind to the platelet integrins. (b) The presence of synthetic 
RGD peptides can inhibit blood clot formation by competing with 
fibrinogen molecules for the RGD-binding sites on platelet a,B3 integrins. 
Nonpeptide RGD analogues and anti-integrin antibodies can act in a similar 
way to prevent clot formation in high-risk patients. 
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The cytoplasmic domains of integrins contain binding sites for 
a variety of cytoplasmic proteins, including several that act as adap- 
tors that link the integrin to actin filaments of the cytoskeleton (see 
Figure 7.16). The role of integrins in making the connection between 
the ECM and the cytoskeleton is best seen in two specialized struc- 
tures: focal adhesions and hemidesmosomes. 


REVIEW | 


1. How are integrins able to link the cell surface with 
materials that make up the ECM? How do the inactive 
and active structures of integrins differ from one 
another structurally and functionally? What is the 
significance of the presence of an RGD motif in an 
integrin ligand? 

2. How can a cell-surface protein be involved in both cell 
adhesion and transmembrane signal transduction? 


3. What is the difference between inside-out and 
outside-in signaling? What is the importance of each 
to cell activities? 


e—_______ 


7.6 Anchoring Cells 
to Their Substratum 


It is much easier to study the interaction of cells with the bottom of 
a culture dish than with an extracellular matrix inside an animal. 
Consequently, much of our knowledge of cell-matrix interactions 
has been obtained from the study of cells adhering to various 


(a) 2.5 um 


substrates in vitro. The stages in the attachment of a cultured cell to its 
substratum are shown in FIGURE 7.15. At first, the cell has a rounded 
morphology, as is generally true of animal cells suspended in aqueous 
medium. Once the cell makes contact with the substratum, it sends 
out projections that form increasingly stable attachments. Over time, 
the cell flattens and spreads itself out on the substratum. 


Focal Adhesions 


When fibroblasts or epithelial cells spread onto the bottom of a cul- 
ture dish, the lower surface of the cell is not pressed uniformly 
against the substratum. Instead, the cell is anchored to the surface of 
the dish only at scattered, discrete sites, called focal adhesions 
(FIGURE 7.16). Focal adhesions are dynamic structures that can be 
rapidly disassembled if the adherent cell is stimulated to move or 
enter mitosis. The plasma membrane of a focal adhesion contains 
large clusters of integrins. The cytoplasmic domains of the integrins 
are connected to actin filaments of the cytoskeleton through strati- 
fied layers of adaptor proteins (e.g., talin, a-actinin and vinculin) 
(Figure 7.16c). Focal adhesions play a key role in cell locomotion, 
during which the integrins develop transient interactions with extra- 
cellular materials (see Figure 9.65b). 

Focal adhesions are capable of creating mechanical forces or 
responding to such forces. These properties might be expected from 
a structure that contains actin and myosin, two of the cell’s major 
contractile proteins. FIGURE 7.17 shows a fibroblast lying on a bed of 
fibronectin-coated posts. The posts are flexible and capable of being 
moved independently of one another. It can be seen in this micro- 
graph that the tips of the posts are being deflected as a result of 
pulling forces exerted by the cell’s focal adhesions. 

Acting in the opposite direction, mechanical forces applied to 
the surfaces of cells can be converted by focal adhesions into bio- 


(d) 2.5 um 


FIGURE 7.15 Steps in the process of cell spreading. Scanning electron micrographs showing the morphology of mouse fibroblasts at successive times 
during attachment and spreading on glass coverslips. Cells were fixed after (a) 30 minutes, (b) 60 minutes, (c) 2 hours, and (d ) 24 hours of attachment. 


Source: (a-d) From J. J. Rosen and L. A. Culp, Exp. Cell Res. 107:141, 1977 with permission from Elsevier. 


(a) 
FIGURE 7.16 Focal adhesions are sites 


where cells adhere to their substratum 
and transmit signals in both directions 
across the plasma membrane. (a) This 
cultured cell has been stained with fluores- 
cent antibodies to reveal the locations of 
actin filaments (gray-green) and integrins 
(red). The integrins are localized in small 
patches that correspond to the sites of focal 
adhesions. (b) The cytoplasmic surface of a 
focal adhesion of a cultured amphibian cell is 
shown here after the inner surface of the 
membrane was processed for quick-freeze, 
deep-etch analysis. Bundles of actin (c) 
filaments are seen to associate with the inner 
surface of the membrane in the region of a 


(b) 2.5 um 
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focal adhesion. (c) Schematic drawing of a focal adhesion showing the interactions of integrin molecules with other proteins on both sides of the lipid bilayer. The 
binding of extracellular ligands, such as collagen and fibronectin, is thought to induce conformational changes in the cytoplasmic domains of the integrins that 
cause the integrins (red) to become linked to actin filaments of the cytoskeleton. Linkages with the cytoskeleton lead, in turn, to the clustering of integrins at the 
cell surface. Linkages with the cytoskeleton are mediated by various actin-binding proteins, such as talin (shown in yellow) and a-actinin (purple), that bind to the 
B subunit of the integrin. During the process of focal adhesion formation, talin undergoes a conformational change that exposes binding sites on talin’s rod 
domain. The binding of vinculin (light green) to these exposed sites promotes the assembly of additional actin filaments. The cytoplasmic domains of integrins 
are also associated with protein kinases, such as FAK (focal adhesion kinase, shown in orange) and Src. The attachment of the integrin to an extracellular ligand can 
activate these protein kinases and start a chain reaction that transmits signals throughout the cell. The association of myosin molecules with the actin filaments can 
generate traction forces that are transmitted to sites of cell-substrate attachment. 

Source: (a) From Molecular expressions.com at Florida State University; (b) From Steven J. Samuelsson, Paul J. Luther, David W. Pumplin, and Robert J. 
Bloch, J. Cell Biol. 122:487, 1993; Fig. 1. Reproduced with permission of the Rockefeller University Press. (c): From P. Kanchanawong et al., Reprinted by 


permission from Macmillan Publishers Ltd: Nature 468:580-584, 2010. 


chemical signals in the cytoplasm. This process of signal conversion 
is described as mechanotransduction, and the focal adhesion is act- 
ing as a mechanosensor by recognizing the physical properties of the 
environment. The ability of cells to respond to physical forces is 
important in many cellular behaviors and can be illustrated by a 
study in which cells were allowed to bind to beads that had been 
covered with a coating of fibronectin. When the membrane-bound 
beads were pulled by an optical tweezer (page 135), the mechanical 
stimulus was transmitted into the cell interior where it generated a 
wave of Src kinase activation. The strength of an integrin-ligand 
bond was measured to be on the order of 40 piconewtons (pN), 
based on an experiment where an integrin ligand was immobilized 
on a surface using different molecular tethers that rupture at differ- 
ent forces. As shown in FIGURE 7.18, a tether of 56pN allowed the 
cell to form strong focal adhesions and actin stress fibers, while 
a tether of 43pN did not. Mechanotransduction is thought to 
be mediated by conformational changes in some of the adaptor pro- 
teins, such as talin, that are induced by stretching. Such conforma- 
tional changes can expose important binding sites in these proteins 


that were previously hidden, allowing additional protein molecules 
to be recruited to the adhesion complex. 

Figure 7.16c illustrates how the pull of a fibronectin or collagen 
molecule is able to activate protein kinases, such as FAK or Src, which 
would then phosphorylate various substrates. Activation of these 
protein kinases can transmit signals throughout the cell, including 
the cell nucleus, where they can promote changes in gene expression. 
Activation of either FAK or Src can dramatically alter cell behavior. 
The importance of the physical properties of a cell’s environment in 
influencing cellular behavior is illustrated by a study in which mesen- 
chymal stem cells (MSCs) derived from adult bone marrow were grown 
on substrates of varying elasticity (or stiffness). When these MSCs 
were grown on a soft, pliable substratum, such as might be encoun- 
tered by a cell within the developing brain, the MSCs differentiated 
into nerve cells. When grown on a substratum of greater stiffness, 
these same cells differentiated into muscle cells. Finally, when grown 
on even stiffer substrates, such as those that might be home to cells 
growing in a skeletal tissue such as cartilage or bone, the MSCs dif- 
ferentiated into osteoblasts, which give rise to bone cells. 
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FIGURE 7.17 Experimental demonstration of forces exerted by focal 
adhesions. Scanning-electron micrograph of a smooth muscle cell 

sitting atop a bed of flexible (elastomeric) micro-posts whose tips had 
been coated with fibronectin. The attached cell is seen deflecting the 
position of multiple posts. The degree of movement of a particular post 
reflects the magnitude of the traction forces exerted by the cell on 

that post. 

Source: From John L. Tan, et al., PNAS 100:1484, 2003, Fig. 2d, 

courtesy of Christopher S. Chen. © 2003 National Academy of 

Sciences, U.S.A. 


FIGURE 7.18 Measuring the force necessary to activate integrins. 
In this experiment, an integrin ligand is attached to a glass substrate using 
a tether that will rupture once a specific force is applied. Mammalian cells 
are allowed to spread on the glass, and the cells are stained for actin 
(green) and a focal adhesion marker (vinculin, red). When tethers of a 
rupture strength of 43 piconewtons were used, integrins were not activated, 


and focal adhesions did not form. However, tethers of a rupture strength of 
56 piconewtons resulted in integrin activation, as seen through the 


formation of focal adhesions (right). 


Source: From Wang et al., Science 2013 May 24;340(6135):991-4. doi: 
10.1126/science.1231041. 


Hemidesmosomes 


Focal adhesions are most commonly seen in cells grown in vitro, 
although similar types of adhesive contacts are found in certain tissues, 
such as muscle and tendon. Within the body, the tightest attachment 
between a cell and its extracellular matrix is seen at the basal surface 
of epithelial cells where the cells are anchored to the underlying base- 
ment membrane by a specialized adhesive structure called a 
hemidesmosome (Figures 7.1 and 7.19). Hemidesmosomes con- 
tain a dense plaque on the inner surface of the plasma membrane with 
filaments coursing outward into the cytoplasm (FIGURE 7.19a). Unlike 
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FIGURE 7.19 Hemidesmosomes are differentiated sites at the basal 
surfaces of epithelial cells where the cells are attached to the underlying 
basement membrane. (a) Electron micrograph of several hemidesmosomes 
showing the dense plaque on the inner surface of the plasma membrane 
and the intermediate filaments projecting into the cytoplasm. (b) Schematic 
diagram showing the major components of a hemidesmosome connecting 
the epidermis to the underlying dermis. The a4B,4 integrin molecules of the 
epidermal cells are linked to cytoplasmic intermediate filaments by a 
protein called plectin that is present in the dark-staining plaque and to the 
basement membrane by anchoring filaments of a particular type of laminin. 
A second transmembrane protein (BP180) is also present in hemides- 
mosomes. The collagen fibers are part of the underlying dermis. 


Source: (a) From Douglas E. Kelly, J. Cell Biol. 28:51, 1966 (bottom Figure), 
Fig. 11. Reproduced with permission of the Rockefeller University Press. 


the filaments of focal adhesions, which consist of actin, the filaments 
of the hemidesmosome are thicker and consist of the protein keratin. 
Keratin-containing filaments are classified as intermediate filaments, 
which serve primarily in a supportive function (as discussed in detail 
in Section 9.9), The keratin filaments of the hemidesmosome are 
linked to the extracellular matrix by membrane-spanning integrins, 
including a8, (Figure 7.19b). Like their counterparts in focal adhe- 
sions, these integrins also transmit signals from the ECM that affect 
the shape and activities of the attached epithelial cells. 

The importance of hemidesmosomes is revealed by a rare dis- 
ease, bullous pemphigoid, in which individuals produce antibodies 
that recognize proteins present in these adhesive structures. Diseases 
caused by production of antibodies directed against one’s own tis- 
sues (i.e., autoantibodies) are called autoimmune disorders and are 
responsible for a wide variety of conditions. In this case, the presence 
of autoantibodies causes the lower layer of the epidermis to lose 
attachment to the underlying basement membrane (and thus to the 
underlying connective tissue layer of the dermis). The leakage of 
fluid into the space beneath the epidermis results in severe blistering 
of the skin. A similar inherited blistering disease, epidermolysis bul- 
losa, can occur in patients with genetic alterations in any one of a 
number of hemidesmosomal proteins, including the a, or By integrin 
subunit, collagen VII, or laminin-5. 


REVIEW | 


1. List two characteristics that distinguish 
hemidesmosomes from focal adhesions. 


7.7 Interactions of Cells 
with Other Cells 


Examination of a thin section through a major organ of an animal 
reveals a complex architecture involving a variety of different types 
of cells. Little is known about the precise mechanisms responsible 


FIGURE 7.20 Experimental demonstration of cell-cell recognition. 
When cells from different parts of an embryo are dissociated and then 
intermixed, the cells initially aggregate and then sort out by associating 
with other cells of the same type. The results of two such classic experi- 
ments are shown here. (a) In this experiment, two regions of an early 
amphibian embryo (the ectoderm and mesoderm) were dissociated into 
single cells and combined. At first the cells form a mixed aggregate, but 
eventually they sort out. The ectodermal cells (shown in red) move to the 
outer surface of the aggregate, which is where they would be located in the 
embryo, and the mesodermal cells (shown in purple) move to the interior, 
the position they would occupy in the embryo. Both types of cells then 
differentiate into the types of structures they would normally give rise to. 
(b) Light micrograph showing the results of an experiment in which 
precartilage cells from a chick limb are mixed with chick heart ventricle 
cells. The two types of cells have sorted themselves out of the mixed 
aggregate with the heart cells forming a layer on the outside of the 
precartilage cells. It is proposed that the precartilage cells collect in the 
center of the aggregate because the cells adhere to one another more 
strongly than do the cells from the heart. (This and other models are 
discussed in Nat. Cell Biol. 10:375, 2008.) 


Source: (a) P. L. Townes, J. Holtfreter, Journal of Experimental Zoology 
128:53, 1955; (b) M. S. Steinberg, Journal Experimental Zoology 173-411, 
1970. This material is used by permission of John Wiley & Sons Inc. 
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found within developing organs. It is presumed that this process depends 
heavily on selective interactions between cells of the same type, as well as 
between cells of a different type. It is evident that cells can recognize the 
surfaces of other cells, interacting with some and ignoring others. 
Early attempts to learn about cell-cell recognition and adhesion 
were carried out by removing a developing organ from a chick or 
amphibian embryo, dissociating the organ’s tissues to form a 
suspension of single cells, and determining the ability of the cells to 
reaggregate in culture (FIGURE 7.20a). In experiments in which cells 
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together, the cells would initially aggregate to form a mixed clump. 
Over time, however, the cells would move around within the aggregate 
and eventually “sort themselves out” so that each cell adhered only to 
cells of the same type (Figure 7.20b). Once separated into a homoge- 
neous cluster, these embryonic cells would often differentiate into 
many of the structures they would have formed within an intact 
embryo. 

Researchers now have identified dozens of different proteins 
involved in cell adhesion. Different arrays of these molecules on the 
surfaces of different types of cells are thought to be responsible for 
the specific interactions between cells within complex tissues. 

Four distinct families of integral membrane proteins play a 
major role in mediating cell-cell adhesion: (1) selectins (2) certain 
members of the immunoglobulin superfamily (IgSF), (3) certain 
members of the integrin family, and (4) cadherins. 


Selectins 


During the 1960s, it was discovered that lymphocytes that had been 
removed from peripheral lymph nodes, radioactively labeled, and 


_ injected back into the body would return to the sites from which 
they were originally derived—a phenomenon called lymphocyte 


Lectin-like 
domain 


Fn til 


L1 protein 


EGF-like 
domain 


Structural 
domain 


L-selectin 
IgG 


E-selectin 


P-selectin 


L1 protein 


(a) () 


homing. It was subsequently found that homing could be studied in 
vitro by allowing lymphocytes to adhere to frozen sections of lym- 
phoid organs. Under these experimental conditions, the lympho- 
cytes would selectively adhere to the endothelial lining of the 
venules (the smallest veins) of peripheral lymph nodes. Binding of 
lymphocytes to venules could be blocked by antibodies directed 
against a specific glycoprotein, called L-selectin, on the lymphocyte 
surface. 

Selectins comprise a family of integral membrane glycopro- 
teins that recognize and bind to a particular arrangement of sugars 
in the oligosaccharides that project from the surfaces of other cells 
(Section 4.3). The name of this class of cell-surface receptors is 
derived from the word “lectin,” a general term for a compound that 
binds to specific carbohydrate groups. Selectins possess a small cyto- 
plasmic segment, a single membrane-spanning domain, and a large 
extracellular portion that consists of a number of separate domains, 
including an outermost domain that acts as the lectin (FIGURE 7.21A). 
There are three known selectins: E-selectin, present on endothelial 
cells; P-selectin, present on platelets and endothelial cells; and 
L-selectin, present on leukocytes (white blood cells). All three selectins 
recognize a particular grouping of sugars (Figure 7.21a) that is found 
at the ends of the carbohydrate chains of certain complex 
glycoproteins. Binding of selectins to their carbohydrate ligands 
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FIGURE 7.21 Overview of cell adhesion molecules. (a) Selectins. All three types of known selectins recognize and bind to a similar carbohydrate ligand 
_ at the ends of oligosaccharide chains on glycoproteins. In the inset, the detailed structure of the carbohydrate ligand is shown. The terminal fucose and 
sialic acid moieties are particularly important in selectin recognition, and the N-acetylglucosamine moiety is often sulfated. (b) L1 molecules. L1 is a 
member of the immunoglobulin (lg) superfamily. Cell-cell adhesion results from the specific interactions of the immunoglobulin (lg) domains of two L1 
molecules projecting from the surfaces of neighboring cells. Each L1 molecule contains a small cytoplasmic domain, a transmembrane segment, several 
segments that resemble one type of module found in fibronectin, and six lg domains situated at the N-terminal portion of the molecule. The inset shows 
the structure of the two N-terminal lg domains of VCAM, an IgSF molecule on the surface of endothelial cells. The Ig domains of VCAM and L1 have a 
similar three-dimensional structure consisting of two sheets packed face-to-face. (c) Integrin-IgSF interactions. |gSF family members can also bind to 
integrins of neighboring cells (d) Cadherins. Cell-cell adhesion results from the interactions between similar types of cadherins projecting from the plasma 
membrane of each cell. Calcium ions (shown as small yellow spheres) are situated between the successive domains of the cadherin molecule where they 
play a critical role in maintaining the rigidity of the extracellular portion of the protein. This illustration shows several alternate models by which cadherins 
from opposing cells might interact. Different types of studies have suggested different degrees of overlap (interdigitation) between the extracellular 
domains of molecules from opposing cells. For consistency, subsequent figures will depict cadherins with a single domain overlap, which is likely the 


predominant configuration. 


Source: Inset of (b) From E. Yvonne Jones et al., Nature 373:540, 1995; © 1995, reprinted by permission from Macmillan Publishers Ltd. 


requires calcium. As a group, selectins mediate transient interactions 
between circulating leukocytes and vessel walls at sites of 
inflammation and clotting. Capturing a leukocyte is a challenging 
task because these cells are flowing through the bloodstream at a 
considerable rate of speed. Selectins are well suited for this function 
because the bonds that they form with their ligands become stronger 
when the interaction is placed under mechanical stress, as occurs 
when the leukocyte is being pulled away from a given site on the ves- 
sel wall. The role of selectins in inflammation is discussed further in 
the Human Perspective in Section 7.8. 


The Immunoglobulin Superfamily 


Elucidation of the structure of bloodborne antibody molecules in 
the 1960s was one of the milestones in our understanding of the 
immune response. Antibodies, which are a type of protein called an 
immunoglobulin (or Ig), were found to consist of polypeptide 
chains composed of a number of similar domains. Each of these Ig 
domains, as they are called, is composed of 70 to 110 amino acids 
organized into a tightly folded structure, as shown in the inset of 
Figure 7.21b. The human genome encodes 765 distinct Ig domains, 
making it the most abundant domain in human proteins. Taken as a 
group, these proteins are members of the immunoglobulin super- 
family, or IgSF. Most members of the IgSF are involved in various 
aspects of immune function, but some of these proteins mediate 
calcium-independent cell-cell adhesion. In fact, the discovery of Ig- 
like domains in cell-adhesion receptors in invertebrates—animals 
that lack a classic immune system—suggests that Ig-like proteins 
originally evolved as cell-adhesion mediators and only secondarily 
took on their functions as effectors of the vertebrate immune 
system. 

Most IgSF cell-adhesion molecules mediate the specific interac- 
tions of lymphocytes with cells required for an immune response 
(e.g., macrophages, other lymphocytes, and target cells). However, 
some IgSF members, such as VCAM (vascular cell-adhesion mole- 
cule), NCAM (neural cell-adhesion molecule), and L1 (also called 
L1CAM), mediate adhesion between nonimmune cells. NCAM and 
L1, for example, play important roles in nerve outgrowth, synapse 
formation, and other events during the development of the nervous 
system. Like fibronectin and many other proteins involved in cell 
adhesion, the IgSF cell adhesion molecules have a modular con- 
struction (Figure 7.21b,c) and are composed of individual domains 
of similar structure to the domains in other proteins. 

The importance of L1 in neural development has been revealed 
in several ways. In humans, mutations in the L1 gene can have dev- 
astating consequences. In extreme cases, babies are born with a fatal 
condition of hydrocephalus (“water on the brain”). Children with 
less severe mutations typically exhibit mental retardation and diffi- 
culty in controlling limb movements (spasticity). Autopsies on 
patients that have died of an L1-deficiency disease reveal a remark- 
able condition: They are often missing two large nerve tracts, one 
that runs between the two halves of the brain and the other that runs 
between the brain and the spinal cord. The absence of such nerve 
tracts suggests that L1 is involved in the directed growth of axons 
within the embryonic nervous system. 

Various types of proteins serve as ligands for IgSF cell surface mol- 
ecules. As described earlier, most integrins facilitate adhesion of cells to 
their substratum, but a few integrins mediate cell-cell adhesion by 


binding to proteins on other cells (Figure 7.21c). For example, the integ- 239 


rin g §, on the surface of leukocytes binds to VCAM, an IgSF protein on 
the endothelial lining of certain blood vessels (Section 7.8). 


Cadherins 


The cadherins are a large family of glycoproteins that mediate Ca”*- 
dependent cell-cell adhesion and transmit signals from the ECM to 
the cytoplasm. Cadherins typically join cells of similar type to one 
another and do so predominantly by binding to the same cadherin 
present on the surface of the neighboring cell. This property of cad- 
herins was first demonstrated by genetically engineering cells that 
were normally nonadhesive to express one of a variety of different 
cadherins. The cells were then mixed in various combinations and 
their interactions monitored. It was found that cells expressing one 
species of cadherin preferentially adhered to other cells expressing 
the same cadherin. These types of findings have led researchers to 
believe that cadherins are largely responsible for the ability of like 
cells to “sort out” of mixed aggregates, as was illustrated in Figure 7.20. 
In fact, cadherins may be the single most important factor in molding 
cells into cohesive tissues in the embryo and holding them together 
in the adult. As discussed in the Human Perspective, the loss of cad- 
herin function may play a key role in the spread of malignant tumors. 

Like many other proteins involved in adhesion, cadherins have 
a modular construction. The best-studied cadherins are E-cadherin 
(epithelial), N-cadherin (neural), and P-cadherin (placental). These 
“classical” cadherins, as they are called, contain a relatively large 
extracellular segment consisting of five tandem domains of similar 
size and structure, a single transmembrane segment, and a small 
cytoplasmic domain (Figure 7.21d). The cytoplasmic domain is 
often associated with members of the catenin family of cytosolic pro- 
teins, which have a dual role: They tether the cadherins to the 
cytoskeleton (see Figure 7.25), and they transmit signals to the cyto- 
plasm and nucleus. A number of models of cadherin adhesion are 
depicted in Figure 7.21d. Structural studies indicate that cadherins 
from the same cell-surface associate laterally to form parallel dimers. 
These studies also shed light on the role of calcium, which has been 
known for decades to be essential for cell-cell adhesion. As indi- 
cated in Figure 7.21d, calcium ions form bridges between successive 
domains of a given molecule. These calcium ions maintain the extra- 
cellular portion of each cadherin in a rigid conformation required 
for cell adhesion. Adhesion between cells results from the interac- 
tion between the extracellular domains of cadherins from opposing 
cells to form a “cell-adhesion zipper.’ Different types of cells bearing 
different cadherins likely engage in different types of interactions, so 
that many different configurations (as depicted in Figure 7.21d) may 
occur within an organism. Just as cadherin interdigitation can be 
compared to a zipper, cadherin clusters can be compared to Velcro; 
the greater the number of interacting cadherins in a cluster, the 
greater the strength of adhesion between apposing cells. 

Embryonic development is characterized by change: change in 
gene expression, change in cell shape, change in cell motility, change 
in cell adhesion, and so forth. Cadherins are thought to mediate 
many of the dynamic changes in adhesive contacts that are required 
to construct the tissues and organs of an embryo, which is known as 
the process of morphogenesis. For example, a number of morphoge- 
netic events during embryonic development involve a group of 
cells changing from an epithelium (tightly adherent, apical-basal 
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FIGURE 7.22 Cadherins and morphogenesis. (a) During gastrulation, cells in the upper layer of the embryo (the epiblast) move toward a groove 

in the center of the embryo, sink into the groove, and then migrate laterally as mesenchymal cells in the space beneath the epiblast. This epithelial- 
mesenchymal transition is marked by a loss of expression of E-cadherin that is characteristic of epithelial cells. Cells expressing E-cadherin are depicted in 
orange. (b) Scanning electron micrograph of a chick embryo during gastrulation that has been fractured to reveal the cells undergoing the epithelial- 
mesenchymal transition (arrow) depicted in part a. (c) This sequence of drawings depicts the development of the neural tube, which is an epithelial layer 
that forms by separation from the overlying layer of dorsal ectoderm. In the top drawing, the epithelial cells are expressing E-cadherin. In the lower 
drawings, the cells of the neural tube stop expressing E-cadherin (orange) and instead express N-cadherin (blue). 


Source: Courtesy of Michael Solursh and Jean Paul Revel. 


polarized, stationary cell layer) toa mesenchyme (solitary, nonadhe- 
sive, nonpolarized, migratory cells), or vice versa. The epithelial- 
mesenchymal transition (or EMT) is illustrated by the formation 
of the mesoderm during gastrulation in a chick or mammalian 
embryo. Typically, these cells break away from a cohesive epithelial 
layer (called the epiblast) at the dorsal surface of the early embryo 
and wander into the interior regions as mesenchymal cells 
(FIGURE 7.22a,b). These mesenchymal cells will eventually give rise 
to mesodermal tissues such as blood, muscle, and bone. The cells of 
the epiblast display E-cadherins on their surface, which is presumed 
to promote their close association with one another. Prior to their 
separation from the epiblast, the future mesodermal cells stop 
expressing E-cadherin, which is thought to promote their release 
from the epithelium and transformation into mesenchymal cells 
(Figure 7.22a). At a later stage of development, another major event, 
the formation of the primitive nervous system, is also characterized 
by changes in cadherin expression. Following gastrulation, the dor- 
sal surface of the embryo is covered by a single-celled epithelial layer, 
which will become the ectodermal tissues of the animal (including 
the skin and nervous system). At this stage, the cells in the central 


region of this layer stop expressing E-cadherin and begin expressing 
N-cadherin (Figure 7.22c). In subsequent stages, the epithelial cells 
expressing N-cadherin separate from their neighbors on either side 
and roll into the neural tube, which will go on to become the animal's 
brain and spinal cord. It is thought that cadherins (and other cell- 
adhesion molecules) play a key role in these events by changing the 
adhesive properties of cells. 

To date, over 100 cadherins have been identified in humans, 
many of which are thought to be functionally redundant. Single 
mutations in a number of cadherins, however, have been linked to 
genetic diseases. One of the most notable is Usher syndrome, 
which is characterized by deafness and gradual loss of vision. Usher 
syndrome can be caused by mutations in a number of genes, but 
some of the most severe forms are caused by mutations in either 
cadherin 23 or protocadherin 15. These non-classical cadherins 
form a fine tether, called a tip link, between adjacent stereocilia on 
the surface of hair cells in the inner ear, as shown in FIGURE 7.23. Tip 
links are essential for converting the mechanical movement of the 
stereocilia (caused by soundwaves) into an electrical current, and 
also play a critical structural role in the maintenance of stereociliary 


organization. Mutations in these tip links can result in severe defects 
in the organization and development of stereocilia, resulting in 
profound deafness. 


REVIEW | 


1. Which type(s) of cell junctions contain actin filaments? 
Which contain(s) intermediate filaments? Which 
contain(s) integrins? Which contain(s) cadherins? 

2. How do cadherins, IgSF members, and selectins differ 


at the molecular level in the way they mediate cell-cell 
adhesion? 


oo 


FIGURE 7.23 Cadherins form tip links of stereocilia. Cadherin 23 and 
protocadherin 15 form the tip link on stereocilia that play a key role in 
converting mechanical force, in the form of sound waves, into electrical 
signals that are carried to the brain. (a) A schematic drawing of stereocilia, 
which are located on cells known as hair cells in the inner ear, is shown 
here. The tip links are colored in red. The structure of the tip link is shown 
in (b), with cadherin 23 show in blue and protocadherin shown in purple. 
The crystal structure of the interacting domains is revealed in (c). 


Source: From M. Sotomayor et al, Reprinted by permission from Macmillan 
Publishers Ltd: Nature 492:128-32, 2012. 
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7.8 THE HUMAN PERSPECTIVE 


The Role of Cell Adhesion in Inflammation and Metastasis 


Inflammation is one of the primary responses to infection. If a part 
of the body becomes contaminated by bacteria, as might occur fol- 
lowing a puncture wound to the skin, the site of injury becomes a 
magnet for a variety of white blood cells. White blood cells (leu- 
kocytes) that would normally remain in the bloodstream are stimu- 
lated to traverse the endothelial layer that lines the smallest veins 
(venules) in the region and enter the tissue. Once in the tissue, 
the leukocytes move in response to chemical signals toward the 
invading microorganisms, which they ingest. Although inflamma- 
tion is a protective response, it also produces adverse side effects, 
such as fever, swelling due to fluid accumulation, redness, and pain. 

Inflammation can also be triggered inappropriately. For 
example, damage to the tissues of the heart or brain can occur 
when blood flow to these organs is blocked during a heart attack 
or stroke. When blood flow to the organ is restored, circulating 
leukocytes may attack the damaged tissue, causing a condition 
known as reperfusion damage. An overzealous inflammatory 
response can also lead to asthma, toxic shock syndrome, and 


*A remarkable film of a leukocyte “chasing” a bacterium can be found on the 
Web by searching the keywords “neutrophil crawling.” 


respiratory distress syndrome. A great deal of research has 
focused on questions related to these conditions: How are leu- 
kocytes recruited to sites of inflammation? How are they able to 
stop flowing through the bloodstream and adhere to vessel walls? 
How do they penetrate the walls of the vessels? How can some 
of the negative side effects of inflammation be blocked without 
interfering with the beneficial aspects of the response? Answers 
to questions about inflammation have focused on three types of 
cell adhesion molecules: selectins, integrins, and IgSF proteins. 
A chain of events that has been proposed to occur during 
acute inflammation is shown in FIGURE 1. The first step begins as 
the walls of the venules become activated in response to chemical 
signals from nearby damaged tissue. The endothelial cells that 
line these venules become more adhesive to circulating neutro- 
phils, a type of phagocytic leukocyte that carries out a rapid, non- 
specific attack on invading pathogens. This change in adhesion 
is mediated by a temporary display of P- and E-selectins on the 
surfaces of the activated endothelial cells in the damaged area 
(step 2, Figure 1). When neutrophils encounter the selectins, they 
form transient adhesions that dramatically slow their movement 
through the vessel. At this stage, the neutrophils can be seen to 


continued 
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FIGURE 1. Steps in the movement of neutrophils from the bloodstream during inflammation. The steps are described in the text. 


“roll” slowly along the wall of the vessel. Anumber of biotechnology 
companies are attempting to develop anti-inflammatory drugs 
that act by interfering with binding of ligands to E- and P-selectins. 
Anti-selectin antibodies block neutrophil rolling on selectin-coated 
surfaces in vitro and suppress inflammation and reperfusion damage 
in animals. A similar type of blocking effect has been attained 
using synthetic carbohydrates (e.g., efomycines) that bind to 
E- and P-selectin, thereby competing with carbohydrate ligands 
on the surfaces of the neutrophils. 

As the neutrophils interact with the inflamed venule endothe- 
lium, interactions take place between other molecules on the sur- 
faces of the two types of cells. One of the molecules displayed on 
the surface of the endothelial cells is a phospholipid called platelet 
activating factor (PAF). PAF binds to a receptor on the surface of 
the neutrophil sending a signal into the neutrophil that leads to an 
increase in the binding activity of certain integrins (e.g., a.B2 and 
a4B,) already situated on the neutrophil surface (step 3, Figure 1). 
This is an example of the type of inside-out signal that is illustrated 
in Figure 7.13. The activated integrins then bind with high affinity 
to IgSF molecules (e.g., ICAM-1 and VCAM-1) on the surface of 
the endothelial cells, causing the neutrophils to stop their rolling 
and adhere firmly to the wall of the vessel (step 4). The bound neu- 
trophils then change their shape and squeeze between adjacent 
endothelial cells into the damaged tissue (step 5). Invading neu- 
trophils appear capable of disassembling the adherens junctions 
(Section 7.9) that form the major barrier between cells of the vessel 
wall. This cascade of events, which involves several different types 
of cell-adhesion molecules, ensures that attachment of blood 
cells to the walls of blood vessels and subsequent penetration will 
occur only at sites where leukocyte invasion is required. 

The importance of integrins in the inflammatory response is 
demonstrated by a rare disease called leukocyte adhesion defi- 
ciency (LAD). Persons with the type | form of LAD are unable to 
produce the Bz subunit as part of a number of leukocyte integrins. 
As a result, the leukocytes of these individuals lack the ability to 
adhere to the endothelial layer of venules, a step required for their 
exit from the bloodstream. These patients suffer from repeated 
life-threatening bacterial infections. The disease is best treated 
by bone marrow transplantation, which provides the patient with 
stem cells capable of forming normal leukocytes. Administration 
of antibodies against the Bz subunit can mimic the effects of LAD, 
blocking the movement of neutrophils and other leukocytes out of 
blood vessels. Such antibodies might prove useful in preventing 
inflammatory responses associated with diseases such as asthma 
and rheumatoid arthritis or with reperfusion. 


Cancer is a disease in which cells escape from the body's normal 
growth control mechanisms and proliferate in an unregulated 
manner. Were the malignant cells to remain in a single mass, 
as often occurs in some types of skin cancer or thyroid cancer, 
most cancers would be readily cured by surgical removal of the 
diseased tissue. Most malignant tumors, however, spawn cells 
that are capable of leaving the primary tumor mass and entering 
the bloodstream or lymphatic channels, thereby initiating the 
growth of secondary tumors in other parts of the body (FIGURE 2A). 
The spread of a tumor within the body is called metastasis and is 
the reason cancer is such a devastating disease. Metastatic cells 
(cancer cells that are able to initiate the formation of secondary 
tumors) are thought to have special cell-surface properties that 
are not shared by most other cells in the tumor. For example: 


1. Metastatic cells must be less adhesive than other cells to break 
free of the tumor mass. 


2. Metastatic cells must be able to penetrate numerous barriers, 
such as the extracellular matrices of surrounding connective 
tissue and the basement membranes that underlie the epi- 
thelium and line the blood vessels that carry them to distant 
sites (Figure 2a). Although the presence of cancer cells in the 
bloodstream is a potentially lethal circumstance, it does pro- 
vide an opportunity for nonsurgical access to a patient's can- 
cer. Anumber of devices have recently been developed to trap 
rare circulating tumor cells (CTCs) present in blood samples 
taken from cancer patients (Figure 2b). Once they have been 
captured, CTCs from a patient can be studied to determine 
the molecular characteristics of the tumor, how aggressive 
the tumor is likely to be, how it might be best treated, and 
how well a particular therapy is progressing as reflected in the 
decreasing number of CTCs over time. One day CTCs might 
be used as the basis for early detection of disease. 


3. Metastatic cells must be able to invade normal tissues and sur- 
vive to form secondary colonies. For some reason, this third 
step appears to pose the greatest obstacle in the process of 
metastasis for most types of tumors. Consequently, only a very 
small fraction of cancer cells that exit the bloodstream typically 
produce viable secondary tumors. 


The mechanisms used by cancer cells to penetrate extra- 
cellular matrices are poorly understood because such events 
have been virtually impossible to study within the tissues of a living 
animal. It is thought that movement through basement membranes 
is accomplished largely by ECM-digesting enzymes, most notably 


Vascular 
BM 


Cancer 
cell 


(a) 


(b) 


FIGURE 2 Steps leading to the metastatic spread of an epithelial cancer (a carcinoma). (a) A fraction of the cells of the primary 
tumor lose their adhesiveness to other tumor cells and gain the capability to penetrate the basement membrane (BM) barrier that 
underlies the epithelial tissue. These cells, which have assumed a mesenchymal-like appearance, migrate through the surrounding 
stromal tissue and cross the BM of a blood or lymph vessel, thereby entering the general circulation. The cells are carried to other 
tissues, where they migrate back across the BM of the vessel and enter a tissue in which they possess the potential to form secondary 
tumors. Only a very small percentage of tumor cells that are released from a primary tumor manage to overcome these numerous 
hurdles, but those that do pose a threat to the life of the host. (b) These circulating tumor cells (CTCs) have been isolated from a blood 
sample of a patient with prostate cancer. Even though the blood of a cancer patient may contain less than one cancer cell for every 
billion normal cells, these rare cancer cells can be selectively trapped on a chip that has been coated with antibody molecules directed 
against a cell-surface protein (in this case, EXpCAM) that is present on the cancer cells and absent from normal blood cells. 


Source: (a) R. G. Rowe, S. J. Weiss, Trends Cell Biology 18:562, 2008, Copyright 2008, with permission from Elsevier Science. Trends in 
Cell Biology by Elsevier Ltd. Reproduced with permission of Elsevier Ltd. in the format Journal via Copyright Clearance Center; 

(b) From Min Yu, Shannot Stott, et al., the cover of J. Cell Biol. Vol. 192, #3, 2011, courtesy of Daniel A. Haber, Shannon Stott, 

and Min Yu. Reproduced with permission of the Rockefeller University Press. 


the matrix metalloproteinases (MMPs) discussed on page 231. 
These enzymes are presumed to degrade the proteins and pro- 
teoglycans that stand in the way of cancer cell migration. In some 
types of cancer cells, MMPs are directly associated with long cell 
projections, called invadopodia, that push their way through 
the ECM. Among other tumor-promoting functions of MMPs, 
cleavage of certain proteins of the ECM or cell surface produces 
active protein fragments that act back on the cancer cells to stimu- 
late their growth and invasive character. Because of their apparent 
roles in the development of malignant tumors, MMPs became 
a prominent target of the pharmaceutical industry. Once it was 
demonstrated that synthetic MMP inhibitors were able to reduce 
metastasis in mice, a number of clinical trials of these drugs were 
conducted on patients with a variety of advanced, inoperable can- 
cers. Unfortunately, these inhibitors have shown little promise in 
stopping late-stage tumor progression and, in some cases, have 
led to joint damage. To date, the only FDA-approved MMP inhib- 
itor (Periostat) is used to treat periodontal disease. 

Studies have shown that changes to the ECM itself accom- 
pany many diseases, including cancer. In breast cancer, for in- 
stance, the ECM surrounding tumor cells can be up to 10 times 
stiffer, resulting in the ability to feel a “lump” during a breast 
exam. This increased stiffness is thought to be due to increased 
crosslinks in collagen and other components of the ECM. In 
addition, breast tumors show an increase in linearized and bun- 
dled forms of collagen compared to normal breast tissue. This 
reorganization of the ECM by cancer tissue appears to play an 
important role in metastases, because stiffer, bundled collagen 
tends to promote cell migration. 


Changes in the numbers and types of various cell-adhe- 
sion molecules—and thus the ability of cells to adhere to other 
cells or to extracellular matrices—have also been implicated in 
promoting metastasis. The greatest focus in this area has centered 
on E-cadherin, which is the predominant cell-adhesion molecule 
of the adherens junctions that hold epithelial cells in a cohesive 
sheet. Page 240 describes how the loss of E-cadherin from epi- 
thelial cells during certain events of embryonic development is 
associated with conversion of the cells into a less adhesive, more 
motile, mesenchymal phenotype. A remarkably similar epithelial- 
mesenchymal transition occurs at the peripheral edges of a tumor 
as malignant cells separate from the primary tumor mass and 
invade the adjacent normal tissue (Figure 2a). This is an important 
step in the process of metastasis. Surveys of a variety of epithelial 
cell tumors (e.g., breast, prostate, and colon cancers) confirm that 
these malignant cells have greatly reduced levels of E-cadherin; 
the lower the level of expression of E-cacherin, the greater the 
cell's metastatic potential. Conversely, when malignant cells are 
forced to express extra copies of the E-cadherin gene, the cells 
become much less capable of causing tumors when injected into 
host animals. The presence of E-cadherin is thought to favor the 
adhesion of cells to one another and suppress the dispersal of 
tumor cells to distant sites. E-cadherin may also inhibit the sig- 
naling pathways within the cell that lead to tissue invasion and 
metastasis. The importance of E-cadherin is evident from a study 
of a family of native New Zealanders that had lost 25 members 
to stomach cancer over a 30-year period. Analysis of the DNA 
of family members has revealed that susceptible individuals carry 
mutations in the gene encoding E-cadherin. 
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7.9 Adherens Junctions 
and Desmosomes 


Whereas cadherins are typically distributed diffusely all along the 
surfaces of two adherent cells, they also participate in the formation 
of specialized intercellular junctions called adherens junctions and 
desmosomes. In addition to these adhesive junctions, epithelial cells 
often contain other types of cell junctions, called tight junctions and 
gap junctions, that will be discussed later in the chapter. 

Adherens junctions are found in a variety of sites within the 
body. They are particularly common in epithelia, such as the lining 
of the intestine, where they occur as “belts” (or zonulae adherens) 
that encircle each of the cells near its apical surface, binding that cell 
to its surrounding neighbors (FIGURE 7.24). In an adherens junction, 
cells are held together by calcium-dependent linkages formed 
between the extracellular domains of cadherin molecules that bridge 
the 30-nm gap between neighboring cells (FIGURE 7.25). As Figure 
7.25a-b illustrates, the cytoplasmic domain of these cadherins is 
linked by a- and B-catenins to a variety of cytoplasmic proteins, 
including actin filaments of the cytoskeleton. Thus, like the integrins 
of a focal adhesion, the cadherin clusters of an adherens junction 
(1) connect the external environment to the actin cytoskeleton 
and (2) provide a pathway for signals to be transmitted from the cell 


Adherens 
junction 


exterior to the cytoplasm. To give one example, the adherens junc- 
tions situated between endothelial cells that line the walls of blood 
vessels transmit signals that ensure the survival of the cells. Mice that 
are lacking an endothelial cell cadherin are unable to transmit these 
survival signals, and these animals die during embryonic develop- 
ment as a result of the death of the cells lining the vessel walls. 
Desmosomes (or maculae adherens) are disk-shaped adhesive 
junctions approximately 1um in diameter (Figure 7.26) that are 
found in a variety of tissues. Desmosomes are particularly numerous 
in tissues that are subjected to mechanical stress, such as cardiac 
muscle and the epithelial layers of the skin and uterine cervix. Like 
adherens junctions, desmosomes contain cadherins that link the two 
cells across a narrow extracellular gap. The cadherins of desmosomes 
have a different domain structure from the classical cadherins 
found in adherens junctions and are referred to as desmogleins and 
desmocollins (Figure 7.25b). Dense cytoplasmic plaques on the inner 
surface of the plasma membranes serve as sites of anchorage for 
looping intermediate filaments similar to those of hemidesmosomes 
(Figure 7.19). The three-dimensional network of ropelike interme- 
diate filaments provides structural continuity and tensile strength to 
the entire sheet of cells. Intermediate filaments are linked to the 
cytoplasmic domains of the desmosomal cadherins by additional 
proteins, as depicted in FIGURE 7.26. The importance of cadherins in 


(b) 0.1 um 


FIGURE 7.24 An intercellular junctional complex. (a) Schematic diagram showing the junctional complex on the lateral surfaces of a simple columnar 
epithelial cell. The complex consists of a tight junction (zonula occludens), an adherens junction (zonula adherens), and a desmosome (macula adherens). 
Other desmosomes and gap junctions are located deeper along the lateral surfaces of the cells. Adherens junctions and tight junctions encircle the cell, 

whereas desmosomes and gap junctions are restricted to a particular site between adjacent cells. Hemidesmosomes are shown at the basal cell surface. 

(b) Electron micrograph of a junctional complex between two rat airway epithelial cells (TJ, tight junction; AJ, adherens junction; D, desmosome). 


Source: (b) From Eveline E. Schneeberger and Robert D. Lynch, Am. J. Physiol. 262:1648, 1992. © The American Physiological Society (APS). All rights reserved. 
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FIGURE 7.25 Schematic model of the molecular architecture of an 
adherens junction and desomosome. (a) The cytoplasmic domain of each 
cadherin molecule is connected to the actin filaments of the cytoskeleton 
by linking proteins, including B-catenin, a-catenin, and various actin-binding 
proteins. One of these actin-binding proteins is formin, which participates 
in the polymerization of the actin filaments. (b) Schematic model of the 
molecular architecture of a desmosome. Specialized cadherins, called 
desmoglein and desmocollin, bind to one another in the extracellular 
space. In the cytoplasm, the desmosomal cadherins bind indirectly to 
intermediate filaments. 
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FIGURE 7.26 Specialized cadherins, called desmoglein and desmocol- 
lin, bind to one another in the extracellular space. In the cytoplasm, the 
desmosomal cadherins bind indirectly to intermediate filaments. Electron 
micrograph of a desmosome from newt epidermis. 

Source: From Douglas E. Kelly, J. Cell Biol. 28 : 51, 1966, fig. 10. 
Reproduced with permission of Rockefeller University Press. 


maintaining the structural integrity of an epithelium is illustrated by 245 


an autoimmune disease (pemphigus vulgaris) in which antibodies 
are produced against one of the desmogleins. The disease is 
characterized by a loss of epidermal cell-cell adhesion and severe 
blistering of the skin. 


REVIEW | 


1. Distinguish between a hemidesmosome and a 


desmosome; a desmosome and an adherens junction. 
oe 


7.10 The Role of Cell-Adhesion 
Receptors in Transmembrane 
Signaling 


One of the roles of integral membrane proteins is to transfer infor- 
mation across the plasma membrane, a process known as trans- 
membrane signaling. While the subject will be explored in detail 
in Chapter 15, it can be noted that all four types of cell-adhesion 
molecules illustrated in Figure 7.21 have the potential to carry out 
this function. For example, integrins and cadherins can transmit sig- 
nals from the extracellular environment to the cytoplasm by means 
of linkages with the cytoskeleton and with cytosolic regulatory mol- 
ecules, such as protein kinases and G proteins. Protein kinases acti- 
vate (or inhibit) their target proteins through phosphorylation, 
whereas G proteins activate (or inhibit) their protein targets through 
physical interaction (see Figure 15.21b). The engagement of an inte- 
grin with its ligand can induce a variety of responses within a cell, 
including changes in cytoplasmic pH or Ca”* concentration, protein 
phosphorylation, and gene expression. These changes, in turn, can 
alter a cell's growth potential, migratory activity, state of differentia- 
tion, or survival. 


7.11 Tight Junctions: Sealing 
the Extracellular Space 


A simple epithelium, like the lining of the intestine or lungs, is com- 
posed of a layer of cells that adhere tightly to one another to form a 
cellular sheet. Biologists have known for decades that when certain 
types of epithelia, such as frog skin or the wall of the urinary bladder, 
are mounted between two compartments containing different solute 
concentrations, very little diffusion of ions or solutes is observed 
across the wall of the epithelium from one compartment to the other. 
Given the impermeability of plasma membranes, it is not surprising 
that solutes cannot diffuse freely through the cells of an epithelial 
layer. But why aren't they able to pass between cells by way of 
the paracellular pathway (as in FIGURE 7.27a)? The reason became 
apparent during the 1960s with the discovery of specialized 
contacts, called tight junctions (or zonulae occludens), between 
neighboring epithelial cells. Tight junctions (TJs) are located at the 
very apical end of the junctional complex between adjacent epithe- 
lial cells (see Figure 7.24). An electron micrograph of a section 
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Membrane proteins 
that form tight junctions 


(b) 


FIGURE 7.27 Tight junctions. (a) Electron micrograph of a section through the apical region of adjoining epithelial cells showing where the plasma 
membranes of the two cells come together at intermittent points within the tight junction. Inset shows the tight junction structure at higher magnification. 
Tight junctions block the diffusion of solutes through the paracellular pathway between cells. (b) A model of a tight junction showing the intermittent 
points of contact between integral proteins from two apposing membranes. Each of these contact sites extends as a paired row of proteins within the 
membranes, forming a barrier that blocks solutes from penetrating the space between the cells. (c) Freeze-fracture replica showing the E face of the 
plasma membrane of one of the cells in a region of a tight junction. The grooves in the E face are left behind after the integral membrane proteins are 
pulled from this half of the membrane. (d) Scanning electron micrograph of the apical surface of an epithelium showing the encircling nature of the 


tight junctions. 


Source: (a) Courtesy of Daniel S. Friend, Harvard Medical School, Courtesy of Hiroyuki Sasaki and Shoichiro Tsukita; (c) From Philippa Claude and Daniel A. 
Goodenough, J. Cell Biol. 58:390, 1973, Fig. 6. Reproduced with permission of the Rockefeller University Press; (d) Courtesy of D. Tarin. 


through a TJ that has been cut to include the plasma membranes of 
the adjacent cells is shown in Figure 7.27a. A higher magnification 
view showing the interaction between the membranes of a TJ is 
shown in the inset of Figure 7.27a. It is evident that the adjoining 
membranes make contact at intermittent points, rather than being 
fused over a large surface area. As indicated in Figure 7.27b, the 
points of cell-cell contact are sites where integral proteins of two 
adjacent membranes meet within the extracellular space. 

Freeze fracture, which allows observation of the internal faces of 
a membrane (Figure 4.15), shows that the plasma membranes of a TJ 
contain interconnected strands (Figure 7.27c) that run mostly paral- 
lel to one another and to the apical surface of the epithelium. The 
strands (or grooves in the opposite face of a fractured membrane) 
correspond to paired rows of aligned integral membrane proteins 


that are illustrated in the inset of Figure 7.27b. The integral proteins 
of TJs form continuous fibrils that completely encircle the cell 
like a gasket and make contact with neighboring cells on all sides 
(Figure 7.27d). As a result, TJs serve as a barrier to the free diffusion 
of water and solutes from the extracellular compartment on one side 
of an epithelial sheet to that on the other side. Tight junctions also 
serve as “fences” that help maintain the polarized character of epi- 
thelial cells (see Figure 4.30). They accomplish this by blocking the 
diffusion of integral proteins between the apical domain of the 
plasma membrane and its lateral and basal domains. Like other sites 
of cell adhesion, tight junctions are also involved in signaling path- 
ways that regulate numerous cellular processes. 

Not all TJs exhibit the same permeability properties. Part of the 
explanation can be seen under the electron microscope: TJs with 


several parallel strands (such as that in Figure 7.27c) tend to form 
better seals than junctions with only one or a couple of strands. 
But there is more to the story than numbers of strands. Some TJs 
are permeable to specific ions or solutes to which other TJs are 
impermeable. Studies over the past decade have shed considerable 
light on the molecular basis of TJ permeability. 

It was originally thought that TJ strands were composed of a 
single protein, occludin. Then, it was found that cultured cells that 
lacked a gene for occludin were still able to form TJ strands of nor- 
mal structure and function. Subsequent studies by Shoichiro Tsukita 
and his colleagues at the University of Kyoto led to the discovery of 
a family of proteins called claudins that form the major structural 
component of TJ strands. The electron micrograph of FIGURE 7.28 
shows that occludin and claudin are present together within the lin- 
ear fibrils of a TJ. At least 24 different claudins have been identified, 
and differences in the distribution of these proteins may explain 
selective differences in TJ permeability. For example, one small 
region of a human kidney tubule—a region known as the thick 
ascending limb (or TAL)—has TJs that are permeable to magne- 
sium (Mg”*) ions. It is thought that the loops of the claudin mole- 
cules that extend into the extracellular space form pores in the TAL 
that are selectively permeable to Mg”* ions. Support for this concept 
has come from research on one specific member of the claudin fam- 
ily, claudin-16, which is expressed primarily in the TAL. The impor- 
tance of claudin-16 in kidney function was revealed in studies of 
patients suffering from a rare disease characterized by abnormally 
low Mg”* levels in their blood. These patients were found to have 
mutations in both copies of their claudin-16 gene. The Mg”* level in 
their blood is low because tight junctions containing the abnormal 
claudin are impermeable to Mg’*. As a result, this important ion 
fails to be reabsorbed from the tubule and is simply excreted in 
the urine. 

Another important function of tight junctions came to light in 
2002. It was thought for decades that the impermeability of mam- 
malian skin to water was solely a property of the outer, cornified 
layer of the skin (see Figure 7.1), which contains tightly packed pro- 
tein filaments and associated lipids. It was discovered, however, that 
mice lacking a gene for claudin-1 died shortly after birth as a result 
of dehydration. Further investigation revealed that the cells in one of 
the outer layers of normal epidermis are connected to one another by 
tight junctions. Animals lacking the gene for claudin-1 were unable 


FIGURE 7.28 The molecular composition of tight junction strands. 
Electron micrograph of a freeze-fracture replica of cells that had been 
joined to one another by tight junctions. The fracture faces were incubated 
with two types of gold-labeled antibodies. The smaller gold particles 
(arrowheads) reveal the presence of claudin molecules, whereas the larger 
gold particles (arrows) indicate the presence of occludin. These experi- 
ments demonstrate that both proteins are present in the same tight 
junction strands. Bar equals 0.15um. The inset shows a possible arrange- 
ment of the two integral membrane proteins as they make contact in the 
intercellular space. Both the claudins (red) and occludin (brown) span the 
membrane four times. 

Source: Micrograph from Mikio Furuse, Hiroyuki Sasaki, Kazushi Fujimoto, 
and Shoichiro Tsukita, J. Cell Biology 143:398, 1998, Fig. 6; reproduced 
with permission of the Rockefeller University Press. 


to assemble watertight epidermal tight junctions and, as a result, suf- 
fered from uncontrolled water loss. 

Tight junctions are also present between the endothelial cells 
that line the walls of capillaries. These junctions are particularly evi- 
dent in the brain where they help form the blood-brain barrier, 
which prevents substances from passing from the bloodstream into 
the brain. Although small ions and even water molecules may not be 
able to penetrate the blood-brain barrier, cells of the immune sys- 
tem are able to pass across the endothelium through these junctions. 
These cells are thought to send a signal that opens the junction, 
allowing the cells to pass. While protecting the brain from unwanted 
solutes, the blood-brain barrier also prevents access of many drugs 
to the central nervous system. Consequently, a major goal of the 
pharmaceutical industry is to develop drugs that temporarily open 
the tight junctions of the brain to allow passage of therapeutic 
compounds. 


REVIEW | 


1 What does freeze-fracture analysis tell you about the 
structure of a junction that can’t be learned from 
examination of stained tissue sections? 


2 How does the structure of a tight junction contribute 
to its function? 


e—____ 


7.12 Gap Junctions and 
Plasmodesmata: Mediating 
Intercellular Communication 


Gap junctions and plasmodesmata are specialized sites of communi- 
cation between adjoining cells in animals and plants, respectively. 
The plasma membranes of a gap junction contain channels that con- 
nect the cytoplasm of one cell with the cytoplasm of the adjoining 
cell. The passage of ionic currents through gap junctions plays a piv- 
otal role in numerous physiologic processes. Plasmodesmata are 
cytoplasmic channels between adjacent plant cells that allow the free 
passage of solute molecules. 
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FIGURE 7.29 Gap junctions. (a) Electron micrograph of a section through a gap junction perpendicular to the plane of the two adjacent membranes. The 
“pipelines” between the two cells are seen as electron-dense beads on the apposed plasma membranes. (b) Schematic model of a gap junction showing 
the arrangement of six connexin subunits to form a connexon, which contains half of the channel that connects the cytoplasm of the two adjoining cells. 
Each connexin subunit is an integral protein with four transmembrane domains. (c) High-resolution images derived from atomic force microscopy of the 
extracellular surface of a single connexon in the open (left) and closed (right) conformations. Closure of the connexon was induced by exposure to elevated 
Ca?* ion concentration. (d) Freeze-fracture replica of a gap junction plaque showing the large numbers of connexons and their high concentration. (The 


crystal structure of a gap junction can be found in Nature 458: 597, 2009.) 


Source: (a) From Camillo Peracchia and Angela F. Dulhunty, J. Cell Biol. 70:419, 1976, Fig. 5. Reproduced with permission of the Rockefeller University Press; 
(c) Courtesy Gina E. Sosinsky. From J. Cell Science 116:4479, 2003; by permission of The Company of Biologists, Ltd. http://jcs.biologists.org/content/ 
116/22/4479.full?sid=43a0380-6c77-4b57-ad32-7d6e8884a69%e; (d) Don W. Fawcett / Science Source Images. 


Gap Junctions 


Gap junctions are sites between animal cells that are specialized 
for intercellular communication. Electron micrographs reveal gap 
junctions to be sites where the plasma membranes of adjacent cells 
come very close to one another (within about 3 nm) but do not make 


direct contact. Instead, the cleft between the cells is spanned by very 
fine strands (FIGURE 7.29a) that are actually molecular “pipelines” 
that pass through the adjoining plasma membranes and open into 
the cytoplasm of the adjoining cells (Figure 7.29b). 

Gap junctions have a simple molecular composition; they are 
composed entirely of an integral membrane protein called connexin. 


Connexins are organized into multisubunit complexes, called con- 
nexons, that completely span the membrane (Figure 7.29b). Each 
connexon is composed of six connexin subunits arranged in a ring 
around a central opening, or annulus, that is approximately 1.5 nm 
in diameter at its extracellular surface (Figure 7.29c, left). 

During the formation of gap junctions, the connexons in the 
plasma membranes of apposing cells become tightly linked to one 
another through extensive noncovalent interactions of the extracel- 
lular domains of the connexin subunits. Once aligned, connexons in 
apposing plasma membranes form complete intercellular channels 
connecting the cytoplasm of one cell with the cytoplasm of its neigh- 
bor (Figure 7.29b). Large numbers of connexons become clustered 
in specific regions of the membrane, forming gap-junction plaques 
that can be visualized when the membrane is split down the middle 
by freeze fracture (See Figure 7.29d and the Experimental Walkthrough 
Video: Freeze fracture electron microscopy for this chapter). 

As discussed in the Experimental Pathways (Section 7.13), gap 
junctions are sites of communication between the cytoplasms of 
adjacent cells. The existence of gap-junction intercellular communi- 
cation (GJIC) is revealed through the passage of either ionic currents 
or low-molecular weight dyes, such as fluorescein, from one cell to 
its neighbors (FIGURE 7.30). Mammalian gap junctions allow the dif- 
fusion of molecules having a molecular mass below approximately 
1000 daltons. In contrast to the highly selective ion channels that 


?As discussed in the Experimental Pathways, gap junctions also occur between the 
presynaptic and postsynaptic membranes of adjacent nerve cells in certain parts of the 
brain, allowing nerve impulses to be transmitted directly from one neuron to another 
without requiring the release of chemical transmitters. 


FIGURE 7.30 Results of an experiment demonstrating the passage of 
low-molecular-weight solutes through gap junctions. Micrograph 
showing the passage of fluorescein from one cell into which it was injected 
(X) to the surrounding cells. 

Source: From R. Azarnia and W. R. Loewenstein, J. Memb. Biol. 6:378, 
1971. Reproduced with permission from Springer Science and Business 
Media. 


connect a cell to the external medium (Section 4.11), gap junction 249 


channels are relatively nonselective. Just as ion channels can be open 
or closed, gap-junction channels are also gated. Channel closure can 
be triggered by a number of stimuli, including phosphorylation of 
connexin subunits, and changes in voltage across the junction 
(Figure 7.29c, right). 

We saw in Chapter 4 how skeletal muscle cells are stimulated by 
chemicals released from the tips of nearby nerve cells. Stimulation of 
a cardiac or smooth muscle cell occurs by a very different process, 
one involving gap junctions. The contraction of the mammalian 
heart is stimulated by an electrical impulse generated in a small 
region of specialized heart muscle called the sinoatrial node, which 
acts as the heart’s pacemaker. The impulse spreads rapidly as a cur- 
rent of ions flows through gap junctions from one cardiac muscle 
cell to its neighbors, causing the cells to contract in synchrony. 
Similarly, the flow of ions through gap junctions that interconnect 
smooth muscle cells in the wall of the esophagus or intestine leads to 
the generation of coordinated peristaltic waves that move down the 
length of the wall.” 

Gap junctions can put a large number of cells of a tissue into 
intimate cytoplasmic contact. This has important physiologic 
consequences because a number of highly active regulatory sub- 
stances, such as cyclic AMP and inositol phosphates (Chapter 15), 
are small enough to fit through gap-junction channels. As a result, 
gap junctions have the potential to integrate the activities of indi- 
vidual cells of a tissue into a functional unit. If, for example, only a 
few cells near a particular blood vessel happen to be stimulated by a 
hormone, the stimulus can be rapidly transmitted to all cells of the 
tissue. Gap junctions also allow cells to cooperate metabolically by 
sharing key metabolites, such as ATP, sugar phosphates, amino acids, 
and many coenzymes, which are small enough to pass through these 
intercellular channels. This is particularly important in tissues such 
as the lens, which are avascular (i.e., lack blood vessels). 

Connexins (Cx), the proteins of which gap junctions are con- 
structed, are members of a multigene family. Approximately 20 dif- 
ferent connexins with distinct tissue-specific distributions have been 
identified. Connexons composed of different connexins exhibit 
marked differences in conductance, permeability, and regulation. In 
some cases, connexons in neighboring cells that are composed of 
different connexins are able to dock and form functional channels, 
whereas in other cases, they are not. These compatibility differences 
may play important roles in either promoting or preventing com- 
munication between different types of cells in an organ. For exam- 
ple, connexons joining cardiac muscle cells are composed of the 
connexin Cx43, whereas connexons joining the cells that make up 
the heart’s electrical conduction system are composed of Cx40. 
Because these two connexins form incompatible connexons, the two 
types of cells are electrically insulated from one another even though 
they are in physical contact. A number of inherited disorders have 
been associated with mutations in genes encoding connexins. 
Consequences of these disorders include deafness, blindness, car- 
diac arrhythmias, skin abnormalities, or nerve degeneration. 

Over the past few years a new type of communication system 
has been discovered that consists of thin, highly elongated tubules 
capable of conducting cell-surface proteins, cytoplasmic vesicles, 
organelles, and calcium signals from one cell to another over long 
distances. They can also transmit infectious viral particles and prions 
between cells. To date, these tunneling nanotubes, as they are called, 
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c 7.31 Tunneling nanotubes. Scanning electron micrograph 
showing two cultured neuroendocrine cells connected to one another by a 
thin tubular process capable of carrying materials between the cytoplasm 
of the neighboring cells. These processes, which are only about 100 nm in 
diameter, are supported by an internal actin “skeleton.” The inset shows a 
number of fluorescently labeled vesicles caught in the act of movement 
between two cells. 

Source: From Amin Rustom, et al., Science 303: 1007, 2004; © 2004, reprinted 
with permission from AAAS. Image courtesy of Hans-Hermann Gerdes. 


have been studied primarily in cultured cells (as in FIGURE 7.31), but 
increasing evidence suggests that they may play a role in diverse 
developmental processes in a variety of model organisms. How tun- 
neling nanotubules are formed and regulated, however, is currently 
not well understood. 


Plasmodesmata 


Unlike animals, whose cells make intimate contact with one another, 
plant cells are separated from one another by a substantial barrier— 
the cell wall. It is not surprising, therefore, that plants lack the 
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'.32 Plasmodesmata. (a) Electron micrograph of a section through a plasmodesma of a fern 
gametophyte. The desmotubule is seen to consist of membrane that is continuous with the endoplasmic 
reticulum (ER) of the cytoplasm on both sides of the plasma membrane. (b) Schematic drawing of a 
plasmodesma. The black arrows indicate the pathways taken by molecules as they pass through the annulus 
from cell to cell. (c) An example of the movement of a protein from one cell to another within a plant root. 
The smaller inset shows the localization of the fluorescently labeled messenger RNA molecules (green) that 
encode a protein called Shr. The mRNA is localized within the cells of the stele (Ste), which is thus the tissue 
in which this protein is synthesized. The larger photo shows the localization of the fluorescently labeled Shr 
protein (also green), which is present both within the stele cells where it is synthesized and the adjoining 
endodermal cells (End) into which it has passed by way of the connecting plasmodesmata. The transported 
protein is localized within the nuclei of the endodermal cells where it acts as a transcription factor. Bars: 
50mm and 25pm (inset). 

Source: (a) From Lewis G. Tilney, Todd J. Cooke, Patricia S. Connelly, and Mary S. Tilney, J. Cell Biol. 
112:740, 1991, Fig. 1a; Reproduced with permission of the Rockefeller University Press; (b) From Keiji 
Nakajima et al., Nature 413:308, 2001; © 2001, reprinted by permission from Macmillan Publishers, Ltd. 
Image courtesy of Philip N. Benfey. 
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cell-adhesion molecules we have been discussing in this chapter. 
Although plants lack the specialized junctions found in animal 
tissues, most plant cells are connected to one another by plasmodes- 
mata. Plasmodesmata (singular, plasmodesma) are cytoplasmic 
channels that pass through the cell walls of adjacent cells. A simple (i.e., 
unbranched) plasmodesmais shown in Figure 7.32a,b. Plasmodesmata 
are lined by plasma membrane and usually contain a dense central 
structure, the desmotubule, derived from the smooth endoplasmic 
reticulum of the two cells. Like gap junctions between animal cells, 
plasmodesmata serve as sites of cell-to-cell communication, as sub- 
stances pass through the annulus surrounding the desmotubule. 

It was thought for many years that plasmodesmata were imper- 
meable to molecules greater than about 1000 daltons (1 kDa). This 
conclusion was based on studies in which different-sized fluorescent 
dyes were injected into cells. More recent studies suggest that plas- 
modesmata allow much larger molecules (up to 50 kDa) to pass 
between cells, owing to the fact that the plasmodesmatal pore is 
capable of dilation. The first insight into this dynamic property was 
gained from studies in the 1980s on plant viruses that spread from 
one cell to another through plasmodesmata. It was found that the 
viruses encoded a movement protein that interacted with the wall of 
the plasmodesmata and increased the diameter of the pore. 


Subsequent studies revealed that plant cells produce their own 251 


movement proteins that regulate the flow of proteins and RNAs 
from cell to cell. Some of these macromolecules find their way into 
the vascular system of the plant, where they integrate plant-wide 
activities, such as the growth of new leaves and flowers or defense 
against pathogens. FIGURE 7.32¢ documents the movement of a pro- 
tein (labeled with green fluorescence) from one type of plant tissue 
(the stele), where it was synthesized, to an adjoining tissue (the 
endodermis). The protein is seen to be concentrated in the spherical 
nuclei of the single layer of endodermal cells where it acts to stimu- 
late gene transcription. 


REVIEW | 


1. Compare the arrangement of integral membrane 
proteins in a tight junction versus a gap junction. 


2. How are plasmodesmata and gap junctions similar? 
How are they dissimilar? Would you expect a 
moderately sized protein to pass through a 
plasmodesma? 


7.13 EXPERIMENTAL PATHWAYS 


The Role of Gap Junctions in Intercellular Communication 


Based on the information presented in Chapter 4, you might pre- 
sume that synaptic transmission always occurs by movement of 
neurotransmitter molecules from the presynaptic neuron to the 
postsynaptic cell. This was the prevailing view until the 1950s, when 
Edwin Furshpan and David Potter of University College in London 
found a notable exception. Furshpan and Potter were studying 
synaptic transmission between giant neurons in the nerve cord 
of the crayfish. They noted that a small, subthreshold depolariza- 
tion induced in the presynaptic nerve cell produced a very rapid 
(0.1msec) depolarization in the postsynaptic cell.'? If the nerve 
cells had been connected by a chemical synapse, a subthreshold 
change in membrane potential should not have been propagated 
to the postsynaptic cell, because it would not be sufficient to 
stimulate the release of neurotransmitter molecules. Even if neu- 
rotransmitter molecules were released, they could not possibly 
induce such a rapid change in the postsynaptic cell. Furshpan and 
Potter concluded that the two nerve cells were connected by a 
different type of synapse, an electrotonic synapse, in which ionic 
currents in the presynaptic cell could flow directly into the post- 
synaptic cell on the other side of the synapse. It was presumed 
that this type of cell-cell connection, which allows for the flow of 
ions between cells, was peculiar to excitable cells, such as neu- 
rons, which are specialized for cell-cell communication. 

During the early 1960s, Yoshinobu Kanno and Werner 
Loewenstein of Columbia University were studying the perme- 
ability properties of the nuclear envelope, the membranous com- 
plex that bounds the nucleus. To determine whether ions were 
capable of flowing across the nuclear envelope, they had turned 
to the very large cells that make up epithelial tissues of the larval 
fruit fly (cells that contain the giant chromosomes that had proved 


so useful to geneticists). These cells were large enough to allow 
penetration of microelectrodes capable of inducing and recording 
ionic currents (FIGURE 1). To their surprise, Kanno and Loewenstein 
found that when ions were injected into the nucleus of one cell, 
not only did the ion flux (measured as an electrical current) spread 
into the cytoplasm of that cell, but it flowed directly into the cyto- 
plasm of an adjacent cell. In fact, the potential recorded in the 
adjacent cell was almost as great as that in the cell in which the 
current was originally induced. Kanno and Loewenstein con- 
cluded that the epithelial cells that make up the salivary gland are 
electrically coupled to one another, meaning that ions are able to 
flow freely from cell to cell through low-resistance cell junctions.° 
If small inorganic ions could pass through these junctions between 
neighboring cells, what about larger substances? When a small 


FIGURE 1. Micrograph showing recording microelectrodes being 
inserted into living cells of the Malpighian tubule of an insect cell. 


Source: Courtesy of Werner R. Loewenstein. 
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volume of the fluorescent ion fluorescein (molecular weight of 
376 daltons) was injected into the cytoplasm of one cell with a 
micropipette, the fluorescence rapidly spread into adjacent cells 
until the entire epithelial layer glowed from the presence of the 
tracer (as in Figure 7.31). In contrast, none of the fluorescent dye 
leaked out of the cells into the external medium, indicating that 
fluorescein molecules were diffusing directly from the cytoplasm 
of one cell into the cytoplasm of adjacent cells by means of per- 
meable cell-cell contacts.* Similar observations were soon made 
on a variety of different types of epithelial and mesenchymal cells, 
including those of various mammals, indicating that these com- 
municating junctions are widespread. 

Electron microscopic studies had shown that animal cells 
are bounded by a continuous plasma membrane. This new 
data suggested that the structure of this membrane was some- 
times modified at sites where cells made contact with other 
cells. Otherwise, it would be impossible for substances to move 
directly from the cytoplasm of one cell into another. The discovery 
of cell junctions containing channels between closely applied 
cells was made in 1967 by Jean Paul Revel and M. J. Karnovsky.° 
Electron micrographs of these junctions showed a distinct cleft 
between adjoining cells, which led the researchers to name them 
“gap junctions” to distinguish them from tight junctions, where 
adjoining cells make direct contact. 

Studies were carried out to learn more about the size 
of the channels connecting the cytoplasms of adjacent cells. 
Loewenstein's laboratory tested fluorescent probes that were 
linked to peptides of varying size. They found that molecules up 
to about 1200 daltons were able to diffuse between insect larval 
salivary gland cells.° Based on estimates of the dimensions of 
these molecules, they concluded that the effective diameter of the 
channel was approximately 10 to 15 A (1.0 to 1.5 nm), a value that 


(a) 


(b) 


matched closely with that estimated from high-resolution micro- 
graphs of gap junctions taken with the electron microscope.’ 
One of the first questions considered by Kanno and 
Loewenstein following their discovery of permeable intercellular 
contacts was whether these same types of contacts were present 
in cancer cells. The growth rates of normal cells were known to 
be influenced by stimuli from their environment. It was possible 
that one of the factors that allowed cancer cells to escape from 
the types of growth control mechanisms that prevail in normal 
cells was a loss in their ability to receive regulatory molecules 
from neighboring cells. Kanno and Loewenstein investigated this 
possibility by measuring the flow of ionic current in normal liver 
tissue as compared to that in a variety of liver tumors. Whereas 
ionic current readily flowed between normal cells of a rat liver, 
no passage of current could be detected between the cells of 
any of the liver tumors that were investigated.® Since these initial 
studies, hundreds of different types of cancer cells have been 
analyzed for their ability to carry out gap-junction intercellular 
communication (GJIC). The initial results of Loewenstein and 
Kanno have been found to hold true for most, but not all, cancer 
cells that have been investigated.”"° It is not surprising that not 
all cancer cells exhibit the same properties with regard to GJIC. 
The conversion of a normal cell into a malignant cell is a mul- 
tistage phenomenon that can occur as the result of changes in 
a wide variety of different genes (Chapter 16). There are differ- 
ent mechanisms by which a cell can lose growth control. Some of 
these mechanisms do seem to involve the loss of the cells’ ability 
to transmit signals across gap junctions. Among those tumors 
where this does occur, there is often a progressive loss in GJIC as 
the cells become more and more malignant." In addition, there 
may be a correlation between the loss of GJIC and an increase 
in the metastatic potential of a population of cells.'* Metastatic 
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FIGURE 2. Calcium waves induced by mechanical stimulation in (a) a control culture of rat C6 glioma cells and (b) a clone of the same cells shown 

in a, but transfected with connexin43 DNA. The injected DNA is transcribed and translated in the transfected cells, and the connexin protein molecules 
are incorporated into the plasma membrane. When one of the nontransfected cells is mechanically stimulated, there is very little passage of Ca** into 
its neighbors. In contrast, when cells that express the connexin gene are mechanically stimulated, a wave of Ca** passes from cell to cell through the 


gap junctions formed by the connexin43 protein. 


Source: From Andrew C. Charles et al., J. Cell Biol. 118:197, 1992; by copyright permission of the Rockefeller University Press. 


(a) 


FIGURE 3 (a) When a small mass of C6 glioma cells is implanted into the brain of a rat, the cells develop into a large tumor mass after two weeks 
growth. (b) The same cells that have been transfected with connexin43 DNA and are presumed to engage in gap junction intercellular communication 


have developed into a much smaller tumor during the same time period. 
Source: From Christian C. G. Naus et al., Cancer Res. 52:4210, 1992. 


potential is a property of cancer cells that enables them to ignore 
adjacent cells and move off on their own. It would be expected 
that such behavior could occur only if a cell severed its communi- 
cation links with neighboring cells. 

Evidence obtained by correlations between one condition 
(e.g., loss of GJIC) and another condition (e.g., malignancy) is 
circumstantial. The best evidence of a direct, causal relationship 
between the two conditions has been obtained by Christian Naus 
and colleagues at the University of Western Ontario in studies 
in which cancer cells are forced to express gap junction proteins 
(connexins). When Cé glioma cells from a rat brain tumor are 
transfected with DNA that encodes the protein connexin43, there 
is a dramatic increase in GJIC between the tumor cells (FIGURE 2) 
and a corresponding decrease in the growth rate of the cells.'*"4 
Similarly, when Cé glioma cells that have been transfected with 
connexin43-coding DNA are implanted into the brains of adult 
rats, they generate much smaller tumors than the nontransfected 
glioma cells (FIGURE 3).'° Similar findings have been reported on 
human cancer cells.'©!7'8 For example, when human mammary 
carcinoma cells are transfected with genes encoding connexins 
that are expressed in normal mammary cells, the capability of the 
cells to form tumors in susceptible mice is greatly reduced. One 
of the transfected cell lines exhibited suppressed growth for sev- 
eral weeks and then displayed a sudden increase in malignancy. 
When analyzed, these cells no longer expressed the connexin 
protein, suggesting that the transfected gene had been lost or 
inactivated during tumor growth. '7 
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Because a lipid-protein plasma membrane of less than 10-nm thick- 
ness can be expected to offer only minimal protection for a cell’s con- 
tents, it is not surprising that “naked” cells are extremely fragile 
structures. The cells of nearly all organisms other than animals are 
enclosed in a protective outer envelope. Protozoa have a thickened 
outer coat, whereas bacteria, fungi, and plants have distinct cell walls. 
We will restrict our discussion to the cell walls of plants, which were the 
first cellular structures to be observed with a light microscope (page 2). 
Plant cell walls perform numerous vital functions. As discussed 
in Figure 4.36, plant cells develop turgor pressure that pushes against 
their surrounding wall. As a result, the wall gives the enclosed cell 
its characteristic polyhedral shape (FIGURE 7.33a). In addition to 
providing support for individual cells, cell walls serve collectively as 
a type of “skeleton” for the entire plant. In fact, a tree without cell 
walls might resemble, in many respects, a human without bones. 
Cell walls also protect the cell against damage from mechanical 
abrasion and pathogens, and they mediate cell-cell interactions. 
Like the ECM at the surface of an animal cell, a plant cell wall is a 
source of signals that alter the activities of the cells that it contacts. 
Plant cell walls are often likened to fabricated materials such as 
reinforced concrete or fiberglass because they contain a fibrous 
element embedded in a nonfibrous, gel-like matrix. Cellulose, whose 


structure was described in Figure 2.17, provides the fibrous compo- 
nent of the cell wall, and proteins and pectin (described below) pro- 
vide the matrix. Cellulose molecules are organized into rod-like 
microfibrils (Figure 7.33b,c) that confer rigidity on the cell wall and 
provide resistance to tensile (pulling) forces. Each microfibril is 
about 5 nm in diameter and is typically composed of bundles of 36 
cellulose molecules oriented parallel to one another and held 
together by hydrogen bonds. The walls of many plant cells are com- 
posed of layers in which the microfibrils of one layer are oriented at 
approximately 90° to those of adjacent layers (Figure 7.33). 

Cellulose molecules are polymerized at the cell surface. Glucose 
subunits are added to the end of a growing cellulose molecule by a 
multisubunit enzyme called cellulose synthase. The subunits of the 
enzyme are organized into a six-membered ring, or rosette, which is 
embedded within the plasma membrane (Figure 7.34a,b). In con- 
trast, materials of the matrix are synthesized within the cytoplasm 
(FIGURE 7.34c) and carried to the cell surface in secretory vesicles. 
The matrix is highly complex, requiring hundreds of enzymes for its 
synthesis and degradation. The matrix of the cell wall is composed of 
three types of macromolecules (Figure 7.34c): 


1. Hemicelluloses are branched polysaccharides whose backbone 
consists of one sugar, such as glucose, and side chains of other 
sugars, such as xylose. Hemicellulose molecules bind to the 


Hemicellulose 


Cellulose 
microfibril 


(c) 


FIGURE 7.33 The plant cell wall. (a) Electron micrograph of a plant 
cell surrounded by its cell wall. The middle lamella is a pectin-containing 
layer situated between adjacent cell walls. (b) Electron micrograph 
showing the cellulose microfibrils and hemicellulose cross-links of an 
onion cell wall after extraction of the nonfibrous pectin polymers. (c) 
Schematic diagram of a model of a generalized plant cell wall. 
Source: (a) Omikron/ Photo Researchers, Inc.; (b) From Maureen 
Mccann, B. Wells, and K. Roberts, J. Cell Sci. 96:329, 1990; 
Reproduced by permission of The Company of Biologists, Ltd. 
http://jcs. biologists.org/content/96/2/323. full.pdf+html?sid= 
f1c1dOcf-55bf-4050-b9bb-af46eae08bOf 
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(b) Microtubule 


surfaces of cellulose microfibrils, cross-linking them into a resil- 
ient structural network. 


2. Pectins are a heterogeneous class of negatively charged polysac- 
charides containing galacturonic acid. Like the glycosaminogly- 
cans of animal cell matrices, pectins hold water and thus form an 
extensive hydrated gel that fills in the spaces between the fibrous 
elements. When a plant is attacked by pathogens, fragments of 
pectins released from the wall trigger a defensive response by the 
plant cell. Purified pectin is used commercially to provide the 
gel-like consistency of jams and jellies. 


3. Proteins, whose functions are not well understood, mediate 
dynamic activities. One class, the expansins, facilitate cell growth. 
These proteins cause localized relaxation of the cell wall, which 
allows the cell to elongate at that site in response to the turgor 
pressure generated within the cell. Cell wall-associated protein 
kinases span the plasma membrane and are thought to transmit 
signals from the cell wall to the cytoplasm. 


The percentages of these various materials in cell walls are 
highly variable, depending on the type of plant, the type of cell, and 
the stage of the wall. Like the extracellular matrices of animal tissues, 
plant cell walls are dynamic structures that can be modified in 
response to changing environmental conditions. 

Cell walls arise as a thin cell plate that forms between the 
plasma membranes of newly formed daughter cells following cell 
division. The cell wall matures by the incorporation of additional 
materials that are assembled inside the cell and secreted into the 
extracellular space. In addition to providing mechanical support 
and protection from foreign agents, the cell wall of a young, 


FIGURE 7.34 Synthesis of plant cell wall macromolecules. (a) Freeze- 
fracture replica of the plasma membrane of an algal cell. The rosettes are 
thought to represent the cellulose-synthesizing enzyme (cellulose synthase) 
situated within the plasma membrane. (b) A model of cellulose fibril 
deposition. Each rosette is thought to form a single microfibril that 
associates laterally with the microfibrils from other rosettes to form a larger 
fiber. The entire array of rosettes might move laterally within the membrane | 
as it is pushed by the elongating cellulose molecules. Studies suggest that 
the direction of movement of the membrane rosettes is determined by 
oriented microtubules present in the cortical cytoplasm beneath the plasma | 
membrane (discussed in Chapter 9). (c) Electron micrograph of a Golgi 
complex of a peripheral root cap cell stained with antibodies against a | 
polymer of galacturonic acid, one of the major components of pectin. This 
material, like hemicellulose, is assembled in the Golgi complex. The 
antibodies have been linked to gold particles to make them visible as dark 
granules. The bar represents 0.25mm. | 


Source: (a) From T. H. Giddings, Jr., D. L. Brower, and L. A. Staehelin, J. Cell 
Biol. 84:332, 1980, Fig. 6. reproduced with permission of The Rockefeller 
University Press; (c) From Margaret Lynch and L. A. Staehelin, J. Cell Biol. 
118:477, 1992, Fig. 9. reproduced with permission of The Rockefeller 
University Press. 


undifferentiated plant cell must be able to grow in conjunction with | 
the enormous growth of the cell it surrounds. The walls of growing | 
cells are called primary walls, and they possess an extensibility that 
is lacking in the thicker secondary walls present around many | 
mature plant cells. The transformation from primary to secondary | 
cell wall occurs as the wall increases in cellulose content and, in most | 
cases, incorporates a phenol-containing polymer called lignin. | 
Lignin provides structural support and is the major component of | 
wood. The lignin in the walls of water-conducting cells of the xylem | 
provides the support required to move water through the plant. 


REVIEW | 


1. Describe the components that make up a plant cell 
wall and the role of each in the wall’s structure and 
function. 


2. Distinguish between cellulose and hemicellulose, a 
cellulose molecule and a microfibril, a primary cell wall 
and a secondary cell wall. 
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. Cell adhesion can often be blocked in vitro by treating cells with 


specific agents. Which of the following substances would be 
expected to interfere with cell adhesion mediated by selectins? 
with cell adhesion mediated by L1 molecules? The substances are 
trypsin, which digests proteins; a peptide containing RGD; neu- 
raminidase, which removes sialic acid from an oligosaccharide; 
collagenase, which digests collagen; hyaluronidase, which digests 
hyaluronic acid; EGTA, which binds Ca** ions in the medium. 


. What substance might you add to a culture dish to block the 


migration of neural crest cells? to block the adhesion of fibro- 
blasts to the substratum? 


. Mice lacking a gene for fibronectin do not survive early embry- 


onic development. Name two processes that might be disrupted 
in such embryos. 


. Suppose you found that molecule A, which had a molecular 


mass of 1500 daltons, was able to penetrate the channels of a 
gap junction, but molecule B, whose molecular mass was only 
1200 daltons, was unable to diffuse between the same cells. 
How might these molecules be different so as to explain these 
results? 


. In what ways are the extracellular matrices of animals and the cell 


walls of plants similar in construction? 


. [twas noted that two different autoimmune diseases, one produc- 


ing antibodies against a component of hemidesmosomes and the 
other producing antibodies against a component of desmosomes, 
both cause severe blistering of the skin. Why do you think these 
two conditions have such similar symptoms? 


. Which of the various types of molecules that mediate cell adhe- 


sion is most likely to be responsible for the type of sorting out 
demonstrated by the cells of Figure 7.20? Why? How could you 
test your conclusion? 


10. 


Ta. 


12. 


13. 


14. 


Follicle-stimulating hormone (FSH) is a pituitary hormone that 
acts on the follicle cells of the ovary to trigger the synthesis of 
cyclic AMP, which stimulates various metabolic changes. FSH 
normally has no effect on cardiac muscle cells. However, when 
ovarian follicle cells and cardiac muscle cells are grown together 
in a mixed culture, a number of the cardiac muscle cells are seen 
to contract following addition of FSH to the medium. How might 
this observation be explained? 


. Why do you think animal cells are able to survive without the 


types of cell walls that are found in nearly every other group of 
organisms? 

Why would lowering the temperature of the medium in which 
cells were growing be expected to affect the ability of the cells 
to form gap junctions with one another? 


Certain of the cell junctions occur as encircling belts, while oth- 
ers occur as discrete patches. How do these two types of struc- 
tural arrangements correlate with the functions of the respective 
junctions? 


Propose some mechanism that might explain how tobacco 
mosaic virus was able to alter the permeability of a plasmodesma. 
How might you be able to test your proposal? 


One type of vertebrate cell that is thought to lack integrins is 
the erythrocyte (red blood cell). Does this surprise you? Why or 
why not?. 


Focal adhesions can withstand forces much greater than the 
adhesive force of a single integrin molecule. Why? Suppose a 
focal adhesion can withstand a force of 5 nN/um?. Given that a 
single integrin-ligand bond can withstand a force up to 50 pN, 
estimate how many integrin molecules are present in a single 
focal adhesion with a surface area of 1 ttm*? (See Quantitative 
Tutorial Video). 
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Cytoplasmic Membrane 
Systems: Structure, Function, 
and Membrane Trafficking 


HIJACKING THE CELL 


Saining access to an animal cell poses a formidable 
challenge to would-be invaders. Viruses, however, have 
evolved a multitude of ways to overcome cellular 
obstacles in order to execute their viral replication 
programs. Fundamental to the life cycle of all animal 
viruses is the hijacking of cellular membrane systems, 
including the plasma membrane, endosomal vesicles, 
endoplasmic reticulum (ER), and the Golgi complex. 


The first challenge that a virus faces is how to enter the 
cell. Some enveloped viruses, such as the herpes virus, 
fuse directly with the plasma membrane, depositing the 
genome-laden capsid inside the cell. A larger proportion 
of viruses, however, hijack endocytic pathways to enter 
the cell, thereby leaving few traces on the cell surface 
that roving immune cells may take note of. In these 
cases, viruses are internalized into vesicles known as 


; ; Human immunodeficiency virus (HIV) on a human lymphocyte. This 
endosomes and later release their genome into the cell. pseudocolored scanning EM image shows HIV particles (green) on the surface of 


a lymphocyte (pink). Many viruses, including HIV, must hijack the cells’ 
membrane systems in order to enter, propagate, and exit their host cells. 
Source: CDC/ C. Goldsmith, P. Feorino, E. L. Palmer, W. R. McManus / Center for 
Disease Control and Prevention. 


Another major challenge the virus now faces is how to 
replicate and produce new viable progeny. In recent 
years, researchers have found that cytoplasmic viruses 
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membranes, most often the ER membrane, to create new 
organelle-like structures, referred to as virus factories or 
viroplasm. Inside these virus factories, viral proteins required 
for replication are organized into assembly lines for the 
efficient production of new viral particles. In many ways, 


8.1 An Overview of the 
Endomembrane System 


Under the light microscope, the cytoplasm of living cells appears 
relatively devoid of structure. Yet, even before the beginning of the 
twentieth century, examination of stained sections of animal tissues 
hinted at the existence of an extensive membrane network within the 
cytoplasm. It wasn’t until the development of the electron micro- 
scope in the 1940s, however, that biologists began to appreciate the 
diverse array of membrane-bound structures present in the cyto- 
plasm of most eukaryotic cells. These early electron microscopists 
saw membrane-bound vesicles of varying diameter containing mate- 
rial of different electron density; long channels bounded by mem- 
branes that radiate through the cytoplasm to form an interconnected 
network of canals; and stacks of flattened membrane-bound sacs. 

It became evident from these early electron microscopic studies 
and the biochemical investigations that followed that the cytoplasm 
of eukaryotic cells was subdivided into a variety of distinct compart- 
ments bounded by membrane barriers. As more types of cells were 
examined, it became apparent that these membranous compartments 
in the cytoplasm formed different organelles that could be identified 
in diverse cells from yeast to multicellular plants and animals. The 
extent to which the cytoplasm of a eukaryotic cell is occupied by 
membranous structures is illustrated in the electron micrograph of a 
maize root cell shown in FIGURE 8.1. As we will see in the following 
pages, each of these organelles contains a particular complement of 
proteins and is specialized for particular types of activities. Thus, just 
as a house or restaurant is divided into specialized rooms where dif- 
ferent activities can take place independent of one another, the cyto- 
plasm of a cell is divided into specialized membranous compartments 
for analogous reasons. Keep in mind, as you examine the micro- 
graphs in this chapter, that these cytoplasmic organelles may appear 
as stable structures, like the rooms of a house or restaurant, but in fact 
they are dynamic compartments that are in continual flux. 

In the present chapter, we will examine the structure and functions 
of the endoplasmic reticulum, Golgi complex, endosomes, lysosomes, 
and vacuoles. Taken together, these organelles form an endomem- 
brane system in which the individual components function as part 
of a coordinated unit. (Mitochondria and chloroplasts are not part of 
this interconnected system and were the subjects of Chapters 5 and 6. 
Current evidence suggests that peroxisomes, which were also dis- 
cussed in Chapter 6, have a dual origin. The basic elements of the 
boundary membrane are thought to arise from the endoplasmic retic- 
ulum, but many of the membrane proteins and the soluble internal 
proteins are taken up from the cytosol, as described in Section 8.21.) 

The organelles of the endomembrane system are part of a 
dynamic, integrated network in which materials are shuttled back 
and forth from one part of the cell to another. For the most part, 


these factories operate like a scaled-down version of the 
cellular organelles from which they are derived, but 
designed for the mass production of just a handful of 
molecules, rather than the thousands that the ER and 
Golgi complex are responsible for producing in any 
given cell. 
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FIGURE 8.1 Membrane-bound compartments of the cytoplasm. The 
cytoplasm of this root cap cell of a maize plant contains an array of 
membrane-bound organelles whose structure and function will be 
examined in this chapter. As is evident in this micrograph, the combined 
surface area of the cytoplasmic membranes is many times greater than that 
of the surrounding plasma membrane. 


Source: Courtesy of Hilton H. Mollenhauer. 


materials are shuttled between organelles—from the Golgi complex 
to the plasma membrane, for example—in small, membrane-bounded 
transport vesicles that bud from a donor membrane compartment 
(FIGURE 8.2a).' Transport vesicles move through the cytoplasm in a 
directed manner, often pulled by motor proteins that operate on 
tracks formed by microtubules and microfilaments of the cytoskele- 
ton (see Figure 9.1a). When they reach their destination, the vesicles 
fuse with the membrane of the acceptor compartment, which receives 
the vesicle’s soluble cargo as well as its membranous wrapper (Figure 
8.2a). Repeated cycles of budding and fusion shuttle a diverse array of 
materials along numerous pathways that traverse the cell. 

Several distinct pathways through the cytoplasm have been 
identified and are illustrated in the overview shown in Figure 8.2b. A 
biosynthetic pathway can be discerned in which proteins are syn- 
thesized in the endoplasmic reticulum, modified during passage 
through the Golgi complex, and transported from the Golgi complex 
to various destinations, such as the plasma membrane, a lysosome, 
or the large vacuole of a plant cell. This route is also referred to as the 
secretory pathway, because many of the proteins synthesized in 


‘The term vesicle implies a spherical-shaped carrier. Cargo may also be transported in 
irregular or tubular-shaped membrane-bound carriers. For the sake of simplicity, we 
will generally refer to carriers as “vesicles,” while keeping in mind they are not always 
spherical. 
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FIGURE 8.2 An overview of the biosynthetic/secretory and endocytic 
pathways that unite endomembranes into a dynamic, interconnected 
network. (a) Schematic diagram illustrating the process of vesicle transport 
by which materials are transported from a donor compartment to a 
recipient compartment. Vesicles form by membrane budding, during which 
specific membrane proteins (green spheres) of the donor membrane are 
incorporated into the vesicle membrane and specific soluble proteins 
(purple spheres) in the donor compartment are bound to specific recep- 
tors. When the transport vesicle subsequently fuses to another membrane, 
the proteins of the vesicle membrane become part of the recipient 
membrane, and the soluble proteins become sequestered within the lumen 
of the recipient compartment. (b) Materials follow the biosynthetic (or 
secretory) pathway from the endoplasmic reticulum, through the Golgi 
complex, and out to various locations including lysosomes, endosomes, 
secretory vesicles, secretory granules, vacuoles, and the plasma mem- 
brane. Materials follow the endocytic pathway from the cell surface to the 
interior by way of endosomes and lysosomes, where they are generally 
degraded by lysosomal enzymes. 


the endoplasmic reticulum (as well as complex polysaccharides syn- 
thesized in the Golgi complex, Figure 7.34c) are destined to be dis- 
charged (secreted or exocytosed) from the cell. Secretory 
activities of cells can be divided into two types: constitutive and regu- 
lated (Figure 8.2b). During constitutive secretion, materials are 
transported in secretory vesicles from their sites of synthesis and dis- 
charged into the extracellular space in a continual manner. Most cells 
engage in constitutive secretion, a process that contributes not only 
to the formation of the extracellular matrix (Section 7.2), but to the 
formation of the plasma membrane itself. During regulated secre- 
tion, materials are stored as membrane-bound packages and dis- 
charged only in response to an appropriate stimulus. Regulated 
secretion occurs, for example, in endocrine cells that release hor- 
mones, in pancreatic acinar cells that release digestive enzymes, and 
in nerve cells that release neurotransmitters. In some of these cells, 
materials to be secreted are stored in large, densely packed, mem- 
brane-bound secretory granules (see FIGURE 8.3). Proteins, lipids, 
and complex polysaccharides are transported through the cell along 
the biosynthetic or secretory pathway. We will focus in the first part of 
the chapter on the synthesis and transport of proteins, as summarized 
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in Figure 8.2b. During the discussion, we will consider several dis- 
tinct classes of proteins. These include soluble proteins that are dis- 
charged from the cell, integral proteins of the various membranes 
depicted in Figure 8.2b, and soluble proteins that reside within the 
various compartments enclosed by the endomembranes (e.g., lysoso- 
mal enzymes). Whereas materials move out of the cell by the secre- 
tory pathway, the endocytic pathway operates in the opposite 
direction. By following the endocytic pathway, materials move 
from the outer surface of the cell to compartments, such as endosomes 
and lysosomes, located within the cytoplasm (Figure 8.25). 

The movement of vesicles and their contents along the various 
pathways of a cell is analogous to the movement of trucks carrying 
different types of cargo along the various highways of a city. Both 
types of transport require defined traffic patterns to ensure that 
materials are accurately delivered to the appropriate sites. For exam- 
ple, protein trafficking within a salivary gland cell requires that the 
proteins of salivary mucus, which are synthesized in the endoplas- 
mic reticulum, are specifically targeted to secretory granules, while 
lysosomal enzymes, which are also manufactured in the endoplas- 
mic reticulum, are specifically targeted to a lysosome. Different 


259 


WaSKG SULIQUSLWOPUF 24} JO MAIAISAC UY e |'8 


260 organelles also contain different integral membrane proteins. 


Bupjoyjes, SueIqud\ PUue ‘UO!}OUNY ‘aIN}ONIIS :swa}shs aueIquUa| DIUse|doIAD e 8 YALdVHOD 


Consequently, membrane proteins must also be targeted to particu- 
lar organelles, such as a lysosome or Golgi cisterna. These various 
types of cargo—secreted proteins, lysosomal enzymes, and mem- 
brane proteins—are routed to their appropriate cellular destinations 
by virtue of specific “addresses” or sorting signals that are encoded in 
the amino acid sequence of the proteins or in the attached oligosac- 
charides. The sorting signals are recognized by specific receptors 
that reside in the membranes or surface coats of budding vesicles, 
ensuring that the protein is transported to the appropriate destina- 
tion. For the most part, the machinery responsible for driving this 
complex distribution system consists of soluble proteins that are 
recruited to specific membrane surfaces. During the course of this 
chapter we will try to understand why one protein is recruited, for 
example, to the endoplasmic reticulum whereas another protein 
might be recruited to a particular region of the Golgi complex. 

Great advances have been made over the past three decades in 
mapping the traffic patterns that exist in eukaryotic cells, identifying 
the specific addresses and receptors that govern the flow of traffic, 
and dissecting the machinery that ensures that materials are deliv- 
ered to the appropriate sites in the cell. Motor proteins and cytoskel- 
etal elements, which play key roles in the movements of transport 
vesicles and other endomembranes, will be described in the follow- 
ing chapter. We will begin the study of endomembranes by discuss- 
ing a few of the most important experimental approaches that have 
led to our current understanding of the subject. 


REVIEW | 
1. Compare and contrast the biosynthetic pathway with 
the endocytic pathway. 


2. How are particular proteins targeted to particular 
subcellular compartments? 


8.2 A Few Approaches to the 
Study of Endomembranes 


Early studies with the electron microscope provided biologists with a 
detailed portrait of the structure of cells but gave them little insight 
into the functions of the components they were observing. Determining 
the functions of cytoplasmic organelles required the development of 
new techniques and the execution of innovative experiments. The 
experimental approaches described in the following sections have 
proven particularly useful in providing the foundation of knowledge 
on which current research on cytoplasmic organelles is based. 


Insights Gained from Autoradiography 


Among the many cells in the body, the acinar cells of the pancreas 
have a particularly extensive endomembrane system. These cells 
function primarily in the synthesis and secretion of digestive 
enzymes. After secretion from the pancreas, these enzymes are 
shipped through ducts to the small intestine, where they degrade 
ingested food matter. Where within the pancreatic acinar cells are the 
secretory proteins synthesized, and how do they reach the surface of 
the cells where they are discharged? These questions are inherently 
difficult to answer because all of the steps in the process of secretion 


occur simultaneously within the cell. To follow the steps of a single 
cycle from start to finish, that is, from the synthesis of a secretory 
protein to its discharge from the cell, James Jamieson and George 
Palade of Rockefeller University utilized the technique of autoradi- 
ography (Section 18.8). 

Autoradiography provides a means to visualize biochemical 
processes by allowing an investigator to determine the location of 
radioactively labeled materials within a cell. In this technique, tissue 
sections containing radioactive isotopes are covered with a thin layer 
of photographic emulsion, which is exposed by radiation emanating 
from radioisotopes within the tissue. Sites in the cells containing 
radioactivity are revealed under the microscope by silver grains in 
the overlying emulsion (FIGURE 8.3). 

To determine the sites where secretory proteins are synthesized, 
Palade and Jamieson incubated slices of pancreatic tissue in a solu- 
tion containing radioactive amino acids for a brief period of time. 
During this period, labeled amino acids were taken up by the living 
cells and incorporated into the digestive enzymes as they were being 
synthesized on ribosomes. The tissues were quickly fixed, and the 
locations of proteins that had been synthesized during the brief incu- 
bation with labeled amino acids were determined autoradiographi- 
cally. Using this approach, the endoplasmic reticulum was discovered 
to be the site of synthesis of secretory proteins (Figure 8.3a). 

To determine the intracellular path followed by secretory pro- 
teins from their site of synthesis to their site of discharge, Palade and 
Jamieson carried out an additional experiment. After incubating the 
tissue for a brief period in radioactive amino acids, they washed the 
tissue free of excess isotope and transferred the tissue to a medium 
containing only unlabeled amino acids. An experiment of this type 
is called a “pulse-chase.” The pulse refers to the brief incubation with 
radioactivity during which labeled amino acids are incorporated 
into protein. The chase refers to the period when the tissue is exposed 
to the unlabeled medium, a period during which additional proteins 
are synthesized using nonradioactive amino acids. The longer the 
chase, the farther the radioactive proteins manufactured during the 
pulse will have traveled from their site of synthesis within the cell. 
Using this approach, one can ideally follow the movements of newly 
synthesized molecules by observing a wave of radioactive material 
moving through the cytoplasmic organelles of cells from one loca- 
tion to the next until the process is complete. The results of these 
experiments—which first defined the biosynthetic (or secretory) 
pathway and tied a number of seemingly separate membranous 
compartments into an integrated functional unit—are summarized 
in Figure 8.3b-d. 


Insights Gained from the Use 
of the Green Fluorescent Protein 


The autoradiographic experiments described in the previous section 
require investigators to examine with an electron microscope thin 
sections of different cells that have been fixed at various times after 
introduction of a radioactive label. Techniques involving the use of 
radioactive isotopes have largely been abandoned by modern cell 
biologists in favor of “tagging” proteins of interest using fluorescent 
proteins such as green fluorescent protein (GFP), which allows for 
viewing of protein movement in live cells under a light microscope 
(Section 18.3). 

FIGURE 8.4 shows a pair of micrographs depicting cells that con- 
tain a GFP-tagged protein. In this case, the cells were infected with a 
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strain of the vesicular stomatitis virus (VSV) in which one of the viral 
genes (VSVG) is fused to the GFP gene. Viruses are useful in these 
types of studies because they turn infected cells into factories for the 
production of viral proteins, which are carried like any other protein 
cargo through the biosynthetic pathway. When a cell is infected with 
VSV, massive amounts of the VSVG protein are produced in the 
endoplasmic reticulum (ER). The VSVG molecules then traffic 
through the Golgi complex and are transported to the plasma mem- 
brane of the infected cell where they are incorporated into viral enve- 
lopes. As in a radioactive pulse-chase experiment, the use of a virus 
allows investigators to follow a relatively synchronous wave of protein 
movement, in this case represented by a wave of green fluorescence 


FIGURE 8.3 Autoradiography reveals the sites of synthesis 
and subsequent transport of secretory proteins. (a) Electron 
micrograph of a section of a pancreatic acinar cell that had been 
incubated for 3 minutes in radioactive amino acids and then 
immediately fixed and prepared for autoradiography. The black 
silver grains that appear in the emulsion following development 
are localized over the endoplasmic reticulum. (b-d) Diagrams of 
a sequence of autoradiographs showing the movement of 
labeled secretory proteins (represented by the silver grains in 
red) through a pancreatic acinar cell. When the cell is pulse- 
labeled for 3 minutes and immediately fixed (as shown in a), 
radioactivity is localized in the endoplasmic reticulum (b). After a 
3-minute pulse and 17-minute chase, radioactive label is 
concentrated in the Golgi complex and adjacent vesicles (c). 
After a 3-minute pulse and 117-minute chase, radioactivity is 
concentrated in the secretory granules and is beginning to be 
released into the pancreatic ducts (d). 


Source: (a) Courtesy of James D. Jamieson and George Palade. 


120 min 


20 min 


that begins soon after infection. Synchrony can be enhanced, as was 
done in the experiment depicted in Figure 8.4, by use of a virus with 
a mutant VSVG protein that is unable to leave the ER of infected cells 
that are grown at an elevated temperature (e.g., 40°C). When the 
temperature is lowered to 32°C, the fluorescent GFP-VSVG protein 
that had accumulated in the ER (Figure 8.4a, c) moves synchronously 
to the Golgi complex (Figure 8.45, c), where various processing events 
occur, and then on to the plasma membrane. Mutants of this type 
that function normally at a reduced (permissive) temperature, but 
not at an elevated (restrictive) temperature, are described as temperature- 
sensitive mutants. An experiment utilizing two different fluorescent 
probes is described on page 299. 
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FIGURE 8.4 The use of green fluorescent protein (GFP) reveals the movement of proteins within a living cell. (a) Fluorescence micrograph of a live 
cultured mammalian cell that had been infected with the VSV virus at 40°C. This particular strain of the VSV virus contained a VSVG gene that (1) was fused 
to a gene encoding the fluorescent protein GFP and (2) contained a temperature-sensitive mutation that prevented the newly synthesized VSVG protein 
from leaving the ER when kept at 40°C. The green fluorescence in this micrograph is restricted to the ER. (b) Fluorescence micrograph of a live infected cell 
that was held at 40°C to allow the VSVG protein to accumulate in the ER and then incubated at 32°C for 10 minutes. The fluorescent VSVG protein has 
moved on to the Golgi complex. (c) Schematic drawing showing the retention of the mutant VSVG protein in the ER at 40°C and its synchronous movement 


to the Golgi complex within 10 minutes of incubation at the lower temperature. 


Source: (b) From Daniel S. Chao et al., courtesy of Richard H.Scheller, J. Cell Biol. 144:873, 1999; reproduced with permission of The Rockefeller University Press. 


Insights Gained from the Analysis 
of Subcellular Fractions 


Electron microscopy, autoradiography, and the use of GFP provide 
information on the structure and function of cellular organelles but 
fail to provide much insight into the molecular composition of these 
structures. Techniques to break up (homogenize) cells and isolate 
particular types of organelles were pioneered in the 1950s and 1960s 
by Albert Claude and Christian De Duve. When a cell is ruptured by 
homogenization, the cytoplasmic membranes become fragmented 
and the fractured edges of the membrane fragments fuse to form 
spherical vesicles less than 100 nm in diameter. Vesicles derived from 
different organelles (nucleus, mitochondrion, plasma membrane, 
endoplasmic reticulum, and so forth) have different properties, 
which allow them to be separated from one another, an approach 
that is called subcellular fractionation. 

Membranous vesicles derived from the endomembrane system 
(primarily the endoplasmic reticulum and Golgi complex) form a 


heterogeneous collection of similar sized vesicles referred to as 
microsomes. A rapid (and crude) preparation of the microsomal 
fraction ofa cell is depicted in FIGURE 8.5a. The microsomal fraction 
can be further fractionated into smooth and rough membrane frac- 
tions (Figure 8.5b,c) by the gradient techniques discussed in 
Section 18.6. Once isolated, the biochemical composition of various 
fractions can be determined. In recent years, the identification of the 
proteins present in cell fractions has been carried out using prot- 
eomic technology. Once a particular organelle has been isolated, the 
proteins can be extracted, separated, and identified by mass spec- 
trometry as discussed in Section 18.6. Hundreds of proteins can be 
identified simultaneously, providing a comprehensive molecular 
portrait of any organelle that can be prepared in a relatively pure 
state. In one example of this technology, it was found that a simple 
phagosome (Section 8.20) containing an ingested latex bead com- 
prised more than 160 different proteins, many of which had never 
previously been identified or were not known to be involved in 
phagocytosis. 
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Insights Gained from the Use 
of Cell-Free Systems 


Once techniques to fractionate membranous organelles were devel- 
oped, researchers began to probe the capabilities of these crude sub- 
cellular preparations. They found that the isolated parts of a cell were 
capable of remarkable activities. These early cell-free systems—so 
called because they did not contain whole cells—provided a wealth of 
information about biological processes that were impossible to study 
within the complex environment of intact cells. During the 1960s, for 


FIGURE 8.5 Isolation of a microsomal fraction by differential centrifu- 
gation. (a) When a cell is broken by mechanical homogenization (step 1), 
the various membranous organelles become fragmented and form 
spherical membranous vesicles. Vesicles derived from different organelles 
can be separated by various techniques of centrifugation. In the procedure 
depicted here, the cell homogenate is first subjected to low-speed 
centrifugation to pellet the larger particles, such as nuclei and mitochon- 
dria, leaving the smaller vesicles (microsomes) in the supernatant (step 2). 
The microsomes can be removed from the supernatant by centrifugation at 
higher speeds for longer periods of time (step 3). A crude microsomal 
fraction of this type can be fractionated into different vesicle types in 
subsequent steps. (b) Electron micrograph of a smooth microsomal fraction 
in which the membranous vesicles lack ribosomes. (c) Electron micrograph 
of a rough microsomal fraction containing ribosome-studded membranes. 


Source: (b-c) Courtesy of J. A. Higgins and R. J. Barnett. 


example, George Palade, Philip Siekevitz, and their colleagues at 
Rockefeller University set out to learn more about the properties of 
the rough microsomal fraction (shown in Figure 8.5c), whose mem- 
brane vesicles are derived from the rough ER (Section 8.4). They 
found that they could strip a rough microsomal preparation of its 
attached particles, and the isolated particles (ie., ribosomes) were 
capable of synthesizing proteins when provided with required ingre- 
dients from the cytosol. Under these conditions, the newly synthe- 
sized proteins were simply released by the ribosomes into the aqueous 
fluid of the test tube. When the same experiment was carried out 
using intact rough microsomes, the newly synthesized proteins were 
no longer released into the incubation medium but were trapped 
within the lumen of the membranous vesicles. It was concluded from 
these studies that the microsomal membrane was not required for 
the incorporation of amino acids into proteins but for sequestering 
newly synthesized secretory proteins within the ER cisternal space. 

Over the past few decades, researchers have used cell-free sys- 
tems to identify the roles of many of the proteins involved in mem- 
brane trafficking. James Rothman and Randy Schekman, recipients 
of the 2013 Nobel in Physiology or Medicine, have been noted for 
their use of cell-free systems in their research on vesicle trafficking. 
FIGURE 8.6 shows a liposome with vesicles budding from its surface 
(arrows) from an experiment conducted in Schekman’s lab. As dis- 
cussed on page 120, liposomes are vesicles whose surface consists of 
an artificial bilayer that is created in the laboratory from purified 
phospholipids. The buds and vesicles seen in Figure 8.6 were pro- 
duced after the preparation of liposomes was incubated with puri- 
fied proteins that normally comprise coats on the cytosolic surface 
of transport vesicles within the cell. Without the added coat proteins, 
vesicle budding could not occur. 

Using this strategy in which cellular processes are reconstituted 
in vitro from purified components, researchers have been able to 
study the proteins that bind to the membrane to initiate vesicle for- 
mation, the proteins responsible for cargo selection, and the proteins 
that sever the vesicle from the donor membrane. 


Insights Gained from the Study 
of Mutant Phenotypes 


A mutant is an organism (or cultured cell) whose chromosomes contain 
one or more genes that encode abnormal proteins. When a protein 
encoded by a mutant gene is unable to carry out its normal function, 
the cell carrying the mutation exhibits a characteristic deficiency. 
Determining the precise nature of the deficiency provides information 
on the function of the normal protein. Large-scale studies of the genetic 
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FIGURE 8.6 Formation of coated vesicles in a cell-free system. Electron 
micrograph of a liposome preparation that had been incubated with the 
components required to promote vesicle budding within the cell. The proteins 
in the medium have become attached to the surface of the liposomes and 
have induced the formation of protein-coated buds (arrows). 


Source: Courtesy of Lelio Orci and Randy Schekman. 
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basis of secretion have been carried out largely on yeast cells, most 
notably by Randy Schekman and colleagues (See the Experimental 
Walkthrough Video: Screening for sec mutants for this chapter). 

In yeast, as in all eukaryotic cells, vesicles bud from the ER and 
travel to the Golgi complex, where they fuse with the Golgi cisternae 
(FIGURE 8.7a). To identify genes whose encoded protein is involved 
in this portion of the secretory pathway (i.e., sec genes), researchers 
screen for mutant cells that exhibit an abnormal distribution of cyto- 
plasmic membranes. An electron micrograph of a wild-type yeast 
cell is shown in Figure 8.7b. The cell depicted in Figure 8.7c has a 
mutation in a gene that encodes a protein involved in the formation 
of vesicles at the ER membrane (step 1, Figure 8.7a). In the absence 
of vesicle formation, mutant cells accumulate an expanded endo- 
plasmic reticulum. In contrast, the cell depicted in Figure 8.7d car- 
ries a mutation in a gene that encodes a protein involved in vesicle 
fusion (step 2, Figure 8.7a). When this gene product is defective, the 
mutant cells accumulate an excess number of unfused vesicles. 
Researchers have isolated dozens of different mutants which, taken 
as a group, exhibit disruptions in virtually every step of the secretory 
pathway. The genes responsible for these defects have been cloned 
and sequenced, and the proteins they encode have been isolated. 
Isolation of proteins from yeast has launched successful searches for 
homologous proteins (Le., proteins with related sequence) in mammals. 


Golgi 
complex 


FIGURE 8.7 The use of genetic mutants in the study of secretion. (a) The first leg of the biosynthetic secretory pathway in budding yeast. The steps are 
described below. (b) Electron micrograph of a section through a wild-type yeast cell. (c) A yeast cell bearing a mutation in the sec12 gene whose product is 
involved in the formation of vesicles at the ER membrane (step 1, part a). Because vesicles cannot form, expanded ER cisternae accumulate in the cell. 

(d) A yeast cell bearing a mutation in the sec17 gene, whose product is involved in vesicle fusion (step 2, part a). Because they cannot fuse with Golgi 
membranes, the vesicles (indicated by the arrowheads) accumulate in the cell. [The mutants depicted in c and d are temperature-sensitive mutants. When 
kept at the lower (permissive) temperature, they are capable of normal growth and division.] 


Source: From Chris A. Kaiser and Randy Schekman, Cell 61:724, 1990; © 1990, reprinted with permission from Elsevier. 


One of the most important lessons learned from the use of all of 
these techniques is that the dynamic activities of the endomembrane 
system are highly conserved. Not only do yeast, plant, insect, and 
human cells carry out similar processes, but they do so with remark- 
ably similar proteins. It is evident that the structural diversity of cells 
belies their underlying molecular similarities. In many cases, pro- 
teins from widely divergent species are interchangeable. For exam- 
ple, cell-free systems derived from mammalian cells can often utilize 
yeast proteins to facilitate vesicle transport. Conversely, yeast cells 
with genetic deficiencies that disrupt some phase of the biosynthetic 
pathway can often be “cured” by genetically engineering them to 
carry mammalian genes. 

Over the past decade or so, researchers interested in searching 
for genes that affect a particular cellular process in plant or animal 
cells have taken advantage of a cellular phenomenon called RNA 
interference (RNAi). RNAi is a process in which cells produce small 
RNAs (called siRNAs) that bind to specific mRNAs and inhibit the 
translation of these mRNAs into proteins. This phenomenon and its 
use are discussed in detail in Sections 11.10 and 18.25. For the present 
purpose we will simply note that researchers can synthesize a collec- 
tion (library) of siRNAs that are capable of inhibiting the translation 
of virtually any mRNA that is produced by a genome. Each mRNA 
represents the expression of a specific gene; therefore, one can find 
which genes are involved in a particular process by determining 
which siRNAs interfere with that process. In the experiment depicted 
in FIGURE 8.8, researchers set out to identify genes that were involved 
in various steps of the secretory pathway, a goal similar to that of the 
investigators studying the yeast mutants shown in Figure 8.7. In this 
case, researchers used a strain of cultured Drosophila cells and 
attempted to identify genes that affected the localization of man- 
nosidase II, an enzyme that is synthesized in the endoplasmic reticu- 
lum and moves via transport vesicles to the Golgi complex, where it 
takes up residence. Figure 8.8a shows a control cell that is synthesiz- 
ing a GFP-labeled version of mannosidase IJ; the fluorescence 
becomes localized in the numerous Golgi complexes of the cell as 
would be expected. Figure 8.8b shows a cell that contains siRNA 
molecules that have caused a relocation of the GFP-mannosidase 
into the ER, which is seen to be fused with the Golgi complexes. This 
type of phenotype is typically caused by the absence of one of the 
proteins involved in the transport of the enzyme from the ER to the 
Golgi complex. Of the 130 different siRNAs that were found to inter- 
fere in some way with the secretory pathway in this study, 31 of them 


FIGURE 8.8 Inhibition of gene expression with RNA interference. (a) A 
control Drosophila S2 cultured cell expressing GFP-labeled mannosidase Il. 
The fluorescent enzyme becomes localized in the Golgi complex after its 
synthesis in the ER. (b) A cell that has been genetically engineered to 
express a specific siRNA, which binds to a complementary mRNA and 
inhibits translation of the encoded protein. In this case, the siRNA has 
caused the fluorescent enzyme to remain in the ER, which has fused with 
the Golgi membranes. This phenotype suggests that the mRNA being 
affected by the siRNA encodes a protein involved in an early step of the 
secretory pathway during which the enzyme is synthesized in the ER and 
traffics to the Golgi complex. Among the genes that exhibit this phenotype 
when targeted by siRNAs are those encoding proteins of the COPI coat, 
Sar1, and Sec23. The functions of these proteins are discussed later in 

the chapter. 

Source: (a) From Frederic Bard et al., courtesy of Vivek Malhotra, Nature 
439:604, 2006; © 2006, Reprinted by permission of Macmillan Publishers 
Ltd.; (lb) From Frederic Bard et al., courtesy of Vivek Malhotra, Nature 439: 
604, 2006; © 2006, Reprinted by permission of Macmillan Publishers Ltd. 


generated a phenotype similar to that shown in Figure 8.8b. Included 265 


among these 31 siRNAs were numerous species that inhibited the 
expression of genes that were already known to be involved in the 
secretory pathway. In addition, the study identified other genes 
whose function had been unknown and are now presumed to be 
involved in these processes as well. Because it is easier to synthesize 
a small RNA than to generate an organism with a mutant gene, 
RNAi has become a common strategy to investigate the effect of a 
missing protein. 
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1. Describe the differences between an autoradiograph 
of a pancreas cell that had been incubated in labeled 
amino acids for 3 minutes and immediately fixed and 
one of acell that had been labeled for 3 minutes, 
chased for 40 minutes, and then fixed. 


2. What techniques or approaches might you use to 
learn which proteins are normally present in the 
endoplasmic reticulum? 


3. How does the isolation of a mutant yeast that 
accumulates vesicles provide information on the 
process of protein trafficking? 


4. How can GFP be used to study membrane dynamics? 
° 


8.3 The Endoplasmic Reticulum 


The endoplasmic reticulum (ER) comprises a network of mem- 
branes that penetrates much of the cytoplasm. The ER probably 
evolved from invaginations of the plasma membrane as described in 
Section 1.9. Enclosed within the ER is an extensive space, or lumen, 
that is separated from the surrounding cytosol by the ER membrane. 
As will be evident in the following discussion, the composition of the 
luminal (or cisternal) space inside the ER membranes is quite differ- 
ent from that of the surrounding cytosolic space. Like other subcel- 
lular organelles, the ER is a highly dynamic structure undergoing 
continual turnover and reorganization. 

The endoplasmic reticulum is divided into two subcompartments, 
the rough endoplasmic reticulum (RER) and the smooth endo- 
plasmic reticulum (SER). The rough ER is defined by the presence 
of ribosomes bound to its cytosolic surface, whereas the smooth 
ER lacks associated ribosomes. The RER is typically composed of a 
network of flattened sacs (cisternae), as shown in FIGURE 8.9a and 
8.9c, where each layer is connected to its neighbors by helicoidal 
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membrane of the nuclear envelope, which also bears ribosomes on 
its cytosolic surface (Figure 8.2b). In contrast, the membranes of the 
SER are highly curved and tubular, forming an interconnecting sys- 
tem of pipelines traversing the cytoplasm (Figure 8.9a). When cells 
are homogenized, the SER tubules fragment into smooth-surfaced 
vesicles, whereas the RER sheets fragment into rough-surfaced vesi- 
cles (Figure 8.5b,c). 

Fluorescently labeled proteins and lipids are capable of diffusing 
from one type of ER into the other, indicating that their membranes 
are continuous. In fact, the two types of ER share many of the same 
proteins and engage in certain common activities, such as the synthesis 
of certain lipids and cholesterol. At the same time, numerous proteins 
are found only in one or the other type of ER. For example, the high 
degree of curvature of the SER tubules is induced and maintained by 
the presence of large numbers of membrane-bending proteins, called 
reticulons, which are only present in small numbers at the edges of the 
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flattened RER sheets. The RER, in contrast, contains high levels of pro- 
teins involved in the movement of nascent proteins into the ER lumen. 

Different types of cells contain markedly different ratios of the 
two types of ER, depending on the activities of the cell. For example, 
cells that secrete large amounts of proteins, such as the cells of the 
pancreas or salivary glands, have extensive regions of RER (Figure 
8.9b-d). We will return to the function of the RER shortly, but first 
we will describe the activities of the SER. 


The Smooth Endoplasmic Reticulum 


The SER is extensively developed in a number of cell types, includ- 
ing those of skeletal muscle, kidney tubules, and steroid-producing 
endocrine glands (FIGURE 8.10). SER functions include: 


e Synthesis of steroid hormones in the endocrine cells of the gonad 
and adrenal cortex. 


(b) 0.3 um 


(d) 


FIGURE 8.9 The rough endoplasmic reticulum (RER). (a) Schematic diagram showing the stacks of flattened cisternae that make up the rough ER. The 
cytosolic surface of the ER membrane contains bound ribosomes, which gives the cisternae their rough appearance. (b) Scanning electron micrograph of the 
rough ER in a pancreatic acinar cell. (c) High resolution EM imaging has revealed that ER sheets are connected by helicoidal ramps. A 3D reconstruction of 
such a ramp is shown here. (d) Visualization of the ER in a whole cultured live mammalian cell as revealed by GFP fluorescence of an ER protein. 


Source: (b) From K. Tanaka, Int. Rev. Cytol. 68:101, courtesy of K. Tanaka; 1980; (c) From Terasaki et al., Cell 154:285, 2013; (d) From George H. Patterson, 


Cell Image Library, CIL721. 


including the cytochrome P450 family. These enzymes are 267 
noteworthy for their lack of substrate specificity, being able to 
oxidize thousands of different hydrophobic compounds and 
convert them into more hydrophilic, more readily excreted 
derivatives. The results are not always positive. For example, the 
relatively harmless compound benzo[a]pyrene formed when meat 
is charred on a grill is converted into a potent carcinogen by the 
“detoxifying” enzymes of the SER. Cytochrome P450s metabolize 
many prescribed medications, and genetic variation in these 
enzymes among humans may explain differences from one person 
to the next in the effectiveness and side effects of many drugs. 


e Sequestering calcium ions within the cytoplasm of cells. The 
regulated release of Ca** from the SER of skeletal and cardiac 
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FIGURE 8.10 The smooth ER (SER). Electron micrograph of a Leydig cell muscle cells (known as the sarcoplasmic reticulum in muscle 

from the testis showing the extensive smooth ER where steroid hormones cells) triggers contraction. 

are synthesized. 

Source: Don Fawcett/Photo Researchers, Inc. The Rough Endoplasmic Reticulum 

e Detoxification in the liver of a wide variety of organic compounds, _Early investigations into the functions of the RER were carried out 
including barbiturates and ethanol, whose chronic use canleadto _on cells that secrete large quantities of proteins, such as the acinar 
proliferation of the SER in liver cells. Detoxification is carried out cells of the pancreas (Figure 8.3) or the mucus-secreting cells of the 
by a collection of oxygen-transferring enzymes (oxygenases), lining of the digestive tract (FIGURE 8.11). It is evident from the 
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FIGURE 8.11 Polarized structure of a secretory cell. (a) Drawing of a mucus-secreting 
goblet cell from the rat colon. (b) Low-power electron micrograph of a mucus-secreting cell 
from Brunner’s gland of the mouse small intestine. Both types of cells display a distinctly 
polarized arrangement of organelles, reflecting their role in secreting large quantities of 
mucoproteins. The basal ends of the cells contain the nucleus and rough ER. Proteins synthe- 
sized in the rough ER move into the closely associated Golgi complex and from there into 
membrane-bound carriers in which the final secretory product is concentrated. The apical 
regions of the cells are filled with secretory granules containing the mucoproteins ready for 
release into a duct. 

Source: (a) From Marian Neutra and C. P. Leblond, Copyright 1966, Rockefeller University Press. 
Originally published in The Journal of Cell Biology Volume 30:119. (b): From Alain Rambourg 
(a) and Yves Clermont, Eur. J. Cell Biol. 51:196, 1990; © 1990, reprinted with permission of Elsevier. 
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268 drawing (Figure 8.11a) and micrograph (Figure 8.11b) that the 


Bupjoyjes, Sueiqudy\ Pue ‘UO!}DUNY ‘aIN}ONIIS :swa}shs eueIquUa| ZIse|do1AD e 8 YALdVHD 


organelles of these epithelial secretory cells are positioned in the cell in 
such a way as to produce a distinct polarity from one end to the other. 
The nucleus and an extensive array of RER cisternae are located near 
the basal surface of the cell, which faces the blood supply. The Golgi 
complex is located in the central region of the cell. The apical surface 
of the cell faces a duct that carries the secreted proteins out of the 
organ. The cytoplasm at the apical end of the cell is filled with secre- 
tory granules whose contents are ready to be released into the duct 
upon arrival of the appropriate signal. The polarity of these glandular 
epithelial cells reflects the movement of secretory proteins through 
the cell from their site of synthesis to their site of discharge. The rough 
ER is the starting point of the biosynthetic pathway: It is the site of 
synthesis of the proteins, carbohydrate chains, and phospholipids that 
journey through the membranous compartments of the cell. 


REVIEW | 


1. What are the major morphological differences 
between the RER and SER? What are the major 
differences in their functions? 
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8.4 Functions of the Rough 
Endoplasmic Reticulum 


The functions of the rough ER include the synthesis of proteins and 
the synthesis of most of the lipids of a cells membranes. The addi- 
tion of sugars to the asparagine residues of proteins begins in the 
rough ER and continues in the Golgi complex. This section will pre- 
sent the rough ER’s role in the synthesis of secreted proteins, lysoso- 
mal proteins, and integral membrane proteins. 


Synthesis of Proteins on Membrane- 
Bound versus Free Ribosomes 


The discovery of the rough endoplasmic reticulum as the site of syn- 
thesis of secretory proteins in pancreatic acinar cells was described 
earlier (Figure 8.3). Similar results were found for other types of secre- 
tory cells, including intestinal goblet cells that secrete mucoproteins, 
endocrine cells that secrete polypeptide hormones, plasma cells that 
secrete antibodies, and liver cells that secrete blood serum proteins. 

Further experiments revealed that polypeptides are synthesized 
at two distinct locales within the cell. 


1. Approximately one-third of the proteins encoded by a mamma- 
lian genome are synthesized on ribosomes attached to the cyto- 
solic surface of the RER membranes and released into the ER 
lumen in a process called co-translational translocation. 
These include (a) secreted proteins, (b) integral membrane pro- 
teins, and (c) soluble proteins that reside within compartments of 
the endomembrane system, including the ER, Golgi complex, 
lysosomes, endosomes, vesicles, and plant vacuoles. 


2. Other polypeptides are synthesized on “free” ribosomes, that is, 
on ribosomes that are not attached to the RER, and are subse- 
quently released into the cytosol. This class includes (a) proteins 
destined to remain in the cytosol (such as the enzymes of glycolysis 
and the proteins of the cytoskeleton), (b) peripheral proteins of 
the cytosolic surface of membranes (such as spectrins and 


ankyrins that are only weakly associated with the plasma mem- 
brane’s cytosolic surface), (c) proteins that are transported to the 
nucleus (Section 12.2), and (d) proteins to be incorporated into 
peroxisomes, chloroplasts, and mitochondria. Proteins in the lat- 
ter two groups are synthesized to completion in the cytosol and 
then imported posttranslationally into the appropriate organelle 
across its boundary membrane(s) (Section 8.21). 


What determines the location in a cell where a protein is synthe- 
sized? In the early 1970s, Ginter Blobel, in collaboration with David 
Sabatini and Bernhard Dobberstein of Rockefeller University, first 
proposed, and then demonstrated, that the site of synthesis of a pro- 
tein is determined by the sequence of amino acids in the N-terminal 
portion of the polypeptide, which is the first part to emerge from the 
ribosome during protein synthesis. They suggested the following: 


1. Secretory proteins contain a signal sequence at their N-terminus 
that directs the emerging polypeptide and ribosome to the ER 
membrane. 


2. The polypeptide moves into the cisternal space of the ER through 
a protein-lined, aqueous channel in the ER membrane. It was 
proposed that the polypeptide moves through the membrane as 
it is being synthesized, that is, co-translationally.” 


This proposal, known as the signal hypothesis, has been substan- 
tiated by a large body of experimental evidence. Even more impor- 
tantly, Blobel’s original concept that proteins contain built-in 
“address codes” has been shown to apply in principle to virtually all 
types of protein trafficking pathways throughout the cell. 


Synthesis of Secretory, Lysosomal, 
or Plant Vacuolar Proteins 


The synthesis of secretory, lysosomal, or plant vacuolar proteins 
usually takes place by co-translational translocation. In this process, 
newly forming proteins are deposited into the ER lumen by a ribo- 
some that is attached to the ER membrane (FIGURE 8.12). The syn- 
thesis of the polypeptide begins after a messenger RNA binds to a 
free ribosome, that is, one that is not attached to a cytoplasmic mem- 
brane. In fact, all ribosomes are thought to be identical; those 
employed in the synthesis of secretory, lysosomal, or plant vacuolar 
proteins are not distinguishable from those used for production of 
proteins that remain in the cytosol. Polypeptides that undergo co- 
translational translocation contain a signal sequence—which typi- 
cally includes a stretch of 6-15 hydrophobic amino acid residues— 
that targets the nascent polypeptide to the ER membrane and leads 
to the compartmentalization of the polypeptide within the ER lumen. 
(A nascent polypeptide is one in the process of being synthesized.) 
Although the signal sequence is usually located at or near the 
N-terminus, it occupies an internal position in some polypeptides. 
As it emerges from the ribosome, the hydrophobic signal 
sequence is recognized by a signal recognition particle (SRP), 
which consists in mammalian cells of six distinct polypeptides and a 
RNA molecule, called the 7SL RNA. The SRP binds to both the signal 
sequence on the nascent polypeptide and the ribosome (step 1, 


“It should be noted that protein transport across the ER membrane can also occur 
posttranslationally (i.e., after synthesis). In this process, the polypeptide is synthesized 
totally in the cytosol and then imported into the ER lumen through the same protein- 
conducting channels used in the cotranslational pathway. The posttranslational path- 
way is utilized much more heavily in yeast than in mammalian cells for import into the 
ER. In fact, yeast cells that are unable to engage in cotranslational transport into the 
ER are still able to survive, even though they grow more slowly than normal cells. 
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FIGURE 8.12 A schematic model of the synthesis of a secretory protein (or a lysosomal enzyme) on a membrane-bound ribosome of the RER. 

(a) Synthesis of the polypeptide begins on a free ribosome. As the signal sequence (shown in red) emerges from the ribosome, it binds to the SRP (step 1), which 
stops further translation until the SRP-ribosome-nascent chain complex can make contact with the ER membrane. The SRP-ribosome complex then collides with 
and binds to an SRP receptor (SR) situated within the ER membrane (step 2). Attachment of this complex to the SRP receptor is followed by release of the SRP 
and the association of the ribosome with a translocon of the ER membrane (step 3). These latter events are accompanied by the reciprocal hydrolyis of GTP 
molecules (not shown) bound to both the SRP and its receptor. In the model depicted here, the signal peptide then binds to the interior of the translocon, 
displacing the plug from the channel and allowing the remainder of the polypeptide to translocate through the membrane cotranslationally (step 4). After 
the nascent polypeptide passes into the lumen of the ER, the signal peptide is cleaved by a membrane protein (the signal peptidase, not shown), and the protein 
undergoes folding with the aid of ER chaperones, such as BiP. Studies suggest that translocons are organized into groups of two or four units rather than singly as 
shown here. (b) Cross-sectional view of the translocon channel from the side based on the X-ray crystal structure of an archaebacterial translocon. The hourglass 
shape of the aqueous channel and its helical plug are evident. The ring of hydrophobic side chains (green) situated at the narrowest site within the channel is also 
shown. (c) Representation of a ribosome-translocon complex in the act of synthesis and translocation of a nascent protein based on cryo-EM (Section 18.14). The 
exit channel within the ribosome is seen to be aligned with the conducting channel within the translocon. PCC, protein conducting channel; NC, nascent chain; 


P-tRNA, peptidyl-t-RNA; 40S and 60S, ribosomal subunits. 


Source: (b) Reprinted by permission from Macmillan Publishers Ltd: From Tom A. Rapoport, Nature 450:664, 2007; © Copyright 2007. (c) From Thomas 
Becker et al., Science, Vol 326, 1372, 2009, Figure 6e. Roland Beckmann, University of Munich. Reprinted with permission from AAAS. 


Figure 8.12a), temporarily arresting further synthesis of the polypep- 
tide. The SRP-ribosome-nascent polypeptide complex is then 
recruited to the ER membrane through an interaction between the 
SRP and the SRP receptor on the ER membrane (step 2). The ribo- 
some (with its nascent polypeptide) is then handed off from the SRP 
to the translocon (step 3). The translocon is a protein channel 
embedded in the ER membrane through which the nascent polypep- 
tide is able to move in its passage from the ribosome to the ER lumen. 

X-ray crystallography of the translocon (Figure 8.12b) has pro- 
vided insight into how translocation occurs. These studies have 
revealed the presence within the translocon of a pore in the shape of 
an hourglass with a ring of six hydrophobic amino acids situated at 


its narrowest diameter. In the inactive (i.e., non-translocating) state, 
the opening in the pore ring is plugged by a short @ helix. This plug 
is thought to seal the channel, preventing the unwanted passage of 
calcium and other ions between the cytosol and the ER lumen. 
Upon attachment of the ribosome to the translocon, the signal 
sequence is recognized, and the nascent polypeptide is inserted into 
the narrow aqueous channel of the translocon (step 3). It is proposed 
that contact of the signal sequence with the interior of the translocon 
leads to displacement of the plug and opening of the passageway. The 
growing polypeptide is then translocated through the hydrophobic 
pore ring and into the ER lumen (step 4 and Figure 8.12c). Because the 
pore ring observed in the crystal structure has a diameter (5-8 A) that 
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thought that the channel is able to open in a hinge-like manner as the 
nascent chain traverses the channel. Upon termination of translation 
and passage of the completed polypeptide through the translocon, the 
membrane-bound ribosome is released from the ER membrane and 
the helical plug is reinserted into the translocon channel. 

Several of the steps involved in the synthesis and trafficking of 
secretory proteins are regulated by the binding or hydrolysis of GTP. 
As will be discussed at length in Chapter 15 and elsewhere in this 
chapter, GTP-binding proteins (or G proteins) play key regulatory 
roles in many different cellular processes.’ G proteins can be present 
in at least two alternate conformations, one containing a bound GTP 
molecule and the other a bound GDP molecule. GTP-bound and 
GDP-bound versions ofa G protein have different conformations and 
thus have different abilities to bind other proteins. Because of this dif- 
ference in binding properties, G proteins act like “molecular switches”; 
the GTP-bound protein typically turns the process on, and hydrolysis 
of the bound GTP turns it off. Among the components depicted in 
Figure 8.12, both the SRP and the SRP receptor are G proteins that 
interact with one another in their GTP-bound states. Recent evidence 
has shown that, just prior to GTP hydrolysis, the G protein complex 
undergoes a dramatic conformational change that simultaneously 
exposes the signal sequence and makes room for the translocon to 
bind to the ribosome. The hydrolysis of GTP bound to these two pro- 
teins triggers the release of the signal sequence by the SRP. 


Processing of Newly Synthesized Proteins 
in the Endoplasmic Reticulum 


As it enters the RER cisterna, a nascent polypeptide is acted on by a 
variety of enzymes located within either the membrane or the lumen 
of the RER. The N-terminal portion containing the signal peptide is 
removed from most nascent polypeptides by a proteolytic enzyme, the 
signal peptidase. Carbohydrates are added to the nascent protein 
by the enzyme oligosaccharyltransferase (discussed in Section 8.6). 
Both the signal peptidase and oligosaccharyltransferase are integral 
membrane proteins associated with the translocon that act on nascent 
proteins as they enter the ER lumen. 

The RER is a major protein processing plant. To meet its obliga- 
tions, the RER lumen is packed with molecular chaperones that rec- 
ognize and bind to unfolded or misfolded proteins and give them the 
opportunity to attain their correct (native) three-dimensional struc- 
ture (Section 8.7). The ER lumen also contains a number of protein- 
processing enzymes, such as protein disulfide isomerase (PDI). Proteins 
enter the ER lumen with their cysteine residues in the reduced (—SH) 
state, but they leave the compartment with many of these residues 
joined to one another as oxidized disulfides (—SS—) (page 50). The 
formation (and rearrangement) of disulfide bonds is catalyzed by 
PDL Disulfide bonds play an important role in maintaining the stabil- 
ity of proteins that are present at the extracellular surface of the plasma 
membrane or secreted into the extracellular space. 

The endoplasmic reticulum is ideally constructed for its role as a 
port of entry for the biosynthetic pathway of the cell. Its membrane 
provides a large surface area to which many ribosomes can attach (an 
estimated 13 million per liver cell). The lumen of the ER cisternae 
provides a specialized local environment that favors the modification, 


3GTP proteins generally require accessory proteins to carry out their function. The roles 
of these proteins are discussed in Chapter 15 and illustrated in Figure 15.21. They will not 
be considered in the present chapter, even though they are involved in these activities. 


folding, and assembly of a selected subset of the cell’s proteins. The 
segregation of these newly synthesized proteins in the ER cisternae 
removes them from the cytosol and allows them to be modified and 
dispatched toward their ultimate destination, whether outside the cell 
or within one of the cytoplasm’s membranous organelles. 


Synthesis of Integral Membrane Proteins 
on ER-Bound Ribosomes 


Integral membrane proteins—other than those of mitochondria and 
chloroplasts—are also synthesized by co-translational translocation 
using the same machinery described for the synthesis of secretory 
and lysosomal proteins (Figure 8.12). Unlike soluble secretory and 
lysosomal proteins, however, which pass entirely through the ER 
membrane during translocation, integral proteins contain one or 
more hydrophobic transmembrane segments (page 127) that are 
shunted directly from the channel of the translocon into the lipid 
bilayer. How can such a transfer take place? 

The X-ray crystallographic studies of the translocon described 
above showed the translocon to have a clam-shaped conformation 
with a groove or seam along one side of the wall where the channel 
might open and close. As a polypeptide passes through the translocon, 
it is proposed that this lateral “gate” in the channel continually opens 
and closes, which gives each segment of the nascent polypeptide an 
opportunity to partition itself according to its solubility properties 
into either the aqueous compartment within the translocon channel 
or the surrounding hydrophobic core of the lipid bilayer. Those seg- 
ments of the nascent polypeptide that are sufficiently hydrophobic will 
spontaneously “dissolve” into the lipid bilayer and ultimately become 
transmembrane segments of an integral membrane protein. This con- 
cept has received strong support from an in vitro study in which trans- 
locons were given the opportunity to translocate custom-designed 
nascent proteins that contained test segments of varying hydrophobic- 
ity. The more hydrophobic the test segment, the greater the likelihood 
it would pass through the wall of the translocon and become inte- 
grated as a transmembrane segment of the bilayer. 

FIGURE 8.13 shows the synthesis of a pair of integral membrane 
proteins containing a single transmembrane segment. Single-spanning 
membrane proteins can have an orientation with their N-terminus 
facing either the cytosol or the lumen of the ER (and eventually the 
extracellular space). As noted on page 128, the most common deter- 
minant of membrane protein alignment is the presence of positively 
charged amino acid residues flanking the cytosolic end of a trans- 
membrane segment (see Figure 4.18). During the synthesis of 
membrane proteins, the inner lining of the translocon is thought 
to orient the nascent polypeptide, as indicated in Figure 8.13, so that 
the more positive end faces the cytosol. In multispanning proteins (as 
shown in Figure 4.32d), sequential transmembrane segments typically 
have opposite orientations. For these proteins, their arrangement 
within the membrane is determined by the direction in which the first 
transmembrane segment is inserted. Once that has been determined, 
every other transmembrane segment has to be rotated 180° before it 
can exit the translocon. Studies performed with purified components 
in cell-free systems suggest that a translocon, by itself, is capable of 
properly orienting transmembrane segments. It would appear that the 
translocon is more than a simple passageway through the ER mem- 
brane; it is a complex machine capable of recognizing various signal 
sequences and performing complex mechanical activities. 

In recent years, new classes of transmembrane proteins have 
been found that bypass the translocon. One such class of proteins, 
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FIGURE 8.13 A schematic model for the synthesis of an integral membrane protein that contains a single transmembrane segment near the N-terminus 
of the nascent polypeptide. The SRP and the various components of the membrane that were shown in Figure 8.12 are also involved in the synthesis of 
integral proteins, but they have been omitted for simplicity. The nascent polypeptide enters the translocon just as if it were a secretory protein (step 1). 
However, the entry of the hydrophobic transmembrane sequence into the pore blocks further translocation of the nascent polypeptide through the channel. 
Steps 2-3 show the synthesis of a transmembrane protein whose N-terminus is in the lumen of the ER and C-terminus is in the cytosol. In step 2, the lateral 
gate of the translocon has opened and expelled the transmembrane segment into the bilayer. Step 3 shows the final disposition of the protein. Steps 2a—4a 
show the synthesis of a transmembrane protein whose C-terminus is in the lumen and N-terminus is in the cystosol. In step 2a, the translocon has reoriented 
the transmembrane segment, in keeping with its reversed positively and negatively charged flanks. In step 3a, the translocon has opened laterally and 
expelled the transmembrane segment into the bilayer. Step 4a shows the final disposition of the protein. White-colored + and -signs indicate the proposed 
charge displayed by the inner lining of the translocon. The difference in charge between the phospholipids of the cytosolic and luminal leaflets of the bilayer 
(indicated by the yellow + and — signs) is also thought to play a role in determining membrane protein topology. The transmembrane segments are shown as 
helices based on studies indicating that these regions adopt a helical secondary structure within the ribosomal exit tunnel before they enter the translocon. 


called tail-anchored proteins, lack a signal sequence, but nonetheless 
manage to be properly inserted into the ER membrane. These pro- 
teins are synthesized in the cytoplasm, and targeted to the ER 
through a series of interactions with proteins in the Get (Guided 
Entry of Tail-Anchored proteins) pathway. 


REVIEW | 


1. Describe the steps that occur between the time a 
ribosome attaches to a messenger RNA encoding a 
secretory protein and the time the protein leaves the RER. 


2. How are newly synthesized integral proteins inserted 
into a membrane? 


ee 


8.5 Membrane Biosynthesis 
in the Endoplasmic Reticulum 


Membranes do not arise de novo, that is, as new entities from pools of 
protein and lipid that mix together. Instead, membranes arise from pre- 
existing membranes. Membranes grow as newly synthesized proteins, 
and lipids are inserted into existing membranes in the endoplasmic 
reticulum (ER). As will be apparent in the following discussion, mem- 
brane components move from the ER to virtually every other compart- 
ment in the cell. As the membrane moves from one compartment to the 
next, its proteins and lipids are modified by enzymes that reside in the 
cell’s various organelles. These modifications contribute to giving each 
membrane compartment a unique composition and distinct identity. 
Keep in mind that cellular membranes are asymmetric: The two 
phospholipid layers (leaflets) of a membrane have different compo- 
sitions (page 121). This asymmetry is established initially in the 
endoplasmic reticulum. Asymmetry is maintained as membrane 
carriers bud from one compartment and fuse to the next. As a result, 
domains situated at the cytosolic surface of the ER membrane can be 


identified on the cytosolic surface of transport vesicles, the cytosolic 
surface of Golgi cisternae, and the internal (cytoplasmic) surface of 
the plasma membrane (FIGURE 8.14). Similarly, domains situated at 
the luminal surface of the ER membrane maintain their orientation 


Exterior 


Rough » 
endoplasmic reticulum 


FIGURE 8.14 Maintenance of membrane asymmetry. As each protein 
is synthesized in the rough ER, it becomes inserted into the lipid bilayer 

in a predictable orientation determined by its amino acid sequence. 

This orientation is maintained throughout its travels in the endomembrane 
system, as illustrated in this figure. The carbohydrate chains, which are first 
added in the ER, provide a convenient way to assess membrane sidedness 
because they are always present on the cisternal side of the cytoplasmic 
membranes, which becomes the exoplasmic side of the plasma membrane 
following the fusion of vesicles with the plasma membrane. 
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membrane. In fact, in many ways, including its high calcium con- 
centration and abundance of proteins with disulfide bonds and car- 
bohydrate chains, the lumen of the ER (as well as other compartments 
of the secretory pathway) is a lot like the extracellular space. 

Most membrane lipids are synthesized entirely within the 
endoplasmic reticulum. The major exceptions are (1) sphingomye- 
lin and glycolipids, whose synthesis begins in the ER and is com- 
pleted in the Golgi complex; and (2) some of the unique lipids of the 
mitochondrial and chloroplast membranes, which are synthesized 
by enzymes that reside in those membranes. The enzymes involved 
in the synthesis of phospholipids are themselves integral proteins of 
the ER membrane with their active sites facing the cytosol. Newly 
synthesized phospholipids are inserted into the half of the bilayer 
facing the cytosol. Some of these lipid molecules are later flipped 
into the opposite leaflet through the action of enzymes called 
flippases. Lipids are carried from the ER to the Golgi complex and 
plasma membrane as part of the bilayer that makes up the walls of 
transport vesicles. 

The membranes of different organelles have markedly different 
lipid composition (FIGURE 8.15a), which indicates that changes take 
place as membrane flows through the cell. Several factors may con- 
tribute to such changes (Figure 8.15): 


1. Most membranous organelles contain enzymes that modify 
lipids already present within a membrane, converting one type of 
phospholipid (e.g., phosphatidylserine) to another (e.g., phos- 
phatidylcholine) (step 1, Figure 8.150). 


2. When vesicles bud from a compartment (as in Figure 8.2a), some 
types of phospholipids may be preferentially included within the 
membrane of the forming vesicle, while other types may be left 
behind (step 2, Figure 8.15b). 


3. Cells contain lipid-transfer proteins that can bind and trans- 
port lipids through the aqueous cytosol from one membrane com- 
partment to another (step 3, Figure 8.15b). These proteins 
facilitate the movement of specific lipids from the ER to other 
organelles without the involvement of transport vesicles. Lipid 
transfer is thought to occur at sites where the ER comes into very 
close proximity to the outer membrane of other organelles. 


FIGURE 8.15 Modifying the lipid composition of membranes. 

(a) Histogram indicating percentage of each of three phospholipids 
(phosphatidylcholine, phosphatidylserine, and sphingomyelin) in three 
different cellular membranes (ER, Golgi complex, and plasma membrane). 
The percentage of each lipid changes gradually as membrane flows from 
the ER to the Golgi to the plasma membrane. (b) Schematic diagram 
showing three distinct mechanisms that might explain how the 
phospholipid composition of one membrane in the endomembrane system 
can be different from another membrane in the system, even though the 
membranous compartments are spatially and temporally continuous. 

(1) The head groups of phospholipids of the bilayer are modified 
enzymatically; (2) the membrane of a forming vesicle contains a different 
phospholipid composition from the membrane it buds from; (3) lipids can 
be removed from one membrane and inserted into another membrane by 
lipid-transfer proteins. 


REVIEW | 


1. Describe some of the ways that membranous 
organelles can maintain their unique compositions 
despite the continuous traffic of membranes and 
materials moving through them. 


2. Describe how membrane asymmetry is maintained as 
membrane moves from the ER to the plasma 
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PM = Erythrocyte plasma membrane 
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8.6 Glycosylation in the Rough 
Endoplasmic Reticulum 


Nearly all of the proteins produced on membrane-bound ribosomes— 
whether integral components of a membrane, soluble lysosomal or 
vacuolar enzymes, or parts of the extracellular matrix—become gly- 
coproteins. Carbohydrate groups have key roles in the function of 
many glycoproteins, particularly as binding sites in their interac- 
tions with other macromolecules, as occurs during many cellular 
processes. They also aid in the proper folding and stabilization of the 
protein to which they are attached. The sequences of sugars that 
comprise the oligosaccharides of glycoproteins are highly specific; if 
the oligosaccharides are isolated from a purified protein of a given 
type of cell, their sequence is consistent and predictable. How is the 
order of sugars in oligosaccharides achieved? 


3 UDP 


3 UDP—® 


The addition of sugars to an oligosaccharide chain is catalyzed 273 
by a large family of membrane-bound enzymes called glycosyl- 
transferases. Each of these enzymes transfers a specific mono- 
saccharide from a nucleotide sugar, such as GDP-mannose or 
UDP-N-acetylglucosamine (FIGURE 8.16), to the growing end of the 
carbohydrate chain. The sequence in which sugars are transferred 
during assembly of an oligosaccharide depends on the sequence of 
action of glycosyltransferases that participate in the process. This in 
turn depends on the location of specific enzymes within the various 
membranes of the secretory pathway. Thus the arrangement of sug- 
ars in the oligosaccharide chains of a glycoprotein depends on the 
spatial localization of particular enzymes in the assembly line. 

The initial steps in the assembly of N-linked oligosaccharides 
(as opposed to O-linked oligosaccharides; see Figure 4.11) of both 
soluble proteins and integral membrane proteins are shown in Figure 
8.16. The basal, or core, segment of each carbohydrate chain is not 
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FIGURE 8.16 Steps in the synthesis of the core portion of an N-linked oligosaccharide in the rough ER. The first seven sugars (five mannose and two 
NAG residues) are transferred one at a time to the dolichol-PP on the cytosolic side of the ER membrane (steps 1 and 2). At this stage, the dolichol with its 
attached oligosaccharide is then flipped across the membrane (step 3), and the remaining sugars (four mannose and three glucose residues) are attached on 
the luminal side of the membrane. These latter sugars are attached one at a time on the cytosolic side of the membrane to the end of a dolichol phosphate 
molecule (as in steps 4 and 7), which then flips across the membrane (steps 5 and 8) and donates its sugar to the growing end of the oligosaccharide chain 
(steps 6 and 9). Once the oligosaccharide is completely assembled, it is transferred enzymatically to an asparagine residue (within the sequence N-X-S/T) 

of the nascent polypeptide (step 10). The dolichol-PP is flipped back across the membrane (step 11) and is ready to begin accepting sugars again 


(steps 12 and 13). 


Source: From D. Voet and J. G. Voet, Biochemistry, 2e, Copyright 1995; John Wiley & Sons, Inc. Reprinted by permission of John Wiley & Sons, Inc. 
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pendently on a lipid carrier and then transferred, as a 
block, to specific asparagine residues of the polypep- 
tide. This lipid carrier, which is named dolichol 
phosphate, is embedded in the ER membrane. 
Sugars are added to the dolichol phosphate molecule 
one at a time by membrane-bound glycosyltrans- 
ferases, beginning with step 1 of Figure 8.16. This 
part of the glycosylation process is essentially invari- 
ant; in mammalian cells, it begins with the transfer of 
N-acetylglucosamine 1-phosphate, followed by the 
transfer of another N-acetylglucosamine, then nine 
mannose and three glucose units in the precise pat- 
tern indicated in Figure 8.16. This preassembled 
block of 14 sugars is then transferred by the ER 
enzyme oligosaccharyltransferase from dolichol 
phosphate to certain asparagines in the nascent poly- 
peptide (step 10, Figure 8.16) as the polypeptide is 
being translocated into the ER lumen. 

Mutations that lead to the total absence of N-gly- 
cosylation cause the death of embryos prior to 
implantation. However, mutations that lead to partial 
disruption of the glycosylation pathway in the ER are 
responsible for serious inherited disorders affecting 
nearly every organ system. These diseases are called 
Congenital Diseases of Glycosylation, or CDGs, and 
they are usually identified through blood tests that 
detect abnormal glycosylation of serum proteins. 
One of these diseases, CDG1b, can be managed 
through a remarkably simple treatment. CDGI1b 
results from the deficiency of the enzyme phospho- 
mannose isomerase, which catalyzes the conversion 
of fructose 6-phosphate to mannose 6-phosphate, a 
crucial reaction in the pathway that makes mannose 
available for incorporation into oligosaccharides. 
The disease can be managed by providing patients with oral supple- 
ments of mannose. The treatment was first tested in a boy who was 
dying from uncontrolled gastrointestinal bleeding, one of the usual 
complications of the disease. Within months of taking mannose sup- 
plements the child was living a normal life. 

Shortly after it is transferred to the nascent polypeptide, the oli- 
gosaccharide chain undergoes a process of gradual modification. 
This modification begins in the ER with the enzymatic removal of 
two of the three terminal glucose residues (step 1, FIGURE 8.17). This 
sets the stage for an important event in the life of a newly synthesized 
glycoprotein in which it is screened by a system of quality control 
that determines whether it is fit to move on to the next compartment 
of the biosynthetic pathway. To begin this screening process, each 
glycoprotein, which at this stage contains a single remaining glucose, 
binds to an ER chaperone (calnexin or calreticulin) (step 2). Removal 
of the remaining glucose by glucosidase II leads to release of the gly- 
coprotein from the chaperone (step 3). Ifa glycoprotein at this stage 
has not completed its folding or has misfolded, it is recognized by a 
conformation-sensing enzyme (called UGGT) that adds a single glu- 
cose residue back to one of the mannose residues at the exposed end 
of the recently trimmed oligosaccharide (step 4). UGGT recognizes 
incompletely folded or misfolded proteins because they display 
exposed hydrophobic residues that are absent from properly folded 
proteins. Once the glucose residue has been added, the “tagged” gly- 
coprotein is recognized by the same ER chaperones, which give the 
protein another chance to fold properly (step 5). After a period of 
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FIGURE 8.17 Quality control: ensuring that misfolded proteins do not proceed forward. 
Based on this proposed mechanism, misfolded proteins are recognized by a glucosyltrans- 
ferase (UGGT) which adds a glucose to the end of the oligosaccharide chains. Glycoproteins 
containing monoglucosylated oligosaccharides are recognized by the membrane-bound 
chaperone calnexin and given an opportunity to achieve their correctly folded (native) state. If 
that does not occur after repeated attempts, the protein is dislocated to the cytosol and 
destroyed. The steps are described in the text. A soluble chaperone (calreticulin) participates 
in this same quality-control pathway. 


Source: L. Ellgaard et al., Science 286:984, 1999; copyright 1999, reprinted with permission 


time with the chaperone, the added glucose residue is removed and 
the conformation-sensing enzyme checks it again to see if it has 
achieved its proper three-dimensional structure. If it is still partially 
unfolded or misfolded, another glucose residue is added and the 
process is repeated until, eventually, the glycoprotein has either 
folded correctly and continues on its way (step 6) or remains mis- 
folded and is destroyed. Studies suggest that the “decision” to destroy 
the defective protein begins with the activity ofa slow-acting enzyme 
in the ER that trims a mannose residue from an exposed end of the 
oligosaccharide of a protein that has been in the ER for an extended 
period. Once one or more of these mannose residues have been 
removed (step 7), the protein can no longer be recycled and, instead, 
is sentenced to degradation (step 8). 

We will pick up the story of protein glycosylation again on page 
278, where the oligosaccharide that is assembled in the ER is enlarged 
as it passes through the Golgi complex on its journey through the 
biosynthetic pathway. 


REVIEW | 


1. Describe how N-linked oligosaccharides are 
assembled on target proteins. 
2. Describe the quality control steps that take place 


in the RER to ensure that glycoproteins are properly 
folded. 


OO 


FIGURE 8.18 A model of the mammalian unfolded protein response 
(UPR). The ER contains transmembrane proteins that function as sensors of 
stressful events that occur within the ER lumen. Under normal conditions, 
these sensors are present in an inactive state as the result of their associa- 
tion with chaperones, particularly BiP (step 1). If the number of unfolded or 
misfolded proteins should increase to a high level, the chaperones are 
recruited to aid in protein folding, which leaves the sensors in their 
unbound, activated state and capable of initiating a UPR. At least three 
distinct UPR pathways have been identified in mammalian cells, each 
activated by a different protein sensor. Two of these pathways are depicted 
in this illustration. In one of these pathways, the release of the inhibitory BiP 
protein leads to the dimerization of a sensor (called PERK) (step 2). In its 
dimeric state, PERK becomes an activated protein kinase that phosphoryl- 
ates a protein (elF2a) that is required for the initiation of protein synthesis 
(step 3). This translation factor is inactive in the phosphorylated state, which 
stops the cell from synthesizing additional proteins in the ER (step 4), giving 
the cell more time to process those proteins already present in the ER 
lumen. In the second pathway depicted here, release of the inhibitory BiP 
protein allows the sensor (called ATF6) to move on to the Golgi complex 
where the cytosolic domain of the protein is cleaved away from its 
transmembrane domain (step 2a). The cytosolic portion of the sensor 
diffuses through the cytosol (step 3a) and into the nucleus (step 4a), where 
it stimulates the expression of genes whose encoded proteins can alleviate 
the stress in the ER (step 5a). These include chaperones, coat proteins that 
form on transport vesicles, and proteins of the quality-control machinery. 
(Discussion of the third protein sensor (IRE1) can be found in Science 
334:1081, 2011.) 


8.7 Mechanisms That Ensure the 
Destruction of Misfolded Proteins 


We have just seen how proteins that fail to fold properly are detected 
by ER enzymes. It was a surprise to discover that misfolded proteins 
are not destroyed in the ER, but instead are transported into the 
cytosol by a process of dislocation. The precise mechanism of dislo- 
cation remains a matter of debate. Once in the cytosol, the oligosac- 
charide chains are removed, and the misfolded proteins are destroyed 
in proteasomes, which are protein-degrading machines whose struc- 
ture and function are discussed in Section 12.7. This process, known 
as ER-associated degradation (ERAD), ensures that aberrant proteins 
are not transported to other parts of the cell, but it can have negative 
consequences. Over 60 human diseases, including cystic fibrosis, are 
attributed to the ERAD pathway. In most patients with cystic fibro- 
sis, the plasma membrane of epithelial cells is lacking the abnormal 
protein encoded by the cystic fibrosis gene (Section 4.15). In these 
cases, the mutant protein (which often would be functional if allowed 
sufficient time to fold) is destroyed by the quality-control process of 
the ER and thus fails to reach the cell surface. 

Under certain circumstances, misfolded proteins can be gener- 
ated in the ER at a rate faster than they can be exported to the cyto- 
plasm. The accumulation of misfolded proteins, which is potentially 
lethal to cells, triggers a comprehensive “plan of action” within the 
cell known as the unfolded protein response (UPR). The ER 
contains protein sensors that monitor the concentration of unfolded 
or misfolded proteins in the ER lumen. According to the prevailing 
model outlined in FIGURE 8.18, the sensors are normally kept in an 
inactive state by molecular chaperones, particularly BiP. If circum- 
stances should lead to an accumulation of misfolded proteins, the 
BiP molecules in the ER lumen are called into service as chaperones 
for the misfolded proteins, rendering them incapable of inhibiting 
the sensors. Activation of the sensors leads to a multitude of signals 
that are transmitted into both the nucleus and cytosol and result in 
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e The expression of hundreds of different genes whose encoded 
proteins have the potential to alleviate stressful conditions within 
the ER. These include genes that encode (1) ER-based molecular 
chaperones that can help misfolded proteins reach the native 
state, (2) proteins involved in the transport of the proteins out of 
the ER, and (3) proteins involved in the selective destruction of 
abnormal proteins as discussed above. 


e Phosphorylation of a key protein (eIFa) required for protein 
synthesis. This modification inhibits protein synthesis and 
decreases the flow of newly synthesized proteins into the ER. 
This gives the cell an opportunity to remove those proteins that 
are already present in the ER lumen. 


Interestingly, the UPR is more than a cell-survival mechanism; 
it also includes the activation of a pathway that leads to the death of 
the cell. It is presumed that the UPR provides a mechanism to relieve 
itself of the stressful conditions. If these corrective measures are 
not successful, the cell-death pathway is triggered and the cell is 
destroyed. 


REVIEW | 


1. Describe the mechanisms by which the cell ensures 
that misfolded proteins (a) will not go unrecognized 
within the ER and (b) will not accumulate to excessive 
levels within the ER lumen. 


o—_____! 
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FIGURE 8.19 Visualizing membrane traffic with the use of a fluorescent tag. This series of photographs shows a small portion of a living mammalian 
cell that has been infected with the vesicular stomatitis virus (VSV) containing a VSVG-GFP chimeric gene (Figure 8.4). Once it is synthesized in the RER, 
the fusion protein emits a green fluorescence, which can be followed as the protein moves through the cell. In the series of photographs shown here, two 
vesicular-tubular carriers (VTCs) (arrows) containing the fluorescent protein have budded from the ER and are moving toward the Golgi complex (GC). 
The series of events depicted here took place over a period of 13 seconds. Bar represents 6 ~m. 


Source: From John F. Presley et al., Nature 389:82, 1997; © 1997, reprinted by permission of Macmillan Publishers Ltd. 


8.8 ER to Golgi Vesicular 
Transport 


Membrane vesicles, with their enclosed cargo, bud from the edges of 
the ER and are targeted to the Golgi complex. The RER cisternae con- 
tain specialized exit sites that are devoid of ribosomes and serve as 
places where the first transport vesicles in the biosynthetic pathway 
are formed. The trip from the ER toward the Golgi complex can be 
followed visually in living cells by tagging secretory proteins with the 
green fluorescent protein (GFP) as described on page 260. Using this 
technique, it is found that soon after they bud from the ER membrane, 
transport vesicles fuse with one another to form larger vesicles and 
interconnected tubules in the region between the ER and Golgi com- 
plex. This region has been named the ERGIC (endoplasmic reticulum 
Golgi intermediate compartment), and the vesicular-tubular carriers 
that form there are called VT'Cs (see Figure 8.25a). Once formed, the 
VTCs move farther away from the ER toward the Golgi complex. The 
movement of two of these vesicular-tubular membranous carriers 
from the ERGIC to the Golgi complex is shown in FIGURE 8.19. 
Movement of VTCs occurs along tracks composed of microtubules. 


REVIEW |-— 


1. Describe the ERGIC region and how it can be 
experimentally visualized. 


8.9 The Golgi Complex 


In the latter years of the nineteenth century, an Italian biologist, 
Camillo Golgi, was inventing new types of staining procedures that 
might reveal the organization of nerve cells within the central nerv- 
ous system. In 1898, Golgi applied a metallic stain to nerve cells 
from the cerebellum and discovered a darkly stained reticular net- 
work located near the cell nucleus. This network, which was later 
identified in other cell types and named the Golgi complex, helped 
earn its discoverer the Nobel Prize in 1906. The Golgi complex 
remained a center of controversy for decades between those who 
believed that the organelle existed in living cells and those who 
believed it was an artifact, that is, an artificial structure formed dur- 
ing preparation for microscopy. It wasn't until the Golgi complex 
was clearly identified in unfixed, freeze-fractured cells (see Figure 
18.20) that its existence was verified beyond reasonable doubt. 

The Golgi complex has a characteristic morphology consisting 
primarily of flattened, disklike, membranous cisternae with dilated 


rims and associated vesicles and tubules (FIGURE 8.20a). The cister- 
nae, whose diameters are typically 0.5 to 1.0 um, are arranged in an 
orderly stack, much like a stack of pancakes, and are curved so as to 
resemble a shallow bowl (Figure 8.20b).* Typically, a Golgi stack 
contains fewer than eight cisternae. An individual cell may contain 
from a few to several thousand distinct stacks, depending on the cell 
type. The Golgi stacks in mammalian cells form a single, large com- 
plex (Figure 8.20c) typically situated adjacent to the cell’s nucleus 
(Figure 8.20d). A closer look at an individual cisterna suggests that 
vesicles bud from a peripheral tubular domain of each cisterna 
(Figure 8.20e). As discussed later, many of these vesicles contain a 
distinct protein coat that is visible in Figure 8.20e. 

The Golgi complex is divided into several functionally distinct 
compartments arranged along an axis from the cis or entry face clos- 
est to the ER to the trans or exit face at the opposite end of the stack 
(Figures 8.20a,b). The cis-most face of the organelle is composed of 
an interconnected network of tubules referred to as the cis Golgi 
network (CGN). The CGN is thought to function primarily as a 
sorting station that distinguishes between proteins to be shipped 
back to the ER (page 284) and those that are allowed to proceed to 
the next Golgi station. The bulk of the Golgi complex consists of a 
series of large, flattened cisternae, which are divided into cis, 
medial, and trans cisternae (Figure 8.20a). The trans-most face 
of the organelle contains a distinct network of tubules and vesicles 
called the trans Golgi network (TGN). The TGN is a sorting sta- 
tion where proteins are segregated into different types of vesicles 
heading either to the plasma membrane or to various intracellular 
destinations. The membranous elements of the Golgi complex are 
thought to be supported mechanically by a peripheral membrane 
skeleton or scaffold composed of a variety of proteins, including 
members of the spectrin, ankyrin, and actin families—proteins that 
are also present as part of the plasma membrane skeleton (page 137). 
The Golgi scaffold may be physically linked with motor proteins that 
direct the movement of vesicles and tubules entering and exiting the 
Golgi complex. A separate group of fibrous proteins are thought to 
form a Golgi “matrix,” which plays a key role in the disassembly and 
reassembly of the Golgi complex during mitosis. 

FIGURE 8.21 provides visual evidence that the Golgi complex is 
not uniform in composition from one end to the other. Differences in 
composition of the membrane compartments from the cis to the trans 
face reflect the fact that the Golgi complex is primarily a “processing 
plant.” Newly synthesized membrane proteins, as well as secretory and 
lysosomal proteins, leave the ER and enter the Golgi complex at its cis 
face and then pass across the stack to the trans face. As they progress 


*A single Golgi stack in plant cells is sometimes called a dictyosome. 
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FIGURE 8.20 The Golgi complex. (a) Schematic model of a portion of a Golgi complex from an epithelial cell of the male rat reproductive tract. The 
elements of the cis and trans compartments are often discontinuous and appear as tubular networks. (b) Electron micrograph of a portion of a tobacco root 
cap cell showing the cis to trans polarity of the Golgi stack. (c) Electron tomographic image of a slice from a mouse pancreatic beta cell that synthesizes and 
secretes the protein insulin. The individual Golgi stacks are seen to be interconnected to form a continuous ribbon. The trans face (or TGN) of each Golgi 
stack has been colored red and the cis face has been colored light blue. (d) Fluorescence micrograph of a cultured mammalian cell. The position of the 
Golgi complex is revealed by the red fluorescence, which marks the localization of antibodies to a COP! coat protein. (e) Electron micrograph of a single 
isolated Golgi cisterna showing two distinct domains, a concave central domain and an irregular peripheral domain. The peripheral domain consists of a 
tubular network from which protein-coated buds are being pinched off. 

Source: (a) From A. Rambourg and Y. Clermont, copyright 1990, Eur J Cell Biol. Originally published in The Journal of Cell Biology, Volume 51:195; (b) 
Courtesy of Thomas H. Giddings and Andrew Staehelin; (c) Reproduced Courtesy of Brad Marsh, Institute for Molecular Bioscience, The University of 
Queensland, Australia. Reproduced with permission from Rockefeller University Press. Image originally published in J. Cell Biol. 187:449, 2009; (d) From 
Andrei V. Nikonov et al., J. Cell Biol. 158:500, 2002; Courtesy of Gert Kreibich, reproduced with permission of The Rockefeller University Press. 

(e) From Peggy J. Weidman and John Heuser, Trends Cell Biol. 5:303, 1995; © 1995, with permission from Elsevier Science. 
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_ FIGURE 8.21 Regional differences in membrane composition across the Golgi stack. (a) Reduced osmium tetroxide preferentially impregnates the cis 


along the stack, proteins that were originally synthesized in the rough 
ER are sequentially modified in specific ways. In the best-studied 
Golgi activity, a protein’s carbohydrates are modified by a series of 
stepwise enzymatic reactions, as discussed in the following section. 


Glycosylation in the Golgi Complex 


The Golgi complex plays a key role in the assembly of the carbohydrate 
component of glycoproteins and glycolipids. When we left the topic of 
synthesis of N-linked carbohydrate chains page 274, the glucose resi- 
dues had just been removed from the ends of the core oligosaccharide. 
As newly synthesized soluble and membrane glycoproteins pass 
through the cis and medial cisternae of the Golgi stack, most of the 
mannose residues are also removed from the core oligosaccharides, and 
other sugars are added sequentially by various glycosyltransferases. 

In the Golgi complex, as in the RER, the sequence in which sug- 
ars are incorporated into oligosaccharides is determined by the spa- 
tial arrangement of the specific glycosyltransferases that come into 
contact with the newly synthesized protein as it moves through the 
Golgi stack. The enzyme sialyltransferase, for example, which places 
a sialic acid at the terminal position of the chain in animal cells, is 
localized in the trans face of the Golgi stack, as would be expected if 
newly synthesized glycoproteins were continually moving toward this 
part of the organelle. In contrast to the glycosylation events that occur 
in the ER, which assemble a single core oligosaccharide, the glyco- 
sylation steps in the Golgi complex can be quite varied, producing 
carbohydrate domains of remarkable sequence diversity. One of many 
possible glycosylation pathways is shown in FIGURE 8.22. Unlike the 
N-linked oligosaccharides, whose synthesis begins in the ER, those 


cisternae of the Golgi complex. (b) The enzyme mannosidase II, which is involved in trimming the mannose residues from the core oligosaccharide as 
described in the text, is preferentially localized in the medial cisternae. (c) The enzyme nucleoside diphosphatase, which splits dinucleotides (e.g., UDP) 
after they have donated their sugar, is preferentially localized in the trans cisternae. 

Source: (a) From Robert S. Decker, J. Cell Biol. 61:603, 1974; (b) From Angel Velasco et al., J. Cell Biol. 122:41, 1993. All reproduced with permission of The 
Rockefeller University Press. (c) From Robert S. Decker, J. Cell Biol. 61:603, 1974. 


attached to proteins by O-linkages (Figure 4.11) are assembled 
entirely within the Golgi complex. 

The Golgi complex is also the site of synthesis of most of a cell’s 
complex polysaccharides, including the glycosaminoglycan chains 
of the proteoglycan shown in Figure 7.9a and the pectins and hemi- 
cellulose found in the cell walls of plants (see Figure 7.34c). 


The Movement of Materials through 
the Golgi Complex 


That materials move through the various compartments of the Golgi 
complex has long been established; however, two contrasting views 
of the way this occurs have dominated the field for years. Up until 
the mid-1980s, it was generally accepted that Golgi cisternae were 
transient structures. It was supposed that Golgi cisternae formed at 
the cis face of the stack by fusion of membranous carriers from the 
ER and ERGIC and that each cisterna physically moved from the cis 
to the trans end of the stack, changing in composition as it pro- 
gressed. This is known as the cisternal maturation model because, 
according to the model, each cisterna “matures” into the next cis- 
terna along the stack. 

From the mid-1980s to the mid- 1990s, the maturation model of 
Golgi movement was largely abandoned and replaced by an alternate 
model, which proposed that the cisternae of a Golgi stack remain in 
place as stable compartments. In this latter model, which is known 
as the vesicular transport model, cargo (i.e., secretory, lysosomal, and 
membrane proteins) is shuttled through the Golgi stack, from the 
CGN to the TGN, in vesicles that bud from one membrane compart- 
ment and fuse with a neighboring compartment farther along the 
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FIGURE 8.22 Steps in the glycosylation of a typical mammalian N-linked oligosaccharide in the Golgi complex. Following the removal of the three 
glucose residues, various mannose residues are subsequently removed, while a variety of sugars (N-acetylglucosamine, galactose, fucose, and sialic acid) are 


added to the oligosaccharide by specific glycosyltransferases. These enzymes are integral membrane proteins whose active sites face the lumen of the Golgi 


cisternae. This is only one of numerous glycosylation pathways. 


stack. The vesicular transport model is illustrated in FIGURE 8.23a, 
and its acceptance was based largely on the following observations: 


1. Each of the various Golgi cisternae of a stack has a distinct popu- 
lation of resident enzymes (Figure 8.21). How could the various 
cisternae have such different properties if each cisterna was giv- 
ing rise to the next one in line, as suggested by the cisternal matu- 
ration model? 


2. Large numbers of vesicles can be seen in electron micrographs to 
bud from the rims of Golgi cisternae. In 1983, James Rothman and 
his colleagues at Stanford University demonstrated, using cell-free 
preparations of Golgi membranes (page 263), that transport vesi- 
cles were capable of budding from one Golgi cisterna and fusing 
with another Golgi cisterna in vitro. This landmark experiment 
formed the basis for a hypothesis suggesting that inside the cell, 
cargo-bearing vesicles budded from cis cisternae and fused with 
cisternae situated at a more trans position in the stack. 


Although both models of Golgi function continue to have their 
proponents, the consensus of opinion has shifted back to the cister- 
nal maturation model. Several of the major reasons for this shift can 
be noted: 


e The cisternal maturation model envisions a highly dynamic 
Golgi complex in which the major elements of the organelle, the 
cisternae, are continually being formed at the cis face and 
dispersed at the trans face. According to this view, the very 
existence of the Golgi complex itself depends on the continual 
influx of transport carriers from the ER and ERGIC. As pre- 
dicted by the cisternal maturation model, when the formation of 
transport carriers from the ER is blocked either by treatment of 
cells with specific drugs or the use of temperature-sensitive 
mutants (Figure 8.4), the Golgi complex simply disappears. 
When the drugs are removed or the mutant cells are returned to 
the permissive temperature, the Golgi complex rapidly reassem- 
bles as ER-to-Golgi transport is renewed. 


e Certain materials that are produced in the endoplasmic reticu- 
lum and travel through the Golgi complex can be shown to 


remain within the Golgi cisternae and never appear within 
Golgi-associated transport vesicles. For example, studies on 
fibroblasts indicate that large complexes of procollagen 
molecules (the precursors of extracellular collagen) (illustrated 
by the red objects in Figure 8.23b) move from the cis cisternae 
to the trans cisternae without ever leaving the cisternal 

lumen. 


e It was assumed until the mid-1990s that transport vesicles always 
moved in a “forward” (anterograde) direction, that is, from a cis 
origin to a more trans destination. But a large body of evidence 
has indicated that vesicles can move in a “backward” (retrograde) 
direction, that is, from a trans donor membrane to a cis 
acceptor membrane. 


e Studies on live budding yeast cells containing fluorescently 
labeled Golgi proteins have shown directly that the composition 
of an individual Golgi cisterna can change over time—from one 
that contains early (cis) Golgi resident proteins to one that 
contains late (trans) Golgi resident proteins. The results of this 
experiment are shown in Figure 18.10, and they are not compat- 
ible with the vesicular transport model. Whether these results on 
yeast can be extrapolated to a mammalian Golgi complex, which 
has a more complex, stacked structure, remains to be 
determined. 


A current version of the cisternal maturation model is depicted 
in Figure 8.23b. Unlike the original versions of the cisternal matura- 
tion model, the version shown in Figure 8.23b acknowledges a role 
for transport vesicles, which have been clearly shown to bud from 
Golgi membranes. In this model, however, these transport vesicles 
do not shuttle cargo in an anterograde direction, but instead carry 
resident Golgi enzymes in a retrograde direction. Instead, it is the 
Golgi cisternae themselves that serve as the primary anterograde 
Golgi carriers. This model of intra-Golgi transport is supported by 
electron micrographs of the type illustrated in Figure 8.23c,d. These 
micrographs depict ultrathin sections of cultured mammalian cells 
that were cut from a frozen block. In both cases, the frozen sections 
were treated with antibodies that were linked to gold particles prior 
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FIGURE 8.23 The dynamics of transport through the Golgi complex. (a) In the vesicular transport model, cargo (black dots) is carried in an anterograde 
direction by transport vesicles, while the cisternae themselves remain as stable elements. (b) In the cisternal maturation model, the cisternae progress 
gradually from a cis to a trans position and then disperse at the TGN. Transport vesicles carry resident Golgi enzymes (indicated by the colored vesicles) in a 
retrograde direction. The red lens-shaped objects represent large cargo materials, such as procollagen complexes of fibroblasts. (c) Electron micrograph of 
an area of Golgi complex in a thin frozen section of a cell that had been infected with vesicular stomatitis virus (VSV). The black dots are nanosized gold 
particles bound by means of antibodies to VSVG protein, an anterograde cargo molecule. The cargo is restricted to the cisternae and does not appear in 
nearby vesicles (arrows). (d) Electron micrograph of similar nature to that of c but, in this case, the gold particles are not bound to cargo, but to mannosidase 
Il, a resident enzyme of the medial Golgi cisternae. The enzyme appears in both a vesicle (arrow) and cisternae. The labeled vesicle is presumably carrying 
the enzyme in a retrograde direction, which compensates for the anterograde movement of the enzyme as the result of cisternal maturation. Bar, 0.2m. 


(A third model for intra-Golgi transport is discussed in Cell 133:951, 2008.) 


Source: (a) Vesicular transport model; (b) Cisternal maturation model (c) From Alexander A. Mironov et al., courtesy of Alberto Luini, J. Cell Biol. 155:1234, 
2001 reproduced with permission of The Rockefeller University Press; (d) From Jose A. Martinez-menarguez et al., courtesy of Judith Klumperman, J. Cell 
Biol. 155:1214, 2001 reproduced with permission of The Rockefeller University Press. 


to examination in the electron microscope. Figure 8.23c shows a 
section through a Golgi complex after treatment with gold-labeled 
antibodies that bind to a cargo protein, in this case the viral protein 
VSVG (Figure 8.4). The VSVG molecules are present within the cis- 
ternae, but are absent from the nearby vesicles (arrows), indicating 
that cargo is carried in an anterograde direction within maturing 
cisternae but not within small transport vesicles. Figure 8.23d shows 
a section through a Golgi complex after treatment with gold-labeled 
antibodies that bind to a Golgi resident protein, in this case the pro- 
cessing enzyme mannosidase II. Unlike the VSVG cargo protein, 
mannosidase II molecules are found in both the cisternae and asso- 
ciated vesicles (arrow), which strongly supports the proposal that 
these vesicles are utilized to carry Golgi resident enzymes in a retro- 
grade direction. Golgi resident proteins may also move in a retro- 
grade direction through tubules that have been seen to connect 
different Golgi cisternae. 

The cisternal maturation model depicted in Figure 8.23b explains 
how different Golgi cisternae in a stack can have a unique identity. 
An enzyme such as mannosidase II, for example, which removes 
mannose residues from oligosaccharides and is largely restricted to 
the medial cisternae (Figure 8.21), can be recycled backward in trans- 
port vesicles as each cisterna moves toward the trans end of the stack. 
It should be noted that a number of prominent researchers continue 
to argue, based on other experimental results, that cargo can be car- 
ried by transport vesicles between Golgi cisternae in an anterograde 
direction. Thus, the matter remains to be settled. 


REVIEW | 


1. Describe the steps that occur as a soluble secretory 
protein, such as a digestive enzyme in a pancreas 
cell, moves from the RER to the cis Golgi cisterna; 
from a cis cisterna to the TGN. 


2. What is the role of dolichol phosphate in the 
synthesis of membrane glycoproteins? How is the 
sequence of sugars attached to the protein 
determined? 


3. How does the process of glycosylation in the Golgi 
complex compare to that in the RER? 

4. How can the cisternal maturation model of Golgi 
activity be reconciled with the presence of transport 
vesicles in the Golgi region? 


————E 


8.10 Types of Vesicle Transport 


The biosynthetic pathway of a eukaryotic cell consists of a series of 
distinct membrane-bound organelles that function in the synthesis, 
modification, and delivery of soluble and membrane proteins to their 
appropriate destination in the cell. As was illustrated in Figure 8.2a, 
materials are carried between compartments by vesicles (or other 
types of membrane-bound carriers) that bud from donor membranes 


and fuse with acceptor membranes. If one scrutinizes electron micro- 
graphs for vesicles caught in the act of budding, one finds that most 
of these membranous buds are covered on their cytosolic surface by 
a “fuzzy,” electron-dense layer. Further analysis reveals that the dark- 
staining layer consists of a protein coat formed from soluble proteins 
that assemble on the cytosolic surface of the donor membrane at sites 
where budding takes place. Each coated bud pinches off to form a 
coated vesicle such as those shown in FIGURE 8.24. Vesicles of 
similar size and structure can be formed in cell-free systems as illus- 
trated in Figure 8.6. (The discovery of coated vesicles is discussed in 
the Experimental Pathways in Section 8.18.) 

Protein coats have at least two distinct functions: (1) They act as 
a mechanical device that causes the membrane to curve and form a 
budding vesicle, and (2) they provide a mechanism for selecting the 
components to be carried by the vesicle. Selected components 
include (a) cargo consisting of secretory, lysosomal, and membrane 
proteins to be transported and (b) the machinery required to target and 
dock the vesicle to the correct acceptor membrane (Section 8.13). In 
the best understood cases (see Figures 8.26 and 8.40), the vesicle coat 
is composed of two distinct protein layers: an outer cage or scaffold- 
ing that forms the framework for the coat and an inner layer of 


Membranes 


(b) 20 nm 


FIGURE 8.24 Coated vesicles. These electron micrographs show the 
membranes of these vesicles to be covered on their outer (cytosolic) 
surface by a distinct protein coat. The micrograph on the top (a) shows 
aCOPIlI-coated vesicle, whereas the micrograph on the bottom (b) shows a 
COPI-coated vesicle. 


Source: (a-b) Courtesy of Randy Schekman and Lelio Orci. 
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the membrane’s cargo. As discussed below, the adaptors are able to 
select specific cargo molecules by virtue of their specific affinity for 
the cytosolic “tails” of integral proteins that reside in the donor mem- 
brane (see FIGURE 8.25b). 

Several distinct classes of coated vesicles have been identified; 
they are distinguished by the proteins that make up their coat, their 
appearance in the electron microscope, and their role in cell traffick- 
ing. The three best studied coated vesicles are the following: 


1. COPII-coated vesicles (Figure 8.24a) move materials from the 
ER “forward” to the ERGIC and Golgi complex. (Recall from 
Section 8.8 that the ERGIC is the intermediate compartment sit- 
uated between the ER and Golgi complex.) (COP is an acronym 
for coat proteins.) 


2. COPlI-coated vesicles (Figure 8.24b) move materials in a 
retrograde direction (1) from the ERGIC and Golgi stack “back- 
ward” toward the ER and (2) from trans Golgi cisternae “backward” 
to cis Golgi cisternae (see Figure 8.25a). Additional roles for 
COPI vesicles have been debated. 


3. Clathrin-coated vesicles move materials from the TGN to 
endosomes, lysosomes, and plant vacuoles. They also move 
materials from the plasma membrane to cytoplasmic compart- 
ments along the endocytic pathway. They have also been impli- 
cated in trafficking from endosomes and lysosomes. 


We will consider each type of coated vesicle in the following sections. 
A summary of the various transport steps along the biosynthetic or 
secretory pathway that is mediated by each of these coated vesicles is 
indicated in Figure 8.25a, and a comparison of their structures is 
shown in Figure 8.27). 


COPII-Coated Vesicles: Transporting 
Cargo from the ER to the Golgi Complex 


COPII-coated vesicles mediate the first leg of the journey through the 
biosynthetic pathway—from the ER to the ERGIC and Golgi complex 
(Figure 8.25). The COPII coat contains a number of proteins that 
were first identified in mutant yeast cells that were unable to carry out 
transport from the ER to the Golgi complex. Homologues of the yeast 
proteins were subsequently found in the coats of vesicles budding 
from the ER in mammalian cells. Antibodies to COPII-coat proteins 
block budding of vesicles from ER membranes but have no effect on 
movement of cargo at other stages in the secretory pathway. 

COPII coats select and concentrate certain components for 
transport in vesicles. Certain integral membrane proteins of the ER 
are selectively captured because they contain “ER export” signals as 
part of their cytosolic tail. These signals interact specifically with 
COPII proteins of the vesicle coat (FIGURE 8.26). Proteins selected 
by COPII-coated vesicles include (1) enzymes that act at later stages 
in the biosynthetic pathway, such as the glycosyltransferases of the 
Golgi complex (indicated as orange membrane proteins in Figure 
8.26), (2) membrane proteins involved in the docking and fusion of 
the vesicle with the target compartment, and (3) membrane proteins 
that are able to bind soluble cargo (such as the secretory proteins, 
indicated by the red spheres in Figure 8.26). Cells lacking a specific 
cargo receptor typically fail to transport a specific subset of proteins 
from the ER to the Golgi complex. For example, mutations in one 
cargo receptor (ERGIC-53) have been linked to an inherited bleeding 
disorder. Persons with this disorder fail to secrete certain coagula- 
tion factors that promote blood clotting. 
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FIGURE 8.25 Proposed movement of materials by vesicular 
transport between membranous compartments of the biosynthetic/ 
secretory pathway. (a) The three different types of coated vesicles 
indicated in this schematic drawing are thought to have distinct transport 
roles. COPII-coated vesicles mediate transport from the ER to the ERGIC 
and Golgi complex. COPI-coated vesicles return proteins from the 
ERGIC and Golgi complex to the ER. COPI-coated vesicles also transport 
Golgi enzymes between cisternae in a retrograde direction. Clathrin- 
coated vesicles mediate transport from the TGN to endosomes and 
lysosomes. Transport of materials along the endocytic pathway is not 
shown in this drawing. (b) Schematic drawing of the assembly of a 
COPII-coated vesicle at the ER. Assembly begins when Sar1 is recruited 
to the ER membrane and activated by exchange of its bound GDP with a 
bound GTP. These steps are shown in Figure 8.26. Cargo proteins of the 
ER lumen (red spheres and diamonds) bind to the luminal ends of 
transmembrane cargo receptors. These receptors are then concentrated 
within the coated vesicle through interaction of their cytosolic tails with 
components of the COPII coat. ER resident proteins (e.g., BiP) are 
generally excluded from the coated vesicles. Those that do happen to 
become included in a coated vesicle are returned to the ER as described 
later in the text. 
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FIGURE 8.26 Proposed roles of the COPII coat proteins in generating membrane curvature, assembling the protein coat, and capturing cargo. 

In step 1, Sar1-GDP molecules have been recruited to the ER membrane by a protein called a GEF (guanine-exchange factor) that catalyzes the exchange of 
the bound GDP with a bound GTP. In step 2, each Sar1-GTP molecule has extended a fingerlike a helix along the membrane within the cytosolic leaflet. This 
event expands the leaflet and induces the curvature of the lipid bilayer at that site. In step 3, a dimer composed of two COPII polypeptides (Sec23 and 
Sec24) has been recruited by the bound Sar1-GTP. The Sec23-Sec24 heterodimer is thought to further induce the curvature of the membrane in the 
formation of a vesicle. Both Sar1 and Sec23-Sec24 can bring about membrane curvature when incubated with synthetic liposomes in vitro. Transmembrane 
cargo accumulates within the forming COPII vesicle as their cytosolic tails bind to the Sec24 polypeptide of the COPII coat. Sec24 can exist in at least four 
different isoforms. It is likely that different isoforms of this protein recognize and bind membrane proteins with different sorting signals, thus broadening the 
specificity in types of materials that can be transported by COPII vesicles. In step 4, the remaining COPII polypeptides (Sec13 and Sec31) have joined the 
complex to form an outer structural scaffold of the coat. 


Source: Reprinted from Stephan Fath et al., Courtesy or Jonathan Goldberg, Cell 129:1333, 2007, with permission from Elsevier. 


Among the COPII coat proteins is a small G protein called Sar1, 
which is recruited specifically to the ER membrane. Like other 
G proteins, Sar1 plays a regulatory role, in this case by initiating vesi- 
cle formation and by regulating the assembly of the vesicle coat. 
These activities are illustrated in Figure 8.26. In step 1 of Figure 8.26, 
Sarl is recruited to the ER membrane in the GDP-bound form and 
is induced to exchange its GDP for a molecule of GTP. Upon binding 
of GTP, Sarl undergoes a conformational change that causes its 
N-terminal © helix to insert itself into the cytosolic leaflet of the ER 
bilayer (step 2). This event has been demonstrated to bend the lipid 
bilayer, which is an important step in the conversion of a flattened 
membrane into a spherical vesicle. Membrane bending is probably 
aided by a change in packing of the lipids that make up the two 
leaflets of the bilayer. In step 3, Sarl-GTP has recruited two additional 
polypeptides of the COPI coat, Sec23 and Sec24, which bind as a 
“banana-shaped” dimer. Because of its curved shape (Figure 8.27), 
the Sec23-Sec24 dimer provides additional pressure on the mem- 
brane surface to help it further bend into a curved bud. Sec24 also 
functions as the primary adaptor protein of the COPII coat that 
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of membrane proteins that are destined to traffic on to the Golgi 
complex. In Figure 8.26, step 4, the remaining subunits of the COPII 
coat, Secl3 and Sec31, bind to the membrane to form the outer 
structural cage of the protein coat. FIGURE 8.27a shows a representa- 
tion of a 40-nm vesicle with the COPII coat bound to its surface. The 
Sec13-Sec31 cage assembles into a relatively simple lattice in which 
each vertex is formed by the convergence of four Sec13-Sec31 legs 
(Figure 8.27). A certain degree of flexibility is built into the hinge 
between the Sec13-Sec31 subunits that allow them to form cages of 
varying diameter, thus accommodating vesicles of varying size. 

Once the entire COPI coat has assembled, the bud is separated 
from the ER membrane in the form of a COPII-coated vesicle. Before 
the coated vesicle can fuse with a target membrane, the protein coat 
must be disassembled and its components released into the cytosol. 
Disassembly is triggered by hydrolysis of the bound GTP to produce 
a Sarl-GDP subunit, which has decreased affinity for the vesicle 
membrane. Dissociation of Sarl-GDP from the membrane is fol- 
lowed by the release of the other COPIH subunits. 
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FIGURE 8.27 Structure of vesicle coats. (a) Molecular model of the outer Sec13-Sec31 cage of the COPII coat as it would assemble around the surface of 
a 40-nm “vesicle.” Each edge or leg of the lattice that makes up the cage consists of a heterotetramer (two Sec31 subunits seen as dark green and light 
green and two Sec13 subunits seen as orange and red). Four such legs converge to form each vertex of the lattice. Two copies of the Sar1-Sec23-Sec24 
complex (shown in red, magenta, and blue, respectively) that would form the inner layer of the COPII coat are also shown in this model. It can be seen how 
the inner surface of the Sec23-Sec24 complex conforms to the curvature of the vesicle. Inset shows a COPII lattice, which is comprised of triangular and 
square, pentagonal, and/or hexagonal faces. (b) Schematic drawing of the structures of clathrin, COPII and COPI coats. 


Source: (a) From Stephan Fath et al., Courtesy of Jonathan Goldberg, Cell 129:1333, 2007, by permission of Elsevier. (b) From S. Harrison and T. Kirchhausen, 


Nature 466:1048-1049, 2010, Reprinted by permission from Macmillan Publishers Ltd. 
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Escaped Proteins Back to the ER 


COPI-coated vesicles were first identified in experiments in which 
cells were treated with molecules similar in structure to GTP (GTP 
analogues) but, unlike GTP, they could not be hydrolyzed. Under 
these conditions, COPI-coated vesicles accumulated within the cell 
and could be isolated from homogenized cells by density gradient 
centrifugation (Section 18.6). COPI-coated vesicles accumulate in 
the presence of a nonhydrolyzable GTP analogue because, similar to 
their COPII counterparts, the coat contains a small membrane- 
bending GTP-binding protein, called Arfl, whose bound GTP must 
be hydrolyzed before the coat can disassemble. 

The COPI coat is made up of a complex, called coatamer, made 
up of seven proteins. Structural studies have suggested that, like 
clathrin (which is discussed in more detail in Section 8.17), the outer 
portion of COPI lattices may be built from three-legged triskelion 
subunits (Figure 8.27b). High-resolution structures of the COPI 
coat, however, have not yet been obtained. 

COPI-coated vesicles have been most clearly implicated in the 
retrograde transport of proteins, including the movement of (1) Golgi 
resident enzymes in a trans-to-cis direction (as indicated in Figure 8.23d, 
which shows a gold-labeled mannosidase I] molecule in a COPI 
vesicle) and (2) ER resident enzymes from the ERGIC and the Golgi 
complex back to the ER (Figure 8.25). To understand the role of 
COPI-coated vesicles in retrograde transport, we have to consider 
the retention and retrieval of resident ER proteins. 

If vesicles continually bud from membrane compartments, how 
does each compartment retain its unique composition? What deter- 
mines, for example, whether a particular protein in the membrane of 
the ER remains in the ER or proceeds on to the Golgi complex? 
Studies suggest that proteins are maintained in an organelle by a 
combination of two mechanisms: 


1. Retention of resident molecules that are excluded from transport 
vesicles. Retention may be based primarily on the physical prop- 
erties of the protein. For example, soluble proteins that are part of 
large complexes or membrane proteins with short transmem- 
brane domains are not likely to enter a transport vesicle. 


2. Retrieval of “escaped” molecules back to the compartment in which 
they normally reside. 


Proteins that normally reside in the ER, both in the lumen and 
in the membrane, contain short amino acid sequences at their C- 
terminus that serve as retrieval signals, ensuring their return to the 
ER if they should be accidentally carried forward to the ERGIC or 
Golgi complex. The retrieval of “escaped” ER proteins from these 
compartments is accomplished by specific receptors that capture 
the molecules and return them to the ER in COPI-coated vesicles 
(Figures 8.25, 8.28). Soluble resident proteins of the ER lumen (such 
as protein disulfide isomerase and the molecular chaperones that 
facilitate folding) typically possess the retrieval signal “lys-asp-glu- 
leu” (or KDEL in single-letter nomenclature). As shown in 
FIGURE 8.28, these proteins are recognized and returned to the ER 
by the KDEL receptor, an integral membrane protein that shuttles 
between the cis Golgi and the ER compartments. If the KDEL 
sequence is deleted from an ER protein, the escaped proteins are not 
returned to the ER but instead are carried forward through the 
Golgi complex. Conversely, when a cell is genetically engineered to 
express a lysosomal or secretory protein that contains an added 
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FIGURE 8.28 Retrieving ER proteins. Resident proteins of the ER 
contain amino acid sequences that lead to their retrieval from the Golgi 
complex if they are accidentally incorporated into a Golgi-bound transport 
vesicle. Soluble ER proteins bear the retrieval signal KDEL. Retrieval is 
accomplished as soluble ER proteins bind to KDEL receptors residing in the 
membranous wall of cis Golgi compartments. The KDEL receptors, in turn, 
bind to proteins of the COP! coat, which allows the entire complex to be 
recycled back to the ER. 


KDEL C-terminus, that protein is returned to the ER rather than 
being sent on to its proper destination. Membrane proteins that 
reside in the ER also have a retrieval signal that binds to the COPI 
coat, facilitating their return to the ER. The most common retrieval 
sequences for ER membrane proteins involve two closely linked 
basic residues, most commonly KKXX (where K is lysine and X is 
any residue). Each membrane compartment in the biosynthetic 
pathway may have its own retrieval signals, which helps explain 
how each compartment can maintain its unique complement of 
proteins despite the constant movement of vesicles in and out of that 
compartment. 


REVIEW | 


1. Contrast the roles of COPI- and COPIl-coated 
vesicles in protein trafficking. 


2. How do retrieval signals ensure that proteins are kept 


as residents of a particular membrane compartment? 
o—_—— 


8.11 Beyond the Golgi Complex: 
Sorting Proteins at the TGN 


Despite all the discussion of transport vesicles, we have yet to examine 
how a particular protein that has been synthesized in the ER is targeted 
toward a particular cellular destination. It is important that a cell be 
able to distinguish among the various proteins that it manufactures. 
A pancreatic cell, for example, has to segregate newly synthesized 
digestive enzymes that will be secreted into a duct, from newly synthe- 
sized cell-adhesion molecules that will ultimately reside in the plasma 
membrane, from lysosomal enzymes that are destined for lysosomes. 
This is accomplished as the cell sorts proteins destined for different 
sites into different membrane-bound carriers. The trans Golgi net- 
work (TGN), which is the last stop in the Golgi complex, functions as 
a major sorting station, directing proteins to various destinations. The 
best understood of the post-Golgi pathways is one that carries lysoso- 
mal enzymes. 


Sorting and Transport 
of Lysosomal Enzymes 


Lysosomal proteins are synthesized on membrane-bound ribosomes 
of the ER and carried to the Golgi complex along with other types 
of proteins. Once in the Golgi cisternae, soluble lysosomal enzymes 
are specifically recognized by enzymes that catalyze the two-step 
addition of a phosphate group to certain mannose sugars of the N- 
linked carbohydrate chains (FIGURE 8.29a). Thus, unlike other 
glycoproteins sorted at the TGN, lysosomal enzymes possess 
phosphorylated mannose residues, which act as sorting signals. This 
mechanism of protein sorting was discovered through studies on 
cells from humans that lacked one of the enzymes involved in phos- 
phate addition (discussed in the Human Perspective, Section 8.12). 
Lysosomal enzymes carrying a mannose 6-phosphate signal are rec- 
ognized and captured by mannose 6-phosphate receptors (MPRs), 
which are integral membrane proteins that span the TGN mem- 
branes (Figure 8.290). 


FIGURE 8.29 Targeting lysosomal enzymes to lysosomes. (a) Lysosomal 
enzymes are recognized by an enzyme in the cis cisternae that transfers a 
phosphorylated N-acetylglucosamine from a nucleotide sugar donor to one 
or more mannose residues of N-linked oligosaccharides. The glucosamine 
moiety is then removed in a second step by a second enzyme, leaving 
mannose 6-phosphate residues as part of the oligosaccharide chain. (b) 
Schematic diagram showing the pathways followed by a lysosomal enzyme 
(black) from its site of synthesis in the ER to its delivery to a lysosome. The 
mannose residues of the lysosomal enzyme are phosphorylated in the Golgi 
cisternae (step 1) and then selectively incorporated into a clathrin-coated 
vesicle at the TGN (step 2). The mannose 6-phosphate receptors are 
thought to have a dual role (step 3): They interact specifically with the 
lysosomal enzymes on the luminal side of the vesicle, and they interact 
specifically with adaptors on the cytosolic surface of the vesicle (shown in 
Figure 8.30). The mannose 6-phosphate receptors separate from the 
enzymes (step 4) and are returned to the Golgi complex (step 5). The 
lysosomal enzymes are delivered to an endosome (step 6) and eventually to 
a lysosome. Mannose 6-phosphate receptors are also present in the plasma 
membrane, where they capture lysosomal enzymes that are secreted into 
the extracellular space and return the enzymes to a pathway that directs 
them to a lysosome (step 7). 
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coated vesicles (which are the third and last type of coated vesicles to 
be discussed). The structure of clathrin-coated vesicles is described 
in detail in Figure 8.40 in connection with endocytosis, a process that 
is better understood than budding at the TGN. It will suffice at this 
point to note that the coats of these vesicles contain (1) an outer lat- 
tice composed of the protein clathrin, which forms a structural scaf- 
fold, and (2) an inner shell composed of protein adaptors, which 
covers the surface of the vesicle membrane that faces the cytosol 
(FIGURE 8.30). The term adaptor describes a molecule that physically 
links two or more components. Lysosomal enzymes are escorted 
from the TGN by a family of adaptor proteins called GGAs. 
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FIGURE 8.30 The formation of clathrin-coated vesicles at the TGN. 
Clathrin-coated vesicles that bud from the TGN contain GGA, an adaptor 
protein consisting of several distinct domains. One of the GGA domains 
binds to the cytosolic domains of membrane proteins, including those that 
will ultimately reside in the boundary membrane of the lysosome and also 
the MPR that transports lysosomal enzymes. Other GGA domains bind to 
Arf1 and to the surrounding cytosolic network of clathrin molecules. 


As indicated in the inset of Figure 8.30, a GGA molecule has sev- 
eral domains, each capable of grasping a different protein involved in 
vesicle formation. The outer ends of the GGA adaptors bind to clath- 
rin molecules, holding the clathrin scaffolding onto the surface of the 
vesicle. On their inner surface, the GGA adaptors bind to a sorting 
signal in the cytosolic tails of the mannose 6-phosphate receptors. The 
MPRs, in turn, bind to soluble lysosomal enzymes within the vesicle 
lumen (Figure 8.30, inset). As a result of these interactions with GGA 
adaptors, the MPRs in the TGN membrane and lysosomal enzymes 
within the TGN lumen become concentrated into clathrin-coated 
vesicles. As in the formation of COPI and COPII vesicles, production 
of clathrin-coated vesicles at the TGN begins with the recruitment to 
the membrane of a small GTP-binding protein, in this case Arfl, 
which sets the stage for the binding of the other coat proteins. Like 
Sarl (Figure 8.26), Arfl extends a membrane-bending © helix that 
acts in conjunction with adaptor proteins to bind clathrin and initiate 
the formation of the budding vesicle. After the vesicle has budded 
from the TGN, the clathrin coat is lost and the uncoated vesicle pro- 
ceeds to its destination, which may be an early endosome, late endo- 
some, or plant vacuole. Before they reach one of these organelles, the 
MPRs dissociate from the lysosomal enzymes and return to the TGN 
(Figure 8.29) for another round of lysosomal enzyme transport. 


Sorting and Transport of 
Nonlysosomal Proteins 


Lysosomal proteins are not the only materials that are exported from 
the TGN. As indicated in Figure 8.2, membrane proteins destined 
for the plasma membrane and secretory materials destined for 
export from the cell are also transported from the TGN, but the 
mechanisms are poorly understood. According to one model, mem- 
branous carriers are produced as the TGN fragments into vesicles 
and tubules of various size and shape. This concept fits with the cis- 
ternal maturation model, which proposes that the cisternae of the 
Golgi complex move continually toward the TGN, where they would 
have to disperse to allow continued maturation of the Golgi stack. 
Proteins that are discharged from the cell by a process of regulated 
secretion, such as digestive enzymes and hormones, are thought to 
form selective aggregates that eventually become contained in large, 
densely packed secretory granules. These aggregates are apparently 
trapped as immature secretory granules bud from the rims of the 
trans Golgi cisternae and TGN. In some cells, long tubules are seen 
to be pulled out of the TGN by motor proteins that operate along 
microtubular tracks. These tubules are then split into a number of 
vesicles or granules by membrane fission. Once they have departed 
from the TGN, the contents of the secretory granules become more 
concentrated. Eventually, the mature granules are stored in the cyto- 
plasm until their contents are released following stimulation of the 
cell by a hormone or nerve impulse. 

The targeted delivery of integral proteins to the plasma mem- 
brane appears to be based largely on sorting signals in the cytoplas- 
mic domains of the membrane proteins. Considerable research has 
focused on polarized cells, such as those depicted in Figure 8.11. In 
these cells, membrane proteins destined to reside in the apical por- 
tion of the plasma membrane contain different sorting signals from 
those destined for the lateral or basal portion. These two groups of 
plasma membrane proteins are clustered into different TGN mem- 
brane domains and transported to the cell surface in separate carriers. 
Plasma membrane proteins of nonpolarized cells, such as fibroblasts 
and white blood cells, may not require special sorting signals. Such 
proteins may simply be carried from the TGN to the cell surface in 
vesicles of the constitutive secretory pathway (Figure 8.2b). 


REVIEW | 


1. Describe the steps that ensure that a lysosomal 
enzyme will be targeted to a lysosome rather than a 


secretory vesicle. What is the role of GGA proteins? 
@ 


8.12 THE HUMAN PERSPECTIVE 


Disorders Resulting from Defects in Lysosomal Function 


Our understanding of the mechanisms by which proteins are 
targeted to particular organelles began with the discovery that 
the mannose 6-phosphate residues in lysosomal enzymes act 
as an “address” for delivery of these proteins to lysosomes. 


The discovery of the lysosome address was made in studies of 
patients with a rare and fatal inherited condition known as I-cell 
disease. Many cells in these patients contain lysosomes that are 
bloated with undegraded materials. Materials accumulate in the 


lysosomes because of the absence of hydrolytic enzymes. When 
fibroblasts from these patients were studied in culture, it was 
found that lysosomal enzymes are synthesized at normal levels 
but are secreted into the medium and not targeted to lysosomes. 
Further analysis revealed that the secreted enzymes lacked the 
mannose phosphate residues that are present on the lysosomal 
enzymes of cells from normal individuals. The I-cell defect was 
soon traced to the deficiency of an enzyme (N-acetylglucosamine 
phosphotransferase) required for mannose phosphorylation in 
the Golgi complex (see Figure 8.29a). 

In 1965, H. G. Hers of the University of Louvain in Belgium 
offered an explanation as to how the absence of a seemingly 
unimportant lysosomal enzyme, a-glucosidase, could lead to 
the development of a fatal inherited condition known as Pompe 
disease. Hers suggested that, in the absence of a-glucosidase, 
undigested glycogen accumulated in lysosomes, causing swell- 
ing of the organelles and irreversible damage to the cells and 
tissues. Diseases of this type, characterized by the deficiency of a 
single lysosomal enzyme and the corresponding accumulation of 
undegraded substrate (FIGURE 1), are called lysosomal storage 
disorders. Over 40 such diseases have been described, affecting 
approximately 1 in 5000 infants. Those diseases resulting from an 
accumulation of undegraded sphingolipids are listed in Table 1. 
The symptoms of lysosomal storage diseases can range from very 
severe to barely detectable, depending primarily on the degree 
of enzyme dysfunction. Several diseases have also been traced to 
mutations in lysosomal membrane proteins that impair the trans- 
port of substances from the lumen of the lysosome to the cytosol. 

Among the best studied lysosomal storage diseases is Tay- 
Sachs disease, which results from a deficiency of the enzyme 
B-N-hexosaminidase A, an enzyme that degrades the ganglioside 
Gyo (Figure 4.6). Gyz is a major component of the membranes 
of brain cells, and in the absence of the hydrolytic enzyme, the 
ganglioside accumulates in the bloated lysosomes of brain cells 
(Figure 1), causing dysfunction. In its severe form, which strikes 
during infancy, the disease is characterized by progressive mental 
and motor retardation, as well as skeletal, cardiac, and respiratory 
abnormalities. The disease is very rare in the general population 
but reached an incidence up to 1 in 3600 newborns among Jews 
of eastern European ancestry. The incidence of the disease has 
dropped dramatically in this ethnic population in recent years as 
the result of identification of carriers, genetic counseling of parents 
at risk, and prenatal diagnosis by amniocentesis. In fact, all of the 
known lysosomal storage diseases can be diagnosed prenatally. 

In the past few years, the prospects for treatment of lysosomal 
storage diseases have improved with the demonstration that the 
symptoms of Gaucher's disease, a deficiency of the lysosomal 
enzyme glucocerebrosidase, can be alleviated by enzyme 


TABLE 1 Sphingolipid Storage Diseases 


Disease 


Enzyme deficiency 


Principal storage substance 


FIGURE 1 Lysosomal storage disorders. Electron micrograph of a 
section through a portion of a neuron of a person with a lysosomal 
storage disease characterized by an inability to degrade Gyz ganglio- 
sides. These cytoplasmic vacuoles stain for both lysosomal enzymes and 
the ganglioside, indicating they are lysosomes in which undigested 
glycolipids have accumulated. 


Source: Courtesy of Kinuko Suzuki. 


replacement therapy. Infants with Gaucher's disease accumu- 
late large quantities of glucocerebroside lipids in the lysosomes 
of their macrophages, causing spleen enlargement and anemia. 
Initial attempts to correct the disease by infusing a solution of the 
normal human enzyme into the bloodstream were unsuccessful 
because the enzyme was taken up by liver cells, which are not 
seriously affected by the deficiency. To target macrophages, the 
enzyme was purified from human placental tissue and treated 
with three different glycosidases to remove terminal sugars on 
the enzyme’s oligosaccharide chains, which exposed underlying 
mannose residues (see Figure 8.22). Following infusion into the 
bloodstream, this modified enzyme (marketed under the name 
Cerezyme) is recognized by mannose receptors on the surface 
of macrophages and rapidly taken up by receptor-mediated 
endocytosis (Section 8.17). Because lysosomes are the natural 
target site of materials brought into the macrophage by endo- 
cytosis, the enzymes are efficiently delivered to the precise sites 


Consequences 


Gy, Gangliosidosis 


Tay-Sachs disease 
Fabry’s disease 
Sandhoff’s disease 
Gaucher's disease 


Niemann-Pick disease 
Farber’s lipogranulomatosis 


Krabbe’s disease 
Sulfatide lipidosis 


Gyn B-Galactosidase 


Hexosaminidase A 
a-Galactosidase A 
Hexosaminidases A and B 
Glucocerebrosidase 


Sphingomyelinase 
Ceramidase 


Galactocerebrosidase 
Arylsulfatase A 


Ganglioside Gy, 


Ganglioside Gy 


Trihexosylceramide 


Ganglioside Gyo 


Glucocerebroside 


Sphingomyelin 
Ceramide 


Galactocerebroside 


Sulfatide 


Mental retardation, liver enlargement, skeletal 
involvement, death by age 2 

Mental retardation, blindness, death by age 3 

Skin rash, kidney failure, pain in lower extremities 

Similar to Tay-Sachs disease but more rapidly progressing 

Liver and spleen enlargement, erosion of long bones, 
mental retardation in infantile form only 

Liver and spleen enlargement, mental retardation 

Painful and progressively deformed joints, skin nodules, 
death within a few years 

Loss of myelin, mental retardation, death by age 2 

Mental retardation, death in first decade 


and globoside 


continued 
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in the cell where the deficiency is manifested. Thousands of vic- 
tims of this disease have been successfully treated in this way. 
Enzyme replacement therapy for the treatment of several other 
lysosomal storage diseases has either been approved or is being 
investigated in clinical trials. (The movie Extraordinary Measures 
is based on the search for a replacement enzyme to treat Pompe 
disease.) Unfortunately, many of these diseases affect the central 
nervous system, which is unable to take up circulating enzymes 
because of the blood-brain barrier (page 248). 

An alternate approach, referred to as substrate reduction 
therapy, uses small-molecular-weight drugs to inhibit the synthesis 


of the substances that accumulate in the disease. Miglustat (trade 
name Zavesca), which partially inhibits glycosphingolipid biosyn- 
thesis, has been successfully used to treat Gaucher disease and 
Niemann-Pick type C disease. Finally, it can be noted that, although 
it is accompanied by considerable risk to the patient, bone marrow 
(or cord blood) transplantation has proven relatively successful in 
treating some of these diseases. It is thought that the foreign trans- 
planted cells, which contain normal copies of the gene in question, 
secrete a limited amount of the normal lysosomal enzyme. Some 
of these enzyme molecules are then taken up by the patient's own 
cells, which lessens the impact of the enzyme deficiency. 


8.13 Targeting Vesicles 
to a Particular Compartment 


Vesicle fusion requires specific interactions between different mem- 
branes. Vesicles from the ER, for example, fuse with the ERGIC or cis 
Golgi network and not with a trans cisterna. Selective fusion is one 
of the factors that ensures a directed flow through the membranous 
compartments of the cell. Despite a major research effort, we still do 
not fully understand the mechanisms by which cells target vesicles to 
particular compartments. It is thought that a vesicle contains spe- 
cific proteins associated with its membrane that govern the move- 
ments and fusion potential of that vesicle. To understand the nature 
of these proteins, we will consider the steps that occur between the 
stages of vesicle budding and vesicle fusion. 


1. Movement of the vesicle toward the specific target compartment. 
In many cases, membranous vesicles must move considerable 
distances through the cytoplasm before reaching their eventual 
target. These types of movement are mediated largely by micro- 
tubules and their associated motor proteins, which act analo- 
gously to a railroad system carrying cargo along defined pathways 
to predetermined destinations. The membranous carriers seen in 
Figure 8.19, for example, were observed to move from the ERGIC 
to the Golgi complex over microtubules. 


2. Tethering vesicles to the target compartment. The initial con- 
tacts between a transport vesicle and its target membrane, such 
as a Golgi cisterna, are thought to be mediated by a diverse col- 
lection of “tethering” proteins. Two groups of tethering proteins 
have been described (FIGURE 8.31a): rod-shaped, fibrous pro- 
teins that are capable of forming a molecular bridge between the 
two membranes over a considerable distance (50-200 nm) and 
large multiprotein complexes that appear to hold the two mem- 
branes in closer proximity. Different tethering proteins initiate 
fusion between different types of membranes. For example, one 
class of fibrous tethering proteins, called golgins, act in and 
around the Golgi complex. Among various proposed functions, 
golgins may serve as “tentacles” to reach out and capture trans- 
port vesicles carrying Golgi-bound cargo. Several different teth- 
ering complexes have been visualized with the electron 
microscope. Figure 8.31b shows tomographic images of the pre- 
synaptic zone of a mammalian nerve terminal. These 3D images 
reveal the presence of filamentous structures—presumably con- 
structed of tethering proteins—connecting synaptic vesicles both 
to one another and to the adjacent plasma membrane with which 
these vesicles would have fused. Much of the specificity between 


vesicle and target may be conferred by a family of small G pro- 
teins called Rabs, which cycle between an active GTP-bound 
state and an inactive GDP-bound state. GTP-bound Rabs associ- 
ate with membranes by a lipid anchor. With over 60 different Rab 
genes identified in humans, these proteins constitute the most 
diverse group of proteins involved in membrane trafficking. 
More importantly, different Rabs become associated with differ- 
ent membrane compartments. This preferential localization 
gives each compartment a unique surface identity, which is 
required to recruit the proteins involved in targeting specificity. 
In their GTP-bound state, Rabs play a key role in vesicle targeting 
by recruiting specific cytosolic tethering proteins to specific 
membrane surfaces (Figure 8.31a). Rabs also play a key role in 
recruiting numerous proteins involved in other aspects of mem- 
brane trafficking, including the motor proteins that move mem- 
branous vesicles through the cytoplasm (shown in Figure 9.52c). 


. Docking vesicles to the target compartment. At some point during 


the process leading to vesicle fusion, the membranes of the vesicle 
and target compartment come into close contact with one another 
as the result of an interaction between the cytosolic regions of inte- 
gral proteins of the two membranes. The key proteins that engage 
in these interactions are called SNAREs, and they constitute a 
family of more than 35 membrane proteins whose members are 
localized to specific subcellular compartments. Although SNAREs 
vary considerably in structure and size, all contain a segment in 
their cytosolic domain called a SNARE motif that consists of 60-70 
amino acids capable of forming a complex with another SNARE 
motif. SNAREs can be divided functionally into two categories, 
v-SNAREs, which become incorporated into the membranes of 
transport vesicles during budding, and t-SNAREs, which are 
located in the membranes of target compartments (Figure 8.31c). 
The best studied SNAREs are those that mediate docking of syn- 
aptic vesicles with the presynaptic membrane during the regulated 
release of neurotransmitters (page 163). In this case, the plasma 
membrane of the nerve cell contains two t-SNAREs, syntaxin and 
SNAP-25, whereas the synaptic vesicle membrane contains a sin- 
gle v-SNARE, synaptobrevin. The distribution of synaptobrevin 
molecules on a representative synaptic vesicle is shown in Figure 
8.31d. As the synaptic vesicle and presynaptic membrane approach 
one another, the SNARE motifs of t- and v-SNARE molecules 
from apposing membranes interact to form four-stranded bundles 
as shown in FIGURE 8.32a. Each bundle consists of four « helices, 
two donated by SNAP-25 and one each donated by syntaxin and 
synaptobrevin. These parallel « helices zip together to form a tightly 
interwoven complex that pulls the two apposing lipid bilayers into 


Fibrous 
tethering 


(a) Tethering 


(d) 


(c) Docking 


FIGURE 8.31 Proposed steps in the targeting of transport vesicles to target membranes. (a) According to this model, Rab proteins on the vesicle and 
target membrane are involved in recruiting tethering proteins that mediate initial contact between the two membranes. Two types of tethering proteins are 
depicted: highly elongated fibrous proteins (e.g., golgins and EEA1) and multiprotein complexes (e.g., the exocyst and TRAPP). (b) Electron tomographic 
image of a slice through a mammalian nerve terminus showing the network of synaptic vesicles that are present in close association with the presynaptic 
plasma membrane (Section 4.18). The left insets (corresponding to the white box on the right) show the presence of filamentous connectors between a pair of 
synaptic vesicles (upper inset) and between one of the synaptic vesicles and the adjacent plasma membrane (lower inset). Bars: main panel, 100 nm; insets, 
50 nm. (c) During the docking stage leading up to membrane fusion, a v-SNARE in the vesicle membrane interacts with the t-SNAREs in the target membrane 
to form a four-stranded a-helical bundle that brings the two membranes into intimate contact (see next figure). In the cases described in the text, SNAP-25, 
one of the t-SNAREs, is a peripheral membrane protein that is bound to the lipid bilayer by a lipid anchor rather than a transmembrane domain. SNAP-25 
contributes two helices to the four-helix SNARE bundle. (d) A model of a synaptic vesicle showing the distribution of only one of its constituent proteins, the 
v-SNARE synaptobrevin. The surface density and structures of the synaptobrevin molecules are based on calculations of the number of these proteins per 
vesicle and the known structure of the molecule. A complete portrait of the proteins on the surface of a synaptic vesicle is shown in the image in Figure 4.4b. 
Source: (b) From Rubén Fernandez-Busnadiego, et al., Courtesy of Wolfgang Baumeister, J. Cell Biol. 188, 147, 2010, reproduced with permission of 

The Rockefeller University Press; (d) From Shigeo Takamori et al., Cell 127:841, 2006, courtesy of Reinhard Jahn, with permission from Elsevier. 


very close association (Figures 8.31c and 8.32a). The formation of | 4. Fusion between vesicle and target membranes. When artificial 


similar four-stranded helical bundles occurs among other SNAREs 
at other sites throughout the cell, wherever membranes are destined 
to fuse. It is interesting to note that the SNAREs of the synaptic vesi- 
cle and presynaptic membrane are the targets of two of the most 
potent bacterial toxins; those responsible for botulism and tetanus. 
These deadly toxins act as proteases whose only known substrates 
are SNAREs. Cleavage of the neuronal SNAREs by the toxins blocks 
the release of neurotransmitters, which causes paralysis. 


lipid vesicles (liposomes) containing purified t-SNAREs are 
mixed with liposomes containing purified v-SNAREs, the two 
types of vesicles fuse with one another, but not with themselves. 
This finding indicates that interactions between t- and v-SNAREs 
are capable of pulling two lipid bilayers together with sufficient 
force to cause them to fuse (Figure 8.32b,c). However, a large body 
of evidence suggests that, although interaction between v- and 
t-SNAREs is required for membrane fusion, it is not sufficient by 
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Synaptic vesicle 


Synaptobrevin Transmembrane 


domain 


Syntaxin 


(d) 


FIGURE 8.32 A model of the interactions between v- and t-SNAREs leading to membrane 
fusion and exocytosis. (a) The synaptic vesicle has become docked to the plasma membrane 
through the formation of four-stranded bundles comprising o helices donated by syntaxin (red), 
synaptobrevin (blue), and SNAP-25 (green). SNAP-25 contributes two helices and lacks a 
transmembrane domain (yellow). (6) A speculative transition state in the fusion of the two 


membranes. A small water-filled cavity is shown at the center of the transmembrane helix 


Fusion pore 


L: 


itself to bring about fusion within a cell. According to the prevail- 
ing view regarding the regulated secretion of neurotransmitter 
molecules, the four-stranded SNARE bundle remains locked in 
an arrested state by interaction with accessory proteins. Vesicles 
at this stage remain docked at the membrane and ready to dis- 
charge their contents almost instantaneously once they receive an 
activating signal in the form of a rise in Ca** concentration (as 
discussed below). Regardless of how it is regulated, once the lipid 
bilayers of the two membranes merge, the SNAREs that previ- 
ously projected from separate membranes now reside in the same 
membrane (Figure 8.32c). Dissociation of the four-stranded 
SNARE complex is achieved by a doughnut-shaped, cytosolic 
protein called NSF that attaches to the SNARE bundle and, using 
energy from ATP hydrolysis, twists it apart. 


Now that we have described the events that occur during the 
fusion of a vesicle with a target membrane, we can return to the 
question: How is the specificity of this interaction determined? 
According to current consensus, the ability of a particular vesicle 
and target membrane to fuse is determined by the specific combina- 
tion of interacting proteins, including tethering proteins, Rabs, and 
SNAREs that can be assembled at that site in the cell. Taken together, 
these multiple interactions between several types of proteins provide 
a high level of specificity, ensuring that each membrane compart- 
ment can be selectively recognized. 


bundle. (c) The transmembrane helices that previously resided in the two separate membranes 
are now present in the same bilayer, and a fusion pore has opened between the vesicle and 
target membrane. The neurotransmitter contents of the vesicle can now be discharged by 
exocytosis. (a) Scanning electron micrograph of the extracellular surface of a pair of cultured 
alveolar (lung) cells stimulated to discharge proteins that had been stored in secretory granules. 
Material is seen being expelled from the cell through smooth, circular openings that are 
presumed to be dilated fusion pores. The arrow shows a fusion pore that has not dilated, which 
is readily distinguished from holes (arrowhead) accidentally formed during specimen preparation. 
Source: (a—c) Reprinted from Pehr A B Harbury, Structure 6:1490, 1998, with permission from 
Elsevier. (d) From Thomas Haller et al., J. Cell Biol. 155:286, 2001; Reproduced with permission 
of The Rockefeller University Press; 


REVIEW | 


1. What determines the specificity of interaction 
between a transport vesicle and the membrane 
compartment with which it will fuse? How are the 
SNARE proteins involved in the process of 
membrane fusion? 


8.14 Exocytosis 


The fusion of a secretory vesicle or secretory granule with the plasma 
membrane and subsequent discharge of its contents is called exocy- 
tosis. Exocytosis probably occurs on a rather continual basis in 
most cells as proteins and other materials are delivered to both the 
plasma membrane and extracellular space. However, the best stud- 
ied examples of exocytosis are those that occur during regulated 
secretion, most notably the release of neurotransmitters into the 
synaptic cleft. In these cases, membrane fusion produces an opening 
through which the contents of the vesicle or granule are released into 
the extracellular space. It was noted in Section 4.18 that the arrival of 
a nerve impulse at the terminal knob ofa neuron leads to an increase 
in the influx of Ca”* and the subsequent discharge of neurotransmit- 
ter molecules by exocytosis. In this case, fusion is regulated by a 


calcium-binding protein (synaptotagmin) present in the membrane 
of the synaptic vesicle. In other types of cells, exocytosis is generally 
triggered by release of Ca”* from cytoplasmic stores. Contact 
between the vesicle and plasma membranes is thought to lead to the 
formation of a small, protein-lined “fusion pore” (Figure 8.32c). 
Some fusion pores may simply re-close, but in most cases, the pore 
rapidly dilates to form an opening for discharge of the vesicle con- 
tents (Figure 8.32d ). Regardless of the mechanism, when a cytoplas- 
mic vesicle fuses with the plasma membrane, the luminal surface of 
the vesicle membrane becomes part of the outer surface of the 
plasma membrane, whereas the cytosolic surface of the vesicle mem- 
brane becomes part of the inner (cytosolic) surface of the plasma 
membrane (Figure 8.14). 


REVIEW | 


1. Concentration changes of what ion triggers 
exocytosis? From where do these ions originate in 
neurons and in other cell types? 


oe ——_ 


8.15 Lysosomes 


Lysosomes are an animal cell’s digestive organelles. A typical lysosome 
contains at least 50 different hydrolytic enzymes (Table 8.1) produced 
in the rough ER and targeted to these organelles. Taken together, lyso- 
somal enzymes can hydrolyze virtually every type of biological mac- 
romolecule. The enzymes of a lysosome share an important property: 
All have their optimal activity at an acid pH and thus are acid hydro- 
lases. The pH optimum of these enzymes is suited to the low pH 
of the lysosomal compartment, which is approximately 4.6. The 
high internal proton concentration is maintained by a proton pump 


TABLE 8.1 A Sampling of Lysosomal Enzymes 


Enzyme Substrate 


Phosphatases 


Acid phosphatase 
Acid phosphodiesterase 
Nucleases 
Acid ribonuclease 
Acid deoxyribonuclease 
Proteases 
Cathepsin 
Collagenase 
GAG-hydrolyzing enzymes 
Iduronate sulfatase 
B-Galactosidase 
Heparan N-sulfatase 
a-N-Acetylglucosaminidase 
Polysaccharidases and oligosaccharidases 
a-Glucosidase 
Fucosidase 
at-Mannosidase 
Sialidase 
Sphingolipid hydrolyzing enzymes 
Ceramidase 
Glucocerebrosidase 
B-Hexosaminidase 
Arylsulfatase A 
Lipid hydrolyzing enzymes 
Acid lipase 
Phospholipase 


Phosphomonoesters 
Phosphodiesters 


RNA 
DNA 


Proteins 
Collagen 


Dermatan sulfate 
Keratan sulfate 
Heparan sulfate 
Heparan sulfate 


Glycogen 
Fucosyloligosaccharides 
Mannosyloligosaccharides 
Sialyloligosaccharides 


Ceramide 
Glucosylceramide 
Gyn ganglioside 
Galactosylsulfatide 


Triacylglycerols 
Phospholipids 


(a V-type H*-ATPase, page 154) present in the organelles boundary 291 


membrane. Lysosomal membranes also contain a variety of highly 
glycosylated integral proteins whose carbohydrate chains are thought 
to form a protective lining that shields the membrane from attack by 
the enclosed enzymes. 

Although lysosomes house a predictable collection of enzymes, 
their appearance in electron micrographs is neither distinctive nor 
uniform. FIGURE 8.33 shows a small portion of a Kupffer cell, a 
phagocytic cell in the liver that engulfs aging red blood cells. The 
lysosomes of a Kupffer cell exhibit an irregular shape and variable 
electron density, illustrating how difficult it is to identify these orga- 
nelles on the basis of morphology alone. 

The presence within a cell of what is, in essence, a bag of destruc- 
tive enzymes suggests a number of possible functions. The best stud- 
ied role of lysosomes is the breakdown of materials brought into the 
cell from the extracellular environment. Many single-celled organ- 
isms ingest food particles, which are then enzymatically disassembled 
in a lysosome. The resulting nutrients pass through the lysosomal 
membrane into the cytosol. In mammals, phagocytic cells, such as 
macrophages and neutrophils, function as scavengers that ingest 
debris and potentially dangerous microorganisms (Section 8.20). 
Ingested bacteria are generally inactivated by the low pH of the lyso- 
some and then digested enzymatically. As shown in Figure 17.24, pep- 
tides produced by this digestive process are “posted” on the cell surface 
where they alert the immune system to the presence of a foreign agent. 

Lysosomes also play a key role in organelle turnover, that is, the 
regulated destruction of the cell’s own organelles and their replacement. 


FIGURE 8.33 Lysosomes. Portion of a phagocytic Kupffer cell of the liver 
showing at least 10 lysosomes of highly variable size. 

Source: From Hans Glaumann et al., J. Cell Biol. 67:887, 1975; Reproduced 
with permission of The Rockefeller University Press. 
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FIGURE 8.34 Autophagy. Electron micrograph of a mitochondrion and 
peroxisome enclosed in a double membrane wrapper. This autophagic 
vacuole (or autophagosome) would have fused with a lysosome and its 
contents digested. 


Source: Don Fawcett and Daniel Friend/Photo Researchers Inc. 


During this process, which is called autophagy, an organelle, such as 
the mitochondrion shown in FIGURE 8.34, is surrounded by a double 
membrane structure (or phagophore) to produce a double-membrane 
sequestering vesicle called an autophagosome (FIGURE 8.35). In mam- 
malian cells, autophagosomes are thought to form de novo from con- 
tact sites between the ER and mitochondrial outer membranes. Once 
formed, the outer membrane of the autophagosome fuses with a lyso- 
some, generating a structure called an autolysosome, in which both the 
inner membrane of the autophagosome and the enclosed contents are 
degraded. The products of these degradative reactions are made avail- 
able to the cell. It is estimated that 1 to 1.5 percent of the proteins 
within a healthy liver cell are degraded via autophagy per hour as part 
of a normal process of cellular renovation. Enclosure within 
autophagosomal vesicles is thought to be relatively selective rather 
than simply a random process. Autophagy probably evolved in early 
eukaryotic organisms as a response to nutrient deprivation. If a popu- 
lation of cells—whether derived from yeast, plants, or animals—is 
placed under starvation conditions, a marked increase in autophagy is 
observed. Under these conditions, cells acquire energy to maintain 
their life by cannibalizing their own organelles. The same autophagic 
process is induced in the cells of a mammalian embryo before it has 
implanted in the uterus and thus before it can take advantage of nutri- 
ents delivered via the mother’s bloodstream. 

Our understanding of the importance of autophagy in animals 
has coincided with the discovery and analysis of a host of genes 
(known as Atg genes) that are required for various steps in the 
autophagic pathway. Deletion of certain of these autophagy genes 
can have serious consequences for embryonic development or adult 
physiology of model organisms, whether it is a worm or a mouse. It 
has been shown, for example, that autophagy helps protect an organ- 
ism against intracellular threats ranging from abnormal protein 
aggregates to invading bacteria and parasites. Autophagy has also 
been implicated in the prevention of neurodegeneration. Ifautophagy 
is blocked in a particular portion of the brain of a laboratory animal, 
that region of the nervous system experiences massive loss of nerve 
cells. These findings reveal the importance of autophagy in protect- 
ing brain cells from the continuous damage to proteins and orga- 
nelles that is experienced by these long-lived cells. Autophagy may 
even play a role in the prevention of certain types of cancers and 
slowing the body’s aging process. 


-.°_ Exocytosis 


Phagophore 


Mitochondria Lm 


FIGURE 8.35 A summary of the autophagic pathway. The steps are 
described in the text. 


The role of lysosomes in autophagy is summarized in Figure 
8.35. Once the digestive process in the autolysosome has been com- 
pleted, the organelle is termed a residual body. Depending on the 
type of cell, the contents of the residual body may be eliminated 
from the cell by exocytosis, or they may be retained within the cyto- 
plasm indefinitely as a lipofuscin granule. Lipofuscin granules 
increase in number as an individual becomes older; accumulation is 
particularly evident in long-lived cells such as neurons, where these 
granules are considered a major characteristic of the aging process. 
The role of lysosomes in various diseases is discussed in the accom- 
panying Human Perspective. 


REVIEW | 


1. Describe three distinct roles of lysosomes. 


2. Describe the events that occur during the autophagic 
destruction of a mitochondrion. 


eo 


8.16 Plant Cell Vacuoles 


As much as 90 percent of the volume of many plant cells is occupied 
byasinglemembrane-bound, fluid-filled central vacuole (FIGURE 8.36). 
While simple in structure, plant vacuoles carry out a wide spectrum 
of essential functions. Many of a cell’s solutes and macromolecules, 
including ions, sugars, amino acids, proteins, and polysaccharides, 
are stored temporarily in the vacuole. Vacuoles may also store a 
host of toxic compounds. Some of these compounds (such as 
cyanide-containing glycosides and glucosinolates) are part of an 


ae Vacuole) 
= (Chloroplast 


(a) 


Cytoplasm 


Vacuole 


Nucleus 


(b) 


FIGURE 8.36 Plant cell vacuoles. (a) Each of the cylindrical leaf cells of the aquatic plant Elodea contains a large central vacuole surrounded by a layer of 
cytoplasm containing the chloroplasts that are visible in the micrograph. (b) Transmission electron micrograph of spinach leaf mesophyll cells showing the 


large central vacuole and thin layer of surrounding cytoplasm. 


Source (a) M. |. Walker/Photo Researchers, Inc.; (6) Biophoto Associates/Photo Researchers, Inc. 


arsenal of chemical weapons that are released when the cell is injured 
by an herbivore or fungus. Other toxic compounds are simply the by- 
products of metabolic reactions; because plants lack the type of excre- 
tory systems found in animals, they utilize their vacuoles to isolate 
these by-products from the rest of the cell. Some of these compounds, 
such as digitalis, have proven to have important clinical value. 

The membrane that bounds the vacuole, the tonoplast, con- 
tains a number of active transport systems that pump ions into the 
vacuolar compartment to a concentration much higher than that in 
the cytoplasm or the extracellular fluid. Because of its high ion con- 
centration, water enters the vacuole by osmosis. Hydrostatic (turgor) 
pressure exerted by the vacuole not only provides mechanical sup- 
port for the soft tissues of a plant (Figure 4.36), it also stretches the 
cell wall during cell growth. 

Plant vacuoles are also sites of intracellular digestion, not unlike 
lysosomes, which are absent in plants. In fact, plant vacuoles have 
some of the same acid hydrolases found in lysosomes. The pH of the 
vacuole is maintained at a low value by a V-type H*-ATPase within 
the tonoplast that pumps protons into the vacuolar fluid. Like the 
proteins of the lysosome, many of the proteins of a plant vacuole are 
synthesized on membrane-bound ribosomes of the RER, trans- 
ported through the Golgi complex, and sorted at the trans face of the 
Golgi before being targeted to the vacuole. 


REVIEW | 


1. Describe three distinct roles of plant vacuoles. 


2. In what ways is a plant vacuole similar to a lysosome? 
In what ways is it different? 


e—_____ 


8.17 Endocytosis 


We have described at length how a cell transports materials from the 
RER and Golgi to the plasma membrane and extracellular space. We 
can now turn to the movement of materials in the opposite direc- 
tion. We considered in Chapter 4 how low-molecular-weight solutes 
pass through the plasma membrane. But how are cells able to take in 
materials that are too large to penetrate this membrane regardless of 
its permeability properties? And how are proteins that reside in the 
plasma membrane recycled to the cell’s internal compartments? 
Both of these requirements are met by the endocytic pathway, in 
which segments of the plasma membrane invaginate to form cyto- 
plasmic vesicles that are transported into the cell interior. We will 
consider two basic processes in this section of the chapter, endocyto- 
sis and phagocytosis, which occur by different mechanisms. 
Endocytosis is primarily a process by which the cell internalizes 
cell-surface receptors and bound extracellular ligands. Phagocytosis 
describes the uptake of particulate matter. 

Endocytosis can be divided broadly into two categories: bulk-phase 
endocytosis and receptor-mediated endocytosis. Bulk-phase endocytosis 
(also known as pinocytosis) is the nonspecific uptake of extracellular flu- 
ids. Any molecules, large or small, that happen to be present in the 
enclosed fluid also gain entry into the cell. Bulk-phase endocytosis can 
be visualized by adding a substance to the culture medium, such as the 
dye lucifer yellow or the enzyme horseradish peroxidase, that is taken 
up by cells nonspecifically. Bulk-phase endocytosis also removes por- 
tions of the plasma membrane and may function primarily in the recy- 
cling of membrane between the cell-surface and interior compartments. 
In contrast, receptor-mediated endocytosis (RME), which is also 
called clathrin-mediated endocytosis, brings about the uptake of specific 
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FIGURE 8.37 Receptor-mediated endocytosis. This sequence of micrographs shows the steps in the uptake of yolk lipoproteins by the hen oocyte. 

| (a) The proteins to be taken into the cell are concentrated on the extracellular surface of an indented region of the plasma membrane, forming a coated pit. 
The cytosolic surface of the plasma membrane of the coated pit is covered with a layer of bristly, electron-dense material containing the protein clathrin. 

_ (b) The coated pit has sunk inward to form a coated bud. (c) The plasma membrane is about to pinch off as a vesicle containing the yolk protein on its 
luminal (previously extracellular) surface and clathrin on its cytosolic surface. (d) A coated vesicle that is no longer attached to the plasma membrane. The 


next step in the process is the release of the clathrin coat. 


Source: (a-d) From M. M. Perry and A. B. Gilbert, J. Cell Science 39:257, 1979, by permission of The Company of Biologists, Ltd. http://jcs.biologists.org/ 


content/39/1/257 full. pdf+html?sid=2493468d-1f02-467 a-a192-c2c876bd98b9. 


extracellular macromolecules (ligands) following their binding to 
receptors on the external surface of the plasma membrane.” 


Receptor-Mediated Endocytosis 
and the Role of Coated Pits 


Receptor-mediated endocytosis provides a means for the selective 
and efficient uptake of macromolecules that may be present at rela- 
tively low concentrations in the extracellular fluid. Cells have 


5 Over the years, several distinct mechanisms of receptor-mediated endocytosis have 


_ been described, although some of these proposed mechanisms remain controversial. 


For our purpose, we will discuss only one mechanism of endocytosis—that which is 
accomplished by means of clathrin-containing coats, the same type of protein coat 
discussed in Section 8.11 in connection with vesicles formed at the TGN. (Discussions 
of clathrin-independent endocytosis can be found in Ann. Rev. Biochem. 78, 857, 2009; 
J. Cell Sci. 122: 1713, 2009; and Curr. Opin. Cell Biol. 22, 519, 2010 and 23:413, 2011.) 


FIGURE 8.38 Coated pits. (a) Electron micrograph of a 
replica formed on the extracellular surface of a whole, 
freeze-dried fibroblast that had been incubated with 
LDL-cholesterol. Particles of LDL-cholesterol are visible as 
spherical structures located on the extracellular surface of 

the coated pit. (b) Electron micrograph of a replica of the 
cytosolic surface of a coated bud showing the invaginated 
plasma membrane surrounded by a clathrin lattice that has 
assumed a hemispheric shape. 

Source: (a—b) From John Heuser and Louise Evans, J. Cell Biol. 
84:568, 1980; Reproduced with permission of The Rockefeller 
University Press. 


receptors for the uptake of many different types of ligands, including 
hormones, growth factors, enzymes, and bloodborne proteins carry- 
ing certain nutrients. Substances that enter a cell by means of clath- 
rin-mediated RME become bound to receptors that collect in 
specialized domains of the plasma membrane, known as coated 
pits. Receptors are concentrated in coated pits at 10-20 times their 
level in the remainder of the plasma membrane. Coated pits 
(FIGURE 8.37a) are recognized in electron micrographs as sites where 
the surface is indented and the plasma membrane is covered on its 
cytoplasmic face by a bristly, electron-dense coat containing clathrin. 
Coated pits invaginate into the cytoplasm (Figure 8.37b) and then 
pinch free of the plasma membrane to form coated vesicles (Figure 
8.37c,d ). To understand the mechanism of coated vesicle formation, 
we need to examine the molecular structure of the clathrin coat. 
FIGURE 8.38 shows a coated pit as seen from the extracellular and 
cytoplasmic surfaces of the plasma membranes of cells that had been 


0.1 um 


(b) 


overlapping arrangement of the triskelions within the clathrin scaffold 295 
of a coated vesicle is shown in FIGURE 8.40. Each leg of a clathrin 
triskelion extends outward along two edges of a polygon. The clathrin 
molecules overlap in such a way that each vertex of a polygon con- 
tains a center of one of the component triskelions (Figure 8.40c). 
Like the clathrin-coated vesicles that bud from the TGN (Figure 
8.29), the coated vesicles that form during endocytosis also contain a 
layer of adaptors situated between the clathrin lattice and the surface 
of the vesicle facing the cytosol. The best studied adaptor operating 
in connection with clathrin-mediated endocytosis is AP2. Unlike the 
GGA adaptors utilized at the TGN, which consist of a single subunit 
with several domains (Figure 8.30), the AP2 adaptors that become 
incorporated into the vesicles that bud from the plasma membrane 
contain multiple subunits having different functions (Figure 8.40). 
The p subunit of AP2 adaptors engages the cytoplasmic tails of 
specific plasma membrane receptors, leading to the concentration of 
these selected receptors—and their bound cargo molecules—into the 
emerging coated vesicle (discussed further in the Experimental 
Pathways, Section 8.18). In contrast, the B-adaptin subunit of AP2 
adaptors binds and recruits the clathrin molecules of the overlying 
lattice. Figure 8.27b illustrates some of the marked structural differ- 
ences and similarities between the coats of COPII-, COPI- and 
clathrin-coated vesicles. All three coats contain two distinct layers: an 
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FIGURE 8.39 Clathrin triskelions. Electron micrograph of a metal- 
shadowed preparation of clathrin triskelions. Inset shows the triskelion is 
composed of three heavy chains. The inner portion of each heavy chain is 
linked to a smaller light chain. 


Source: Reprinted with permission from Macmillan Publishers Ltd: Ernst 
Ungewickell and Daniel Branton, Nature 289:421, 1981; © 1981. 


engaged in receptor-mediated endocytosis. When viewed from its 
cytoplasmic surface (Figure 8.38b), the bristly coat appears as a net- 


work of polygons (hexagons and pentagons) resembling a soccer ball. N- terminal 
The geometric construction of the coat is derived from the structure nook ot 
of its clathrin building blocks. Each clathrin molecule consists of ae 
three heavy chains and three light chains, joined together at the center chain 


to form a three-legged assembly called a triskelion (FIGURE 8.39). The 


FIGURE 8.40 Molecular organization of a coated vesicle. (a) Schematic drawing 
of the surface of a coated vesicle showing the arrangement of triskelions and 
adaptors in the outer clathrin coat. The sides of the polygons are formed by parts 
of the legs of overlapping triskelions. The N-terminus of each clathrin heavy chain 6 adaptin 6 Chain 
forms a “hook” that projects toward the surface of the membrane where it engages 
an adaptor. Each adaptor, which consists of four different polypeptide subunits, can 


a adaptin 


bind a diverse array of accessory proteins that are not shown in this illustration. Both ee ee Adaptor protein 
the hooks and adaptors are situated at the vertices of the polyhedrons. (Note: Not all Clathrin triskelion (AP2) complex 
of the triskelions of the lattice are shown in this figure; if they were, every vertex (a) 

would have a clathrin hub, hook, and associated adaptor.) (b) Schematic drawing of 

a cross section through the surface of a coated vesicle showing the interactions of Endocytic Extracellular 

the AP2 adaptor complexes with both the clathrin coat and membrane receptors. cargo 


Recruitment of AP2 adaptors to the plasma membrane is facilitated by the presence 
of PI(4,5)P2 molecules in the inner (cytosolic) leaflet of the membrane as shown in 
Figure 8.42. Each receptor is bound to a ligand being internalized. (c) Reconstruction 
of a clathrin cage containing 36 triskelions showing the overlapping arrangement of 
several of these trimeric molecules (shown in different colors). 


Receptor 


AP2 
adaptor 
Source: (a) Annual Review of Biochemistry. Volume 65, 1996 by Richardson, Charles C., 


Abelson, John N., Raetz, Christian R. H. copyright 1997 Reproduced with permission 
of Annual Reviews, Inc. In the format textbook via copyright clearance center; 

(b-c) From Alexander Fotin et al., Nature 432:574, 2004, courtesy of Stephen C. Harrison; 
© 2004. Reprinted with permission from Macmillan Publishers Ltd. 


296 outer geometric scaffold and an inner layer of adaptor (or adaptor- 
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like) proteins. The structure of the outer scaffolds, however, are very 
different; the subunits of the clathrin lattice (three-legged clathrin 
complexes) overlap extensively, whereas those of the COPII lattice 
(rod-like Sec13-Sec31 complexes) do not overlap at all. In addition, 
each vertex of a COPII coat is formed by four edges rather than three 
as in a clathrin coat (and as is suspected in the COPI coat). 

The structure depicted in Figure 8.40 is a greatly simplified ver- 
sion of an actual coated vesicle, which may contain upwards of two 
dozen different accessory proteins that form a dynamic network of 
interacting molecules. These proteins have roles in cargo recruitment, 
coat assembly, membrane bending and invagination, interaction with 
cytoskeletal components, vesicle release, and membrane uncoating. 
One of the best studied of these accessory proteins is dynamin. 

Dynamin is a large GTP-binding protein that is required for the 
fission of the vesicle from the membrane on which it forms. Dynamin 
self-assembles into a helical collar around the neck of an invaginated 
coated pit (FIGURE 8.41), just before it pinches off from the membrane. 
In the model depicted in Figure 8.41a, steps 3-4, hydrolysis of the 
bound GTP by the polymerized dynamin molecules induces a 
motion in the dynamin helix that severs the coated vesicle from the 
plasma membrane. According to this mechanism, dynamin acts as 
an enzyme capable of utilizing the chemical energy of GTP to gener- 
ate mechanical forces. Dissociation of the clathrin coat from the 
vesicle requires the help of an additional protein, the ATPase Hsc70, 
which is recruited to the clathrin coat by a cofactor, auxilin. 


The Role of Phosphoinositides in the 
Regulation of Coated Vesicles 


Although this discussion emphasizes the protein molecules of the 
coat and vesicle, the phospholipids of the vesicle membrane also play 
an important role. As discussed in Chapter 15, phosphate groups can 
be added to different positions of the sugar ring of the phospholipid 
phosphatidylinositol (PI), converting them into phosphoinositides 
(see Figure 15.10). Seven distinct phosphoinositides are identified: PI(3)P, 
PI(4)P, PI(5)P, PI(3,4)P,, PI(4,5)P,, PI(3,5)P,, and PI(3,4,5)P3. 
The phosphorylated rings of these phosphoinositides reside at the 
surface of the membrane where they can be recognized and bound by 
particular proteins. Different phosphoinositides are concentrated in 
different membrane compartments, which helps give each compart- 
ment a unique “surface identity.’ The inner leaflet of the plasma mem- 
brane, for example, tends to contain elevated levels of PI(4,5)P,, which 
plays an important role in recruiting proteins involved in clathrin- 
mediated endocytosis, such as dynamin and AP2 (see Figure 8.40b). 
Not only can a phosphoinositide aid in recruiting specific proteins, it 
can also induce conformational changes in a bound protein that facil- 
itate a molecular process. The clathrin adaptor AP2, for example, nor- 
mally exists in the cytosol in a locked conformation. Binding of the 
AP2 complex to PI(4,5)P, sites in the plasma membrane induces a 
major conformational change in AP2 that opens a cargo binding site 
within the adaptor, allowing it to interact with the cytoplasmic tails of 
specific membrane receptors. A structural model depicting this series 
of events is shown in FIGURE 8.42. Phosphoinositide levels also may 
play a role in clathrin lattice disassembly. Phosphatase activity on the 
newly budded vesicle has been proposed to cause the vesicle mem- 
brane to become rich in PI(4)P relative to PI(4,5)P5, which is thought 
to result in a burst of auxilin recruitment to the vesicle, leading to the 
rapid disassembly of the clathrin coat. 
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FIGURE 8.41 The role of dynamin in the formation of clathrin-coated 
vesicles. (a) The clathrin lattice of the coated pit (step 1) undergoes 
rearrangement to form an invaginated vesicle connected to the overlying 
plasma membrane by a stalk (step 2). At this point, the dynamin subunits, 
which are concentrated in the region, undergo polymerization to form a 
ring around the stalk (step 3). Changes in the conformation of the ring, 
which are thought to be induced by GTP hydrolysis (step 4), lead to fission 
of the coated vesicle from the plasma membrane and disassembly of the 
dynamin ring (step 5a). If vesicle budding occurs in the presence of GTPyS, 
a nonhydrolyzable analogue of GTP, dynamin polymerization continues 
beyond formation of a simple collar, producing a narrow tubule constructed 
from several turns of the dynamin helix (step 5b). Structural models of 
dynamin action can be found in Nature 477:556, 561, 2011. (b) Electron 
micrograph showing a coated vesicle forming in the presence of GTPyS, 
which corresponds to the stage depicted in step 5b of part a. 

Source: (a) Reprinted From P. De Camilli et al. Current Opin Neuro-Biol. 
Vol 5, P. 562, 1995, with permission from Elsevier. (6) From Kohji Takei 

et al., Nature Vol. 374, Cover 3/9/95; © 1995, reprinted by permission 

of Macmillan Publishers Ltd. 


A lipid species such as PI(4,5)P, can have a dynamic regulatory 
role because it can be rapidly formed and destroyed by enzymes that 
are localized at particular places and times within the cell. In the 
example of endocytosis, PI(4,5)P, disappears from a site of endocyto- 
sis about the time the coated vesicle is pinched away from the plasma 
membrane. Other PIs that are thought to serve as “landmarks” within 
the secretory/endocytic pathways include PI(3)P localized at early 
endosomes and intraluminal vesicles of late endosomes; PI(4)P local- 
ized at the TGN, secretory granules, and synaptic vesicles; and P1(3,5) 
P, localized at the late endosome boundary membrane. 


REVIEW | 


1. Describe the molecular structure of clathrin and the 


relationship between its structure and function. 
@O@ 


FIGURE 8.42 A structural model depicting the changes in 
protein conformation that occur upon AP2 binding to the 
plasma membrane. Binding of the AP2 adaptor is mediated 
by its interaction with PI(4,5)Pz molecules in the inner leaflet Attraction to 5 

ieee membrane 
of the plasma membrane (step 1). Binding of the adaptor to R 
the membrane induces a large conformational change in the 
adaptor (step 2) that facilitates its interaction with specific 
motifs in the cytoplasmic tails of certain membrane receptors 
(shown in yellow and orange) (step 3). 


Ptdins4,5P,, - rich membrane 


Membrane 


Cargo binding 


Source: From L. P. Jackson, et. al., Courtesy of David J. Owen, binding 
Cell 141;1228, 2010, Fig. 7.© 2010, reprinted by permission & opening 
from Eléevier. Locked conformation Open conformation 


8.18 EXPERIMENTAL PATHWAYS 


Receptor-Mediated Endocytosis 


Embryonic development begins with the fusion of a minuscule 
sperm and a much larger egg. Eggs develop from oocytes, which 
accumulate yolk that has been synthesized elsewhere in the body 
of the female. How are the high-molecular-weight yolk proteins 
able to enter the oocyte? In 1964, Thomas Roth and Keith Porter 
of Harvard University reported on the mechanism by which yolk 
proteins might be taken into the oocytes of mosquitoes.’ They 
noticed that during stages of rapid oocyte growth, there was a 
dramatic increase in the number of pitlike depressions seen on 
the surface of the oocyte. The pits, which were formed by invagi- 
nations of the plasma membrane, were covered on their inner sur- 
face by a bristly coat. In a farsighted proposal, Roth and Porter 
postulated that yolk proteins were specifically adsorbed onto the 
outer surface of the membrane of the coated pits, which would 
then invaginate as coated vesicles. The coated vesicles would lose 
their bristly coats and fuse with one another to produce the large, 
membrane-bound yolk bodies characteristic of the mature oocyte. 

The first insight into the structure of coated vesicles was 
obtained in 1969 by Toku Kanaseki and Ken Kadota of the University 
of Osaka.” Electron microscopic examination of a crude vesicle 
fraction isolated from guinea pig brains showed that coated 
vesicles were covered by a polygonal basketwork (FIGURE 1). These 


FIGURE 1 (a) Ahandmade model of an “empty basket” that would 
form the surface lattice of a coated vesicle. (b) A high-power electron 
micrograph of an empty proteinaceous basket. The numbers 5 and 6 
refer to pentagonal and hexagonal elements of the lattice, respectively. 
Source: From Toku Kanaseki and Ken Kadota, J. Cell Biol. 42:216, 1969; 
Reproduced with permission of The Rockefeller University Press. 


investigators suggested that the coatings were an apparatus to 
control the infolding of the plasma membrane during vesicle 
formation. 

The first studies of the biochemical nature of the vesicle coat 
were published in 1975 by Barbara Pearse of the Medical Research 
Council in Cambridge, England.’ Pearse developed a procedure 
in which membrane vesicles from pig brains were centrifuged 
through a succession of sucrose density gradients until a purified 
fraction of coated vesicles was obtained. Protein from the coated 
vesicles was solubilized and fractionated by SDS—polyacrylamide 
gel electrophoresis (SDS-PAGE, Section 18.7). The results indi- 
cated that the coat contained one predominant protein species 
having a molecular mass of approximately 180,000 daltons. 
Pearse named the protein clathrin.4 

As the research described above was being conducted, an 
apparently independent line of research was begun in 1973 in 
the laboratories of Michael Brown and Joseph Goldstein of the 
University of Texas Medical School in Dallas. Brown and Goldstein 
had become interested in the inherited condition familial hyper- 
cholesterolemia (FH). People who were homozygous for the 
defective gene (the FH allele) had profoundly elevated levels of 
serum cholesterol (800 mg/dl vs. 200 mg/dl for a normal person), 
invariably developed severely blocked (atherosclerotic) arteries, 
and usually died from heart attack before the age of 20. At the 
time, very little was known about the fundamental physiologic or 
biochemical defects in this disorder. 

Brown and Goldstein began their studies of FH by exam- 
ining cholesterol metabolism in cultured fibroblasts derived 
from the skin of normal and FH-afflicted individuals. They found 
that the rate-controlling enzyme in cholesterol biosynthesis, 
HMG CoA reductase, could be inhibited in normal fibroblasts 
by cholesterol-containing lipoproteins (such as LDL) placed 
in the medium (FIGURE 2).° Thus, the addition of LDL to the 
culture medium in which normal fibroblasts were growing led 
to a decreased level of activity of HMG CoA reductase and a 
corresponding decrease in the synthesis of cholesterol by the 
fibroblasts. When HMG CoA reductase levels were measured 
in FH-derived fibroblasts, they were found to be 40 to 60 times 
that of normal fibroblasts.° In addition, enzyme activity in the FH 
fibroblasts was totally unaffected by the presence of LDL in the 
medium (FIGURE 3). 

How could lipoproteins in the medium affect the activity 
of an enzyme in the cytoplasm of cultured cells? To answer this 

continued 
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FIGURE 2 HMG CoA reductase activity in normal fibroblasts was 
measured following addition of the lipoprotein fraction of calf serum 
(open squares), unfractionated calf serum (closed circles), or 
nonlipoprotein fraction of calf serum (open triangles). It is evident that 
the lipoproteins greatly depress the activity of the enzyme, while the 
nonlipoproteins have little effect. 


Source: From M. S. Brown et al., Proc. Nat'l. Acad. Sci. U.S.A. 70:2166, 
1973. 


question, Brown and Goldstein initiated studies on the interac- 
tion between the cells and the lipoproteins. They added radio- 
actively labeled LDL to culture dishes containing a single layer of 
fibroblasts derived from either FH-afflicted or normal human sub- 
jects.’ The normal fibroblasts bound the labeled LDL molecules 
with high affinity and specificity, but the mutant cells showed vir- 
tually no ability to bind these lipoprotein molecules (FIGURE 4). 
These results indicated that normal cells have a highly specific 
receptor for LDL and that this receptor was defective or missing 
in cells from patients with FH. 

To visualize the process of receptor binding and internaliza- 
tion, Brown and Goldstein teamed up with Richard Anderson, who 
had been studying cellular structure with the electron microscope. 
The group incubated fibroblasts from normal and FH subjects with 
LDL that had been covalently linked to the iron-containing pro- 
tein ferritin. Because of the iron atoms, ferritin molecules are able 
to scatter a beam of electrons and thus can be visualized in the 


FIGURE 3 Fibroblast cells from either a 
control subject (closed circles) or a patient 
with homozygous FH (open circles) were 
grown in dishes containing fetal calf serum. 
On day 6 (which corresponds to 0 hours on 
the graph), the medium was replaced with 
fresh medium containing lipoprotein- 
deficient human plasma. At the indicated 
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time, extracts were prepared, and HMG CoA 
reductase activity was measured. If we look 
at the control cells, it is apparent that at the 
beginning of the monitoring period the cells 
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have very little enzyme activity because the 0 8 16 24 32 


medium had contained enough cholesterol- 
containing lipoproteins that the cells did not 
need to synthesize their own. Once the 
medium was changed to the lipoprotein-deficient plasma, the cells were 
no longer able to use cholesterol from the medium and thus increased 
the amount of enzyme within the cell. In contrast, the cells from the FH 
patients showed no response to either the presence or absence of 
lipoproteins in the medium. 


Source: From J. L. Goldstein and M. S. Brown, Proc. Nat'l. Acad. Sci. 
U.S.A. 70:2805, 1973. 
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FIGURE 4 Time course of radioactive ['**l]-labeled LDL binding to cells 
from a normal subject (circles) and a homozygote with FH (triangles) at 
378C. The cells were incubated in a buffer containing 5 mg/ml ['*°l] LDL in 
the presence (open circles and triangles) and absence (closed circles and 
triangles) of 250 mg/ml of nonradioactive LDL. It is evident that in the 
absence of added nonradioactive LDL, the normal cells bind significant 
amounts of the labeled LDL, while the cells from the FH patients do not. 
The binding of labeled LDL is greatly reduced in the presence of 
nonradioactive LDL because the nonlabeled lipoproteins compete with the 
labeled ones for binding sites. Thus, the binding of the lipoprotein to the 
cells is specific (i.e., it is not the result of some nonspecific binding event). 


Source: From M. S. Brown and J. L. Goldstein, Proc. Nat’l. Acad. Sci. 
U.S.A. 71:790, 1974. 


electron microscope. When normal fibroblasts were incubated 
with LDL-ferritin at 4°C, a temperature at which ligands can bind 
to the cell surface but cannot be internalized, LDL-ferritin parti- 
cles were seen to be bound to the cell surface. Close examination 
revealed that the LDL particles were not randomly scattered 
over the cell surface but were localized to short segments of 
the plasma membrane where the membrane was indented and 
coated by a “fuzzy” material (FIGURE 5).° These segments of the 
membrane were similar to the coated pits originally described by 
Roth and Porter and had since been seen in a variety of cell types. 
Although cells from patients with FH were found to have a sim- 
ilar number of coated pits on their surface, no LDL-ferritin was 
bound to these mutant cells. The results supported the proposal 
that the mutant FH allele encoded a receptor that was unable to 
bind LDL. Subsequent electron microscopic studies on the inter- 
nalization of LDL-ferritin revealed the endocytic pathway by which 


FIGURE 5 Electron micrograph showing the binding of LDL to a 
coated pit of a human fibroblast. The LDL is made visible by conjugating 
the particles to electron-dense, iron-containing ferritin. Bar: 100 nm. 
Source: From R. G. W. Anderson, M. S. Brown, and J. L. Goldstein, Cell 
10:356, 1977; Reprinted by permission of Elsevier. 


these lipoprotein particles were internalized, as described in the 
chapter.’ 

Based on these results, the group postulated that the rapid 
internalization of receptor-bound LDL is strictly dependent on the 
localization of LDL receptors in coated pits. It follows from this 
postulate that, ifan LDL receptor failed to become localized within 
a coated pit, it would be unable to deliver its bound ligand to cel- 
lular lysosomes and thus would be unable to affect cholesterol 
biosynthesis within the cell. At about this time, an LDL receptor 
with a different type of mutation was discovered. LDL receptors 
bearing this new defect (known as the J. D. mutation after the 
patient in which it occurred) bound normal amounts of radio- 
actively labeled LDL, yet the receptor-bound lipoprotein failed 
to be internalized and consequently was not delivered to cyto- 
plasmic lysosomes for processing.'? Anderson and co-workers 
postulated that the LDL receptor was a transmembrane protein 
that normally became localized in coated pits because its cyto- 
plasmic domain was specifically bound by a component of the 
coated pits, possibly clathrin (but later identified as a likely sub- 
unit of an AP adaptor, as discussed below). Because of a defect 
in its cytoplasmic domain, the J. D. mutant receptor was unable 
to become localized in a cell's coated pits. People with this muta- 
tion exhibit the same phenotype as patients whose receptors are 
unable to bind LDL. Some of the historical background to the 
work just described can be found in reference." 

Subsequent studies determined that the normal LDL receptor 
is a transmembrane glycoprotein of 839 amino acids, with the 50 
amino acids at the C-terminal end of the protein extending inward 
from the membrane as a cytoplasmic domain. Analysis of the 
J. D. mutant receptor revealed that the protein contained a single 
amino acid substitution: a tyrosine residue normally located at 
position 807 was replaced by a cysteine.'* This single alteration 
in amino acid sequence obliterated the ability of the protein to 
become concentrated in coated pits. 

Over the next few years, attention turned to the amino acid 
sequences of the cytoplasmic tails of other receptors that became 
localized in coated pits. Studies on a diverse array of membrane 
receptors turned up several internalization signals that were shared 
by numerous membrane proteins. The two most common such 
signals are (1) a YXX¢ signal (as in the transferrin receptor) where 
Y is tyrosine, X can be any amino acid, and @ is an amino acid with 
a bulky hydrophobic side chain and (2) an acidic dileucine signal 
containing two adjacent leucine residues (as in the CD4 protein 
that serves as the receptor for HIV on the surface of T lympho- 
cytes). The YXX sequence of the receptor binds to the p subunit 
of the AP2 adaptors, and the dileucine motif binds to the o sub- 
unit of the AP2 adaptors (Figure 8.40b).'> X-ray crystallographic 
studies have revealed the nature of the interactions between the 
adaptor and these internalization signals. Included within the 
p_ subunit are two hydrophobic pockets, one that binds the tyrosine 
residue and the other that binds the bulky hydrophobic side chain 
of the YXXo internalization signal.'* Similarly, the dileucine signal 
binds to a hydrophobic pocket in the o subunit.'? Meanwhile, the 
AP2 adaptor complex binds to the clathrin coat by means of its 
6 subunit (Figure 8.406). As a result of these various intermolecular 
contacts, the adaptor complex and the receptor are trapped in 
coated pits prior to endocytosis. Reviewed in 16-18 

Over the past decade, many different lines of investigation 
of clathrin-mediated endocytosis have been followed. One of the 
most rewarding experimental approaches has been to label two 
or more components of the endocytic machinery with different 
fluorescent markers and follow their movements over time within a 
living cell. Using this approach, researchers have watched dynamin, 
AP2 adaptors, or various types of cargo receptors bind to clathrin- 
coated pits become enveloped within a clathrin-coated vesicle, 


which then buds off into the cytoplasm and disappears. FIGURE 6 
depicts an example of this type of experiment.'? The fluorescence 
micrographs show the surface of a cultured mammalian epithelial 
cell that is expressing a clathrin light chain that is fused to a variant 
of the green fluorescent protein. The green patches, for the most 
part, represent clathrin-coated pits situated at the cell surface. 
The red-staining spots are individual fluorescently labeled LDL 
particles that were added to the medium in which the cells were 
growing. The series of images illustrate a particular clathrin-coated 
pit (indicated by the green arrow in the second frame marked 23) 
that has captured a particular LDL particle (indicated by the red 
arrows in these same early frames). Once the LDL particle has been 
bound to an LDL receptor in the coated pit, the overlap of the 
two fluorescent dyes produces a yellow-orange spot (indicated by 
the yellow arrows in the frames marked T» and T3). The remaining 
frames (marked T,) show the uncoated vesicle containing the red 
fluorescent LDL particle moving into the adjacent cytoplasm. 
Recent advances in fluorescence microscopy have allowed 
researchers to quantify the precise numbers of fluorescent mol- 
ecules at the membrane of a living cell with high temporal res- 
olution, providing a detailed understanding of the early events 
eading to clathrin-mediated endocytosis. Some of these studies 
ake advantage of a technique known as total internal reflection 
fluorescence (TIRF) microscopy, in which fluorophores that are 
ocated within a very small distance (less than 200 nm) from the 
cover slip are excited. This typically results in the ability to only 
clearly “see” molecules that are at the cell membrane with very 
ittle background from fluorescently labeled cytoplasmic proteins. 
Using TIRF microscopy with cells expressing a GFP-labeled clath- 
rin light chain or AP2, a group led by Tom Kirchhausen at Harvard 
edical School were able to quantify the typical numbers of 
clathrin and adaptor proteins needed to initiate an endocytotic 
event.”° They found that the first event usually consisted of a single 
clathrin molecule being captured at the membrane by two mem- 
brane-bound AP2 complexes. If a second event, consisting of the 
association of another clathrin with its own pair of AP2s, happens 
quickly enough (within a couple of seconds), then the lattice is sta- 
bilized and is more likely to result in a complete endocytic event. 


FIGURE 6 A series of fluorescence images showing the capture of a 
single red-fluorescent LDL particle by a green-fluorescent clathrin- 
coated pit and its incorporation into a clathrin-coated vesicle, which 
becomes uncoated and moves into the cytoplasm. The events are 
described in the text. T;-T, represent the intervals before association 
(T,), during association (T2), joint lateral motion of LDL and clathrin prior 
to uncoating (T3), and motion of LDL after uncoating (T,), respectively. 
The times are indicated in seconds in each frame, with 0 corresponding 
to the time when the LDL and clathrin become associated. 


Source: From Marcelo Ehrlich et al., Courtesy of Tomas Kirchhausen, 
Cell 118:597, 2004; Reprinted by permission of Elsevier. 
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8.19 The Endocytic Pathway 


Molecules taken into a cell by endocytosis are routed through a well- 
defined endocytic pathway (FIGURE 8.43a). Before describing events 
that occur along the endocytic pathway, it is worth considering two 
different types of receptors that are subjected to endocytosis. One 
group of receptors, which we will refer to as “housekeeping receptors,” 
is responsible for the uptake of materials that will be used by the cell. 
The best studied examples are the transferrin and LDL (low-density 
lipoprotein) receptors, which mediate the delivery to cells of iron 
and cholesterol, respectively. The LDL receptor is discussed in detail 
at the end of this section. The red-colored receptor of Figure 8.43a 
represents a housekeeping receptor. The second group of receptors, 
which we will refer to as “signaling receptors, is responsible for 
binding extracellular ligands that carry messages that change the 
activities of the cell. These ligands, which include hormones such as 
insulin and growth factors such as EGF, bind to the surface receptor 
(shown in green in Figure 8.43a) and signal a physiologic response 
inside the cell (discussed at length in Chapter 15). Endocytosis of the 
first group of receptors leads typically to the delivery of the bound 
materials, such as iron and cholesterol, to the cell and the return of 
the receptor to the cell surface for additional rounds of uptake. 
Endocytosis of the second group of receptors often leads to the 
destruction of the receptor, a process called receptor down-regulation, 
which has the effect of reducing the sensitivity of the cell to further 
stimulation by the hormone or growth factor. Receptor down- 
regulation is a mechanism by which cells regulate their ability to 
respond to extracellular messengers. Signaling receptors are typically 
marked for endocytosis and subsequent destruction by the covalent 


attachment of a “tag” to the cytoplasmic tail of the receptor while it 
resides at the cell surface. The tag is a small protein called ubiquitin, 
which is added enzymatically. Membrane proteins that are not nor- 
mally subjected to endocytosis become internalized if they are made 
to carry an added ubiquitin.° 

Following internalization, vesicle-bound materials are trans- 
ported to a dynamic network of tubules and vesicles known collec- 
tively as endosomes, which represent distribution centers along 
the endocytic pathway. The fluid in the lumen of endosomes is acidi- 
fied by a H*-ATPase in the boundary membrane. Endosomes are 
divided into two classes: early endosomes, which are typically 
located near the peripheral region of the cell, and late endosomes, 
which are typically located closer to the nucleus. According to the 
prevailing model, early endosomes progressively mature into late 
endosomes. This transformation from an early to a late endosome is 
characterized by a decrease in pH, an exchange of Rab proteins (e.g., 
from Rab5 to Rab7), and a major change in the internal morphology 
of the structures. 

Receptors that are taken up by endocytosis are transported in 
vesicles to an early endosome, which serves as a sorting station that 


° In its best-studied role, ubiquitin is attached to a protein to target that protein for 
degradation as discussed in Section 12.21. This type of degradation signal is com- 
posed of a number of ubiquitin molecules that are joined to one another (to form a 
polyubiquitin chain) by covalent linkages at a specific residue on each ubiquitin mol- 
ecule, namely Lys48. In contrast, internalization of a membrane protein is thought to 
be triggered by attachment to that protein of either a single ubiquitin molecule or a 
short polyubiquitin chain in which the ubiquitin molecules are linked to one another 
at Lys63 rather than Lys48 (see Trends Cell Biol. 21:647.2011). 
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FIGURE 8.43 The endocytic pathway. (a) The movement of materials from 
the extracellular space to early endosomes where sorting occurs. Endocytosis of 
two types of receptor—ligand complexes is shown. Housekeeping receptors, 
such as the LDL receptor (shown in red), are typically sent back to the plasma 
membrane, whereas their ligands (blue spheres) are transferred to late 
endosomes. Signaling receptors, such as the EGF receptor (shown in green), are 
typically transported to late endosomes along with their ligands (yellow). Late 
endosomes also receive newly synthesized lysosomal enzymes (red spheres) 
from the TGN. These enzymes are carried by mannose 6-phosphate receptors 
(MPRs), which return to the TGN. The contents of late endosomes are trans- 
ferred to lysosomes by a number of routes (not shown). The inset on the left 
shows an enlarged view of a portion of a late endosome with an intraluminal 
vesicle budding inward from the outer membrane. The membranes of these 
vesicles contain receptors to be degraded. (b) Electron micrograph showing the 
internal vesicles within the lumen of a late endosome. A number of lysosomes 
are present in the vicinity. (c) The gold particles seen in this electron micrograph 
(which appear as black particles under EM) are bound to EGF receptors that 
were internalized by endocytosis and have become localized within the 
membranes of the internal vesicles of this late endosome. 

Source: (6) From J. Paul Luzio, Paul R. Pryor, and Nicholas A. Bright, Nature Revs. 
Mol. Cell Biol. 8:625, 2007; © 2007, reprinted by permission of Macmillan 
Publishers Ltd. (c) Courtesy of Clare Futter. 


directs different types of receptors and ligands along different path- 
ways (Figure 8.43a). “Housekeeping receptors” typically dissociate 
from their bound ligands as a result of the high H* concentration of 
the early endosomes. The receptors are then concentrated into spe- 
cialized tubular compartments of the early endosome, which repre- 
sent recycling centers. Vesicles that bud from these tubules carry 
receptors back to the plasma membrane for additional rounds of 
endocytosis (Figure 8.43a). In contrast, released ligands (e.g., LDLs) 
become concentrated into a sorting compartment before being 
dispatched to a late endosome and ultimately to a lysosome, where 
final processing occurs. As noted above, signaling receptors with 
attached ubiquitin tags do not recycle back to the membrane. 
Instead, these ubiquitinated receptors are sequestered into a popula- 
tion of small, spherical vesicles that crowd the interior of the late 
endosome (Figure 8.43b). This vesiculation process (shown in the 


inset on the left side of Figure 8.43a) is orchestrated by a series of 
protein complexes (called ESCRT complexes). Four different ESCRT 
complexes act in sequence to (1) sort the ubiquitinated receptors 
into a cluster within the late endosomal membrane, (2) cause that 
patch of membrane to invaginate as a bud into the lumen of the late 
endosome, and (3) sever the neck of the invagination to release the 
newly formed intraluminal vesicle. The segregation of signaling 
receptors within the lumen of a late endosome is shown in Figure 
8.43c. Because of these internal vesicles, late endosomes are also 
referred to as multivesicular bodies (MVBs). 

Among many examples of receptor-mediated endocytosis, the 
first studied and best understood is one that provides animal cells 
with exogenous cholesterol. Animal cells use cholesterol as an 
essential part of their plasma membranes and as a precursor to 
steroid hormones. Cholesterol is a hydrophobic molecule that is 
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FIGURE 8.44 LDL cholesterol. Each particle consists of esterified 
cholesterol molecules, surrounded by a mixed monomolecular layer of 
phospholipids and cholesterol and a single molecule of the protein 
apolipoprotein B-100, which interacts specifically with the LDL receptor 
projecting from the plasma membrane. 


transported in the blood as part of huge lipoprotein complexes, such 
as the low-density lipoprotein (LDL) shown in FIGURE 8.44. Each 
LDL particle contains a central core of about 1500 cholesterol mol- 
ecules esterified to long-chain fatty acids. The core is surrounded by 
a single layer of phospholipids that contains a single copy of a large 
protein, called apolipoprotein B-100, which binds specifically to LDL 
receptors on the surfaces of cells. 

LDL receptors are transported to the plasma membranes of 
cells, where they become concentrated in coated pits, even in the 
absence of LDL ligand. As a result, the receptors are on the cell sur- 
face ready to take up the bloodborne lipoproteins if they should 
become available. Once the LDL particles are bound to a coated pit, 
the pit invaginates to form a coated vesicle, the clathrin coat is disas- 
sembled, and the LDL receptors pass through the early endosomes 
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and back to the plasma membrane, as depicted in Figure 8.43a. 
Meanwhile, the LDL particles are delivered to late endosomes and 
lysosomes, where the protein component is degraded and the cho- 
lesterol is deesterified and used by the cell in membrane assembly or 
other metabolic processes (e.g., steroid hormone formation). Persons 
with a rare inherited disorder called Niemann-Pick type C disease 
lack one of the proteins required to transfer cholesterol out of lys- 
osomes. The resulting accumulation of cholesterol in these orga- 
nelles leads to nerve degeneration and death during early childhood. 
Studies of a different disease that led to the discovery of receptor- 
mediated endocytosis and LDL internalization are described in the 
accompanying Experimental Pathways. 

The level of LDL in the blood has been related to the develop- 
ment of atherosclerosis, a condition characterized by formation of 
plaques in the inner lining of arteries that reduce the flow of blood 
through the vessel and act as sites for the formation of blood clots. 
Blood clots that block the coronary arteries are the leading cause of 
myocardial infarction (heart attack). Studies suggest that atheroscle- 
rosis results from a chronic inflammatory response that is initiated 
by the deposition of LDL within the inner walls of blood vessels, as 
indicated in FIGURE 8.45. Lowering blood LDL levels is most readily 
accomplished by administration of drugs called statins (e.g., lovasta- 
tin and Lipitor) that block HMG CoA reductase, a key enzyme in the 
synthesis of cholesterol (discussed in the Experimental Pathway, 
Section 8.18). When blood cholesterol levels are lowered, the risk of 
heart attack is reduced. 

LDLs are not the only cholesterol-transporting agents in the 
blood. HDLs (high-density lipoproteins) have a similar construction 
but contain a different protein (apolipoprotein A-I) and play a dif- 
ferent physiologic role in the body. LDL serves primarily to carry 
cholesterol molecules from the liver, where they are synthesized and 
packaged, through the blood to the body’s cells. HDL carries choles- 
terol in the opposite direction. Excess cholesterol is transported out 
of the plasma membrane of the body’s cells directly to circulating 
HDL particles, which carry cholesterol to the liver for excretion. Just 
as high blood levels of LDL are associated with increased risk of 
heart disease, high blood levels of HDL have been thought to be 
associated with decreased risk, which has led to HDL being called 


Fibrous cap 


Foam cell containing LDL Plaque formation 


FIGURE 8.45 A model of atherosclerotic plaque formation. According to this model, plaque formation is initiated by various types of injury to the 
endothelial cells that line the vessel, including damage inflicted by oxygen free radicals that chemically alter the LDL-cholesterol particles. The injured 
endothelium acts as an attractant for white blood cells (leukocytes) and macrophages, which migrate beneath the endothelium and begin a process of 


chronic inflammation. The macrophages ingest the oxidized LDL, which becomes deposited in the cytoplasm as cholesterol-rich fatty droplets. These cells 
are referred to as macrophage foam cells and they are often already present in the blood vessels of adolescents and young adults. Substances released by 
the macrophages stimulate the proliferation of smooth muscle cells, which produce a dense, fibrous connective tissue matrix (fibrous cap) that bulges into 
the arterial lumen. Not only do these bulging lesions restrict blood flow, they are prone to rupture, which can trigger the formation of a blood clot and 
ensuing heart attack. 


the “good cholesterol.” There is little doubt that lowering LDL levels 
is beneficial, but the consequences of raising HDL levels is less clear- 
cut. For example, cholesterol molecules can be transferred from 
HDL to other lipoprotein particles by an enzyme called cholesteryl 
ester transfer protein (CETP), an activity that tends to lower HDL 
cholesterol levels. CETP has become a focus of research following 
the discovery of a population of Japanese families whose members 
routinely live more than 100 years and carry mutations in the CETP 
gene. Several small-molecular-weight CETP inhibitors have been 
tested in clinical trials and found to increase HDL levels in the blood. 
One of these drug candidates (torcetrapib) was dropped from fur- 
ther study despite the fact that it raised HDL blood levels. For rea- 
sons that are unclear, subjects taking the drug and a statin were 
considerably more likely to die than those in the control group who 
took only the statin. Another study focused on a mutation in an 
endothelial lipase that caused an increase in HDL. Those with the 
mutation, the study found, were not at lower risk for heart attacks. 

One of the most promising cholesterol-lowering drug targets is 
PCSK9, a proteolytic enzyme that destroys LDL receptors in the 
liver. Persons that carry inactivating mutations in the gene encoding 
PCSK9 have reduced LDL cholesterol levels and decreased heart dis- 
ease. At least two PCSK9 inhibitors (REGN727 and AMG145) are in 
clinical trials. 


REVIEW | 


1. Describe the steps that occur between the binding of 
an LDL particle to the plasma membrane of a cell and 


the entry of cholesterol molecules into the cytosol. 
st 


8.20 Phagocytosis 


Phagocytosis (“cell eating”) is carried out extensively by a few types 
of cells specialized for the uptake of relatively large particles (>0.5 um 
diameter) from the environment. Many single-celled protists, such as 
amoebas and ciliates, make their livelihood by trapping food parti- 
cles and smaller organisms and enclosing them within folds of the 
plasma membrane. The folds fuse to produce a vacuole (or phago- 
some) that pinches off inwardly from the plasma membrane. The 
phagosome fuses with a lysosome, and the material is digested within 
the resulting phagolysosome. 

In most animals, phagocytosis is a protective mechanism rather 
than a mode of feeding. Mammals possess a variety of “professional” 
phagocytes, including macrophages and neutrophils, that wander 
through the blood and tissues phagocytizing invading organisms, 
damaged and dead cells, and debris. These materials are recognized 
and bound by receptors on the surface of the phagocyte prior to 
uptake. Once inside the phagocyte, microorganisms may be killed by 
lysosomal enzymes or by oxygen free radicals generated within the 
lumen of the phagosome. The process of particle engulfment is 
pictured in FIGURE 8.46a. The steps in the digestion of engulfed 
materials are illustrated in Figure 8.46b. The engulfment of 
particulate material by phagocytosis is driven by contractile activi- 
ties of the actin-containing microfilaments that underlie the plasma 
membrane. 

Not all bacteria ingested by phagocytic cells are destroyed. In 
fact, some species hijack the phagocytic machinery to promote their 
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FIGURE 8.46 Phagocytosis.(a) The process of engulfment as illustrated 
by a polymorphonuclear leukocyte ingesting a yeast cell (lower left). (b) The 
steps that occur in the phagocytic pathway (see also page 292). 

Source: (a) From Janet Boyles and Dorothy F. Bainton, Cell 24:906, 1981, 
reprinted by permission of Elsevier. 


own survival in the body. Mycobacterium tuberculosis, the agent 
responsible for tuberculosis, for example, is taken into the cytoplasm 
of a macrophage by phagocytosis, but the bacterium is able to inhibit 
the fusion of its phagosome with a lysosome. Even if the phagosome 
does become highly acidic, the bacterium appears able to maintain 
its own physiological pH despite the lowered pH of its surrounding 
medium. The bacterium responsible for Q fever, Coxiella burnetii, 
becomes enclosed in a phagosome that does fuse with a lysosome, 
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destroy the pathogen. Listeria monocytogenes, a bacterium that 
causes meningitis, produces proteins that destroy the integrity of 
the lysosomal membrane, allowing the bacterium to escape into the 
cell’s cytosol (see Section 9.15). 


REVIEW | 


1. How does the role of phagocytosis in the life of an 
amoeba compare to its role in a multicellular animal? 
What are the major steps that occur during 
phagocytosis? 


8.21 Posttranslational Uptake 
of Proteins by Peroxisomes, 
Mitochondria, and Chloroplasts 


The division of the contents of a cell into large numbers of compart- 
ments presents many organizational challenges to the cell’s protein- 
trafficking machinery. We have seen in this chapter that protein 
trafficking within a eukaryotic cell is governed by (1) sorting sig- 
nals, such as the signal peptide of secreted proteins or mannose- 
phosphate groups of lysosomal enzymes, and (2) receptors that 
recognize these signals and deliver proteins containing them to the 
proper compartment. Four of the cell’s major organelles—nucleus, 
mitochondria, chloroplasts, and peroxisomes—import proteins 
through one or more outer boundary membranes. As in the case of 
the rough ER, proteins that are imported by these organelles con- 
tain amino acid sequences that serve as addresses that are recog- 
nized by receptors at the organelle’s outer membrane. Unlike the 
rough ER, which generally imports its proteins cotranslationally, 
the proteins of these other organelles are imported posttranslation- 
ally, that is, following their complete synthesis on free ribosomes in 
the cytosol. 

The import of proteins into the nucleus is a specialized topic 
and will be discussed separately in Section 12.2. Import of proteins 
into peroxisomes, mitochondria, and chloroplasts will be discussed 
in the following sections. 


Uptake of Proteins into Peroxisomes 


Peroxisomes are very simple organelles having only two subcom- 
partments in which an imported protein can be placed: the bound- 
ary membrane and the internal matrix (Section 5.10). Proteins 
destined for a peroxisome possess a peroxisomal targeting signal, 
either a PTS for a peroxisomal matrix protein or an mPTS for a per- 
oxisomal membrane protein. Several different PTSs, mPTSs, and 
PTS receptors have been identified. PTS receptors bind to 
peroxisome-destined proteins in the cytosol and shuttle them to the 
surface of the peroxisome, where they can enter the organelle. Recent 
studies have shown that peroxisomes form from the fusion of two 
distinct types of ER-derived vesicles, each of which carries half of a 
peroxisomal translocon (also called the importomer). Fusion of the 
vesicles allows for the formation of a complete peroxisomal translo- 
con, which then facilitates the movement of proteins into the peroxi- 
some. Unlike mitochondria and chloroplasts, whose imported 


proteins must assume an unfolded state, peroxisomes are somehow 
able to import peroxisomal matrix proteins in their native, folded 
conformation, even those that consist of several subunits. The mech- 
anism by which peroxisomes are able to accomplish this daunting 
assignment remains a matter of speculation. 


Uptake of Proteins into Mitochondria 


Mitochondria have four subcompartments into which proteins can 
be delivered: an outer mitochondrial membrane (OMM), inner 
mitochondrial membrane (IMM), intermembrane space, and matrix 
(see Figure 5.3c). Although mitochondria synthesize a few of their 
own integral membrane polypeptides (13 in mammals), roughly 99 
percent of the organelle’s proteins are encoded by the nuclear 
genome, synthesized in the cytosol, and imported posttranslation- 
ally. We will restrict the present discussion to proteins of the mito- 
chondrial matrix and inner mitochondrial membrane, which 
together constitute the vast majority of the proteins targeted to this 
organelle. As with peroxisomal proteins, and proteins of other com- 
partments, mitochondrial proteins contain signal sequences that 
target them to their home base. Most mitochondrial matrix proteins 
contain a removable targeting sequence (called the presequence) 
located at the N-terminus of the molecule (step 1, FIGURE 8.47a). 
A presequence typically includes a number of positively charged 
residues on one face of the helix and hydrophobic residues on the 
opposite face. In contrast, most proteins destined for the IMM con- 
tain internal targeting sequences that remain as part of the molecule. 

Before a protein can enter a mitochondrion, several events are 
thought to take place. First, the protein must be presented to the 
mitochondrion in a relatively extended, or unfolded, state (steps 1 
and A, Figure 8.47a). Several different molecular chaperones (e.g., 
Hsp70 and Hsp90) have been implicated in preparing polypeptides 
for uptake into mitochondria, including ones that specifically direct 
mitochondrial proteins to the cytosolic surface of the OMM (Figure 
8.47). The OMM contains a protein-import complex, the TOM com- 
plex, which includes (1) receptors that recognize and bind mito- 
chondrial proteins and (2) protein-lined channels through which 
unfolded polypeptides are translocated across the outer membrane 
(steps 2 and B).’ Proteins that are destined for the IMM or matrix 
must pass through the intermembrane space and engage a second 
protein-import complex located in the IMM, called a TIM complex 
(Figure 8.47b). The IMM contains two major TIM complexes: 
TIM22 and TIM23. TIM22 binds integral proteins of the IMM that 
contain an internal targeting sequence and inserts them into the 
lipid bilayer (steps C-D, Figure 8.47a). TIM23, in contrast, binds 
proteins with an N-terminal presequence, which includes all of the 
proteins of the matrix (as well as a number of proteins of the IMM 
that will not be discussed). TIM23 recognizes and translocates the 
matrix proteins completely through the IMM and into the inner 
aqueous compartment (step 3, Figure 8.47a). Translocation occurs 
at sites where the outer and inner mitochondrial membranes come 
into close proximity so that the imported protein can cross both 


It is interesting to note that, unlike the translocon of the ER or peroxisome, the pore- 
forming protein of the TOM complex (Tom 40) is a B-barrel protein, like the other 
integral proteins of the OMM (page 172), reflecting its evolution from the outer mem- 
brane of an ancestral bacterium. This has functional consequences, as the 
B-barrel protein cannot open laterally to allow integral proteins to insert into the OMM. 
Asa result, OMM proteins have to pass into the intermembrane space before entering 
the OMM bilayer. 
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FIGURE 8.47 Importing proteins into a mitochondrion. (a) Proposed steps taken by proteins imported posttranslationally into either the mitochondrial 
matrix or inner mitochondrial membrane. The polypeptide is targeted to a mitochondrion by a targeting sequence, which is located at the N-terminus in the 
matrix protein (step 1) and is located internally in most inner membrane proteins (step A). Cytosolic Hsp70 molecules unfold the polypeptides prior to their 
entry into the mitochondrion. The proteins are recognized by membrane receptors (red transmembrane proteins) and translocated through the OMM by 
way of pores in the TOM complex of the OMM (step 2 or B). Most integral proteins of the IMM are directed to the TIM22 complex of the IMM (step C), 
which steers them into the lipid bilayer of the IMM (step D). Mitochondrial matrix proteins are translocated through the TIM23 complex of the IMM (step 3). 
Once the protein enters the matrix, it is bound by a mitochondrial chaperone (step 4), which may either pull the polypeptide into the matrix or act like a 
Brownian ratchet to ensure that it diffuses into the matrix (these alternate chaperone mechanisms are discussed in the text). Once in the matrix, the unfolded 
protein assumes its native conformation (step 5a) with the help of Hsp60 chaperones (not shown). The presequence is removed enzymatically (step 5b). 

(b) A three-dimensional model of the mitochondrial protein-import machinery, showing the number, relative size, and topology of the various proteins 
involved in this activity. The TOM complex is a reddish color, the TIM23 complex is yellow-green, the TIM22 complex is green, and the cooperating 
chaperones are blue. 

Source: (b) From Toshiya Endo, Hayashi Yamamoto, and Masatoshi Esaki, J. Cell Science, cover of Vol. 116, #16, 2003; Reproduced by permission of The 
Company of Biologists, Ltd. http://jcs.biologists.org/content/1 16/16.cover-expansion. 


membranes simultaneously. Movement into the matrix is powered 
by the electric potential across the IMM acting on the positively 
charged targeting signal; if the potential is dissipated by addition of 
a drug such as DNP (page 187), translocation ceases. 

As it enters the matrix, a polypeptide interacts with mitochon- 
drial chaperones, such as mtHsp70 (step 4, Figure 8.47a), that 
mediate entry into the aqueous compartment. Two mechanisms 
have been proposed to explain the general action of molecular 


chaperones involved in the movement of proteins across mem- 
branes, which is a widespread phenomenon. According to one view, 
the chaperones act as force-generating motors that use energy 
derived from ATP hydrolysis to actively “pull” the unfolded poly- 
peptide through the translocation pore. According to the alternate 
view, the chaperones aid in the diffusion of the polypeptide across 
the membrane. Diffusion is a random process in which a molecule 
can move in any available direction. Consider what would happen if 
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encoded by nuclear genes are synthesized in the cytosol and 
imported through protein-lined pores in both membranes of the 
outer chloroplast envelope (step 1). Proteins destined for the stroma 
(step 1a) contain a stroma-targeting domain at their N-terminus, 
whereas proteins destined for the thylakoid (step 1b) contain both a 
stroma-targeting domain and a thylakoid-transfer domain at their 
N-terminus. Stromal proteins remain in the stroma (step 2) following 
translocation through the outer envelope and removal of their single 
targeting sequence. The presence of the thylakoid transfer domain 
causes thylakoid proteins to be translocated either into or completely 
through the thylakoid membrane (step 3). A number of the proteins of 
the thylakoid membrane are encoded by chloroplast genes and 
synthesized by chloroplast ribosomes that are bound to the outer 
surface of the thylakoid membrane (step 4). 


an unfolded polypeptide had entered a translocation pore in the 
mitochondrial membrane and had “poked its head” into the matrix. 
Then consider what would happen if a chaperone residing on the 
inner surface of the membrane were able to bind the protruding 
polypeptide in such a way that it blocked the diffusion of the 
polypeptide back through the pore and into the cytosol, but did not 
block its diffusion further into the matrix. As the polypeptide dif- 
fused further into the matrix, it would be bound repeatedly by the 
chaperone and at each stage prevented from diffusing backward. 
This mechanism of chaperone action is referred to as biased diffu- 
sion, and the chaperone is said to be acting as a “Brownian ratchet”; 
the term Brownian implies random diffusion, and a “ratchet” is a 
device that allows movement in only one direction. Recent studies 
suggest that both mechanisms of chaperone action are probably uti- 
lized and act cooperatively. Regardless of the mechanism of entry, 
once in the matrix the polypeptide achieves its native conformation 
(step 5a, Figure 8.47a) following enzymatic removal of the prese- 
quence (step 5b). 


Uptake of Proteins into Chloroplasts 


Chloroplasts have six subcompartments into which proteins can be 
delivered: an inner and outer envelope membrane and intervening 
intermembrane space, as well as the stroma, thylakoid membrane, 
and thylakoid lumen (Figure 8.48). Chloroplast and mitochondrial 
import mechanisms exhibit many similarities, although their 
translocation machineries have evolved independently. As in the 
mitochondria, 


1. The vast majority of chloroplast proteins (approximately 3000 in 
higher plants) are imported from the cytosol. 

2. The outer and inner envelope membranes contain distinct trans- 
location complexes (Toc and Tic complexes, respectively) that 
work together during import. 


Outer 
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3. Chaperones aid in the unfolding of the polypeptides in the cyto- 
sol and folding of the proteins in the chloroplast. 


4. Most proteins destined for the chloroplast are synthesized with a 
removable N-terminal sequence (termed the transit peptide) that 
is highly variable in length and sequence. 


The transit peptide does more than simply target a polypeptide 
to a chloroplast: it provides an “address” that localizes the polypep- 
tide to one of several possible subcompartments within the orga- 
nelle (Figure 8.48). All proteins translocated through the chloroplast 
envelope contain a stroma targeting domain as part of their transit 
peptide, which guarantees that the polypeptide will enter the 
stroma. Once in the stroma, the stroma targeting domain is removed. 
by a processing peptidase located in that compartment. Those poly- 
peptides that belong in a thylakoid membrane or thylakoid lumen 
bear an additional segment in their transit peptide, called the 


thylakoid transfer domain, that dictates entry into the thylakoids. 
Several distinct pathways have been identified by which proteins 
are either inserted into the thylakoid membrane or translocated 
into the thylakoid lumen. These pathways exhibit striking similari- 
ties to transport systems in bacterial cells, the presumed ancestors 
of chloroplasts. Many of the proteins that reside within the thyla- 
koid membrane are encoded by chloroplast genes and synthesized 
on membrane-bound ribosomes of the chloroplast, as illustrated in 
Figure 8.48, step 4. 


Analytic Questions 


1. Which, if any, of the following polypeptides would be expected to 
ack a signal peptide: collagen, acid phosphatase, hemoglobin, 
ribosomal proteins, glycophorin, proteins of the tonoplast? 


2. Suppose you suspected that a patient might be suffering from 
-cell disease (Section 8.12). How could you decide if this diagno- 
sis was accurate using cells that were cultured from the patient? 


3. In what cellular compartment would a glycoprotein be expected 
to have its greatest mannose content? its greatest N-acetylglu- 
cosamine content? its greatest sialic acid content? 


4. It was noted in Section 8.21 that peroxisomes are able to import 
folded proteins. Yet, these same organelles are impermeable to 
relatively small molecules such as NADH and acetyl-CoA. How is 
it possible to be permeable to one and not the other? 


5. Name two proteins you would expect to find as integral compo- 
nents of the RER membrane that would be absent from the SER. 
Name two proteins of the SER that would not be present in 
the RER. 


6. Suppose you wanted to study the process of regulated secretion 
in a cell that lacked mature secretory granules, that is, vesicles 
containing secretory material that was ready to be discharged. 
How might you obtain a cell that lacked such granules? 


7. It was described in Section 8.12 how glucocerebrosidase could 
be prepared that bore mannose residues at the end of its surface 
oligosaccharides rather than the usual sugar, sialic acid. A similar 
version of glucocerebrosidase with exposed mannose residues 
is being produced by recombinant DNA technology using a spe- 
cial line of cells. What characteristics do you think these cells 
might exhibit? Clue: Information to answer this question is pro- 
vided in Figure 8.22. 


8. Autoradiography depends on particles emitted from radioactive 
atoms striking a photographic emulsion that lies on top of the 
tissue section. When the emulsion is developed, the site where 
the particle struck the emulsion appears as a silver grain, as in 
Figure 8.3a. How do you think the thickness of the section might 


REVIEW | 

1. How are proteins, such as the enzymes of the TCA 
cycle, able to arrive in the mitochondrial matrix? 

2. What is the role of cytosolic and mitochondrial 
chaperones in the process of mitochondrial import? 

3. Distinguish between two possible import 
mechanisms: biased diffusion and force-generating 
motors. 

4. Describe the steps by which a polypeptide would 


10. 


is, 


12. 


13. 


14. 


15: 


16. 


17. 


move from the cytosol where it is synthesized to the 
thylakoid lumen. 


o+—_—____—__ 


affect the resolution of the technique, that is, the ability to locate 
the precise site in the cell where radioactivity is incorporated? 


. In which part of a cell would you expect the following com- 


pounds to be first incorporated: [?H]leucine, [PH]sialic acid, [SH] 
mannose, [*H]choline, [*H]glucuronic acid (a precursor of GAGs), 
[>H]pregnenolone (a precursor of steroid hormones), [H]rhamnose 
in a plant cell (rhamnose is a precursor of pectins)? 


Which of the following cells would you expect to be engaged 
most heavily in bulk-phase endocytosis: (a) an erythrocyte, (b) a 
pancreatic acinar cell, (c) a skeletal muscle cell? Why? 


Would you expect the properties of the cisternal side of Golgi 
membranes to be more similar to the extracellular or cytosolic 
side of the plasma membrane? Why? 


Which compartment(s) of a cell is associated with each of the 
following: clathrin, calcium ions in a skeletal muscle cell, dolichol 
phosphate, ribonuclease and lipase, digitalis, LDL receptors, 
COPI proteins, COPII proteins, unbound SRPs? 


f a slice of pancreatic tissue had been incubated in [?H]leucine 
continuously for 2 hours, where in the acinar cells would you 
expect to find incorporated radioactivity? 

f you were to add a drug that interfered with the ability of ribo- 
somes to bind to mRNA, what effect would this be expected to 
have on the structure of the RER? 


Not all receptors that carry out RME are located in coated pits 
prior to binding ligand, yet they too become concentrated in 
coated pits prior to internalization. How do you suppose the 
binding of a ligand to a receptor would cause it to become con- 
centrated into a coated pit? 


It was noted in the text that lysosomes lack a distinctive appear- 
ance in the electron microscope. How might you determine 
whether a particular vacuole is in fact a lysosome? 

Studies of a rare inherited disorder, Dent's disease, revealed that 
these individuals lacked a chloride ion channel in the endosomes 
of the cells of their kidney tubules. These patients suffered from 


307 


SUOIISENH DAjeUY 


308 


Bupjoyjes) Sueiquiay\ Pue ‘UO!}OUNY ‘aIN}ONIIS :swa}shs eueIquUa| ZUse|do1AD © 8 YALdVHOD 


18. 


a variety of symptoms that suggested that their endosomal com- 
partments were not as acidic as those of normal individuals. How 
do you suppose that a defect in a chloride channel could account 
for this condition? (See Quantitative Tutorial Video). 


Examine the fluorescence micrograph in Figure 8.20d. Although 
the Golgi complex is brightly stained, there is scattered red fluo- 
rescence in other parts of the cell. How can you explain the pres- 
ence of this dispersed staining? 


19. Figure 8.86 shows the localization of GFP-labeled mannosidase 


ll in a cell treated with an siRNA against an mRNA encoding a 
protein involved in an early step in the secretory pathway. How 
would you expect the fluorescence pattern to appear in a cell 
that had been treated with an siRNA against an mRNA encoding 
the protein GGA? or the protein clathrin? or a protein of the signal 
recognition particle? 
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The Cytoskeleton 
and Cell Motility 


POISONS, DRUGS, 
AND THE CYTOSKELETON 


Pisis phalloides, also known as the death cap, is a 
small, innocuous-looking mushroom found in forested 
regions across Europe. Sometimes misidentified for one 
of its edible cousins, the death cap has been held respon- 
sible for a majority of the mushroom poisonings in the 
world. Two major groups of multicyclic peptide toxins are 
produced by the death cap: amatoxins, which are potent 
inhibitors of RNA polymerase Il, and phallotoxins. 
Phalloidin, one of the phallotoxins produced by the 
mushroom, was first isolated in the 1930s and was subse- 
quently found to poison cells by binding tightly and 
specifically to actin filaments of the cytoskeleton. The very 
features that make phalloidin an effective toxin have also 
made it a powerful tool for cell biologists. Nowadays, 
fluorescently labeled phalloidin is found nearly ubiqui- 
tously in cell biology laboratories, because it is one of the 
primary means of identifying and characterizing the actin 
cytoskeleton of fixed cells under the light microscope. 


Other cytoskeletal toxins have proven to have important 
medical uses in addition to their utility in the laboratory. 
For example, the stems of the autumn crocus (Colchicum 
autumnale) have been used to treat joint pain and gout 
since ancient times. In the 1800s, chemists isolated the 


Actin networks of the leading edge of a motile cell are shown in this 
illustration. Actin is shown in shades of blue and green, capping protein in 
purple, Arp2/3 in yellow, and tropomyosin in pink. 

activity to a small molecule that they called colchicine. Source: Courtesy of Janet Iwasa and Dyche Mullins. 


Early studies on cells revealed that treatment with 
continued 
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chromosomes of many cells fully condensed and aligned in 
the center of the cell. This mitotic arrest was later found to 
be the result of high affinity binding of colchicine to tubulin, 
preventing the assembly of microtubules. Colchicine has 
proven useful in laboratories, particularly in karyotype 
studies, where it was instrumental in determining the correct 
number of chromosomes in humans. Colchicine also remains 
one of the most prescribed drugs for the treatment of gout, 
despite its narrow therapeutic range (a measure of the 
difference between a toxic and therapeutic dose). 


Taxol (also known by its generic name, paclitaxel) is probably 
the most famous cytoskeletal drug. The cytotoxic effect of 
taxol was first observed in the 1960s during a large-scale 
screen for anticancer compounds, in which cells were 
treated with an alcohol extract of the bark of a Pacific yew 
tree. Taxol was later identified as the active ingredient and 
has recently been found to be synthesized by an 
endosymbiont fungus living in the bark of the Pacific yew. 


9.1 Overview of the Major 
Functions of the Cytoskeleton 


The skeleton of a vertebrate is a familiar system consisting of hard- 
ened elements that support the soft tissues of the body and play a key 
role in mediating bodily movements. Eukaryotic cells also possess a 
“skeletal system’—a cytoskeleton—that has analogous functions. 
The cytoskeleton is composed of three well-defined filamentous 
structures—microtubules, actin filaments, and intermediate 
filaments—that together form an elaborate interactive and dynamic 
network. Each of the three types of cytoskeletal filaments is a poly- 
mer of protein subunits held together by weak, noncovalent bonds. 
This type of construction lends itself to rapid assembly and disas- 
sembly, which is dependent on complex cellular regulation. Each 
cytoskeletal element has distinct properties. Microtubules are long, 
hollow, unbranched tubes composed of subunits of the protein 
tubulin. Actin filaments are solid, thinner structures, often organized 


Like colchicine, taxol was found to bind specifically and 
tightly to tubulin, but it had a radically different effect on 
microtubules. Rather than preventing their polymerization, 
taxol stabilized microtubules and kept them from depolym- 
erizing. This effect was put to good use in cell biology 
laboratories, where taxol has played a key role in the 
discovery of microtubule-associated proteins (MAPs) and 
the study of microtubule-based motor proteins. In the clinic, 
taxol has made a major impact on the treatment of cancer 
and has been used since the 1980s as a key chemotherapeu- 
tic agent for a number of cancers, including those of the 
breast, lung, and ovary. These examples illustrate just a few 
of the many small molecules, natural and synthetic, that have 
been found to specifically target cytoskeletal components. 
As you explore this chapter, you might consider the 
following questions: What aspects of cytoskeletal proteins 
make them more amenable as drug targets? Can you think 
of other ways to “drug” the cytoskeleton towards a 
therapeutic end? 


into a branching network. Intermediate filaments are tough, ropelike 
fibers composed of a variety of related proteins. The properties of 
microtubules, intermediate filaments, and actin filaments are sum- 
marized in Table 9.1. 

An overview of the major activities of the cytoskeleton in three 
different non-muscle cells is shown in FIGURE 9.1. The cells 
depicted in this schematic illustration include a polarized epithe- 
lial cell, the tip of an elongating nerve cell, and a dividing cultured 
cell. As will be discussed in this chapter, the cytoskeleton of these 
cells functions as 


A dynamic scaffold providing structural support that can determine 
the shape of the cell and resist forces that tend to deform it. 

An internal framework responsible for positioning the various orga- 
nelles within the interior of the cell. This function is particularly 
evident in polarized epithelial cells, such as those depicted in 
Figure 8.11, in which certain organelles are arranged in a defined 
order from the apical to the basal end of the cell. 


TABLE 9.1 Properties of Microtubules, Intermediate Filaments, and Actin Filaments 


Microtubules 


Intermediate filaments Actin filaments 


Subunits incorporated into polymer GTP-«f-tubulin heterodimer 
Preferential site of incorporation + End ($-tubulin) 

Polarity Yes 

Enzymatic activity GTPase 

Motor proteins Kinesins, dyneins 

MAPs 

Stiff, hollow, inextensible tube 


Major group of associated proteins 
Structure 


25 nm outer diam. 

All eukaroytes 

Support, intracellular transport, 
cell organization 

Cytoplasm 


Dimensions 
Distribution 
Primary functions 


Subcellular distribution 


~70 different proteins, likely ATP-actin monomers 
incorporated as tetramers 

Internal + End (barbed) 

No Yes 

None ATPase 

None Myosins 

Plakins Actin-binding proteins 

Tough, flexible, extensible filament Flexible, inextensible helical 
filament 

8 nm diam. 

All eukaryotes 

Motility, contractility, intracellular 
transport 

Cytoplasm 


10-12 nm diam. 

Animals 

Structural support, mechanical 
strength 

Cytoplasm + nucleus 


Actin filaments 


Motor protein 


Intermediate 
filaments 


(b) 


toy 
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* < Microtubule 
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Motor protein 
(a) Epithelial cell (c) 


Intermediate 
filament 


Nerve cell 


Motor protein 
Actin filaments 


Dividing cell 
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’ Microtubule 


FIGURE 9.1 An overview of the structure and functions of the cytoskeleton. Schematic drawings of (a) an epithelial cell, (6) a neuron, and (c) a dividing 
cell. The microtubules of the epithelial and neurons function primarily in support and organelle transport, whereas the microtubules of the dividing cell form 
the mitotic spindle required for chromosome segregation. Intermediate filaments provide structural support for both the epithelial cell and neuron. 
Microfilaments support the microvilli of the epithelial cell and are an integral part of the motile machinery involved in neuronal elongation and cell division. 


A network of tracks that direct the movement of materials and orga- 
nelles within cells. Examples of this function include the delivery 
of mRNA molecules to specific parts of a cell, the movement of 
membranous carriers from the endoplasmic reticulum to the 
Golgi complex and the transport of vesicles containing neuro- 
transmitters down the length of a nerve cell. FIGURE 9.2 shows a 
small portion of a cultured cell, and it is evident that most of the 
fluorescent green organelles, which are labeled peroxisomes 
(page 217), are closely associated with the microtubules (red) of 


5 um 


FIGURE 9.2 An example of the role of microtubules in transporting 
organelles. The peroxisomes of this cell (shown in green and indicated by 
arrows) are closely associated with microtubules of the cytoskeleton (shown 
in red). Peroxisomes appear green because they contain a peroxisomal 
protein fused to the green fluorescent protein (Section 18.3). Microtubules 
appear red because they are stained with a fluorescently labeled antibody. 
Source: From E. A. C. Weimer, et al., J. Cell Biol. 136:78, 1997, Courtesy of 
S. Subramani, Fig. 6. Reproduced with permission of the Rockefeller 
University Press. 


the cell’s cytoskeleton. Microtubules are the tracks over which the 
peroxisomes are transported in mammalian cells. 

The force-generating apparatus that moves cells from one place to 
another. Single-celled organisms move either by “crawling” over 
the surface of a solid substratum or by propelling themselves 
through their aqueous environment with the aid of specialized, 
microtubule-containing locomotor organelles (cilia and flagella) 
that protrude from the cell’s surface. Multicellular animals have a 
variety of cells that are capable of independent locomotion, 
including sperm, white blood cells, and fibroblasts. The tip of a 
growing axon is also highly motile (Figure 9.1), and its movement 
resembles a crawling cell. 

An essential component of the cell’s division machinery. Cytoskeletal 
elements make up the apparatus responsible for separating the 
chromosomes during mitosis and meiosis and for splitting the 
parent cell into two daughter cells during cytokinesis. These 
events are discussed at length in Chapter 14. 


REVIEW | — 


1. What are five functions of the cytoskeleton? 
eo— —! 


9.2 Structure and Function 
of Microtubules 


Microtubules are hollow, tubular structures that are assembled 
from the protein tubulin. They are found in the cytoskeleton, the 
mitotic spindle, centrioles, and the core of cilia and flagella. They 
function in diverse activities such as support of the cell and 
movement of materials between the cell body and axon terminals of 
a neuron. 
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of Microtubules 


As the name implies, microtubules are hollow, relatively rigid, tubular 
structures, and they occur in nearly every eukaryotic cell. Microtubules 
are components of a diverse array of structures, including the mitotic 
spindle of dividing cells and the core of cilia and flagella. Microtubules 
have an outer diameter of 25 nm and a wall thickness of approximately 
4 nm and may extend across the length or breadth of a cell. The wall 
of a microtubule is composed of globular proteins arranged in longi- 
tudinal rows, termed protofilaments, that are aligned parallel to the 
long axis of the tubule (FIGURE 9.3a). When viewed in cross section, 
microtubules are seen to consist of 13 protofilaments aligned side 
by side in a circular pattern within the wall (Figure 9.3b). Noncovalent 
interactions between adjacent protofilaments are thought to play an 
important role in maintaining microtubule structure. 


_-4nm 


-8nm 


Each protofilament is assembled from dimeric building blocks 
consisting of one a-tubulin and one f-tubulin subunit. The two 
types of globular tubulin subunits have a similar three-dimensional 
structure and fit tightly together as shown in Figure 9.3c. The tubu- 
lin dimers are organized in a linear array along the length of each 
protofilament, as shown in Figure 9.3d. Because each assembly unit 
contains two nonidentical components (a heterodimer), the proto- 
filament is asymmetric, with an a-tubulin at one end and a B-tubulin 
at the other end. All of the protofilaments of a microtubule have the 
same polarity. Consequently, the entire polymer has polarity. One 
end of a microtubule is known as the plus end and is terminated by 
a row of {-tubulin subunits (Figure 9.3d). The opposite end is the 
minus end and is terminated by a row of a-tubulin subunits. As 
discussed later in the chapter, the structural polarity of microtubules 
is an important factor in the growth of these structures and their 
ability to participate in directed mechanical activities. 


FIGURE 9.3 The structure of microtubules. (a) Electron micrograph of 
negatively stained microtubules from brain showing the globular subunits that 
make up the protofilaments. The bumps at the surface of the microtubules are 
microtubule-associated proteins (MAPs), discussed later. (b) Electron 
micrograph of a cross section through a microtubule of a Juniperus root tip cell 
revealing the 13 subunits arranged within the wall of the tubule. The 
microtubules of these plant cells are most abundant in a cortical zone about 
100 nm thick just beneath the plasma membrane (seen in the lower right of the 
micrograph). (c) A ribbon model showing the three-dimensional structure of 
the aB-tubulin heterodimer. Note the complementary shapes of the subunits at 
their interacting surfaces. The B-tubulin subunit has a bound GTP, which is not 
hydrolyzed and is nonexchangeable. The f-tubulin subunit has a bound GDP, 
which is exchanged for a GTP prior to assembly into a polymer (Section 9.6). 
The plus end of the dimer is at the top. (d) Diagram of a longitudinal section of 
a microtubule shown in the B-lattice, which is the structure thought to occur in 
the cell. The wall consists of 13 protofilaments composed of «f-tubulin 
heterodimers stacked in a head-to-tail arrangement. Adjacent protofilaments 
are not aligned in register but are staggered about 1 nm so that the tubulin 
molecules form a helical array around the circumference of the microtubule. 
The helix is interrupted at one site where « and B subunits make lateral 
contacts. This produces a “seam” that runs the length of the microtubule. 
Source: (a) From Linda A. Amos, J. Cell Biol. 72:645, 1977, Fig. 2; Reproduced 
with permission of the Rockefeller University Press; (b) H. Sui and K. Downing, 
Structure 18:8, 2010, with permmission from Elsevier. (c) Courtesy Eva Nogales 
and Kenneth Downing. 


Microtubule-Associated Proteins 


Microtubules prepared from living tissue typically contain additional 
proteins, called microtubule-associated proteins (or MAPs). 
MAPs comprise a heterogeneous collection of proteins. The first 
MAPs to be identified are referred to as “classical MAPs” and typi- 
cally have one domain that attaches to the side of a microtubule and 
another domain that projects outward as a tail from the microtu- 
bule’s surface. The binding of one of these MAPs to the surface of a 
microtubule is depicted in FIGURE 9.4. Some MAPs can be seen in 
electron micrographs as cross-bridges connecting microtubules to 
each other, thus maintaining their parallel alignment. MAPs gener- 
ally increase the stability of microtubules and promote their assem- 
bly. The microtubule-binding activity of the various MAPs is 
controlled primarily by the addition and removal of phosphate 
groups from particular amino acid residues. An abnormally high 
level of phosphorylation of one particular MAP, called tau, has been 
implicated in the development of several fatal neurodegenerative 
disorders, including Alzheimer’s disease (Section 2.13). The brain 
cells of people with these diseases contain strange, tangled filaments 
(called neurofibrillary tangles) consisting of tau molecules that are 
excessively phosphorylated and unable to bind to microtubules. The 
neurofibrillary filaments have been proposed to contribute to the 
death of nerve cells, but whether these filaments actually help cause 
neuronal degeneration or are just a consequence of neuronal death is 
still a controversial point. Some evidence suggests that smaller 
aggregates of tau interfere with axonal transport long before neuro- 
brillary tangles are formed. But while the causal role of tau in 
Alzheimer’s disease remains a question, mutations in tau are known 
to be the direct cause of another neurodenerative disease called fron- 
totemporal dementia and Parkinsonism linked to chromosome 17 
(FTDP-17 for short). The fact that mutations in tau can cause such 
dementia, which accounts for approximately 5 percent of inherited 
cases of frontotemporal dementia, demonstrates that tau can become 
toxic to neurons and thus supports a potential role in causation of 
Alzheimer’s and other dementias. 


FIGURE 9.4 Microtubule-associated proteins (MAPs). Schematic 
diagram of a brain MAP2 molecule bound to the surface of a microtubule. 
The MAP2 molecule shown in this figure contains three tubulin-binding 
sites connected by short segments of the polypeptide chain. (An alternate 
isoform contains four binding sites). The binding sites are spaced at a 
sufficient distance to allow the MAP2 molecule to attach to three separate 
tubulin subunits in the wall of the microtubule. The tails of the MAP 
molecules project outward, allowing them to interact with other cellular 
components. 


Microtubules as Structural Supports 
and Organizers 


Microtubules are stiff enough to resist forces that might compress or 
bend the fiber. This property enables them to provide mechanical 
support much as steel girders support a tall office building or poles 
support the structure of a tent. The distribution of cytoplasmic 
microtubules in a cell helps determine the shape of that cell. In cul- 
tured animal cells, the microtubules extend in a radial array outward 
from the area around the nucleus, giving these cells their round, flat- 
tened shape (FIGURE 9.5). In contrast, the microtubules of columnar 
epithelial cells are typically oriented with their long axis parallel to 
the long axis of the cell (Figure 9.1a). This configuration suggests 
that microtubules help support the cell’s elongated shape. The role of 
microtubules as skeletal elements is particularly evident in certain 
highly elongated cellular processes, such as cilia and flagella and the 
axons of nerve cells. 

In plant cells, microtubules play an indirect role in maintaining 
cell shape through their influence on the formation of the cell wall. 
During interphase, most of a plant cell’s microtubules are located 
just beneath the plasma membrane, forming a distinct cortical zone. 
The enzyme that builds the cellulose of the cell wall, cellulose syn- 
thase (see Figure 7.34), is a transmembrane protein whose cytoplas- 
mic side is physically attached to cortical microtubules through a 
protein called CSI (Cellulose Synthase Interactive Protein 1). As a 
result, the cellulose microfibrils of the cell wall are assembled in an 
orientation that is parallel to the underlying microtubules of the cor- 
tex (FIGURE 9.6). The orientation of these cellulose microfibrils plays 
an important role in determining the growth characteristics of the 
cell and thus its shape. In most cells, newly synthesized cellulose 
microfibrils and coaligned microtubules are arranged perpendicular 
to the long axis of the cell (transversely), like the hoops around a bar- 
rel (Figure 9.6). Because the cellulose microfibrils resist lateral 
expansion, the turgor pressure exerted by the fluid in the cell vacuole 
is directed to the ends of the cell, causing cell elongation. 


FIGURE 9.5 Localization of microtubules of a flattened, cultured mouse 
cell is revealed by fluorescent anti-tubulin antibodies. Microtubules are 
seen to extend from the perinuclear region of the cell in a radial array. 
Individual microtubules can be followed and are seen to curve gradually as 
they conform to the shape of the cell. 

Source: From Mary Osborn and Klaus Weber, Cell 12:563, 1977, with 
permission from Elsevier. 


313 


SO|NQNJOIDI\ JO UO!}OUNY PUe sINJONIS e 74 


314 


AYWNOW []2D Pue UOI]axSOAD EYL e 6 YILdWHO 


FIGURE 9.6 Visualization of cellulose synthase demonstrates functional 


association with microtubules. Arabidopsis cells that co-express fluores- 
cently tagged cellulose synthase complexes (green) and microtubules (red) 
show colocalization (arrowhead). 


Source: From Paradez et al., Science. 312(3):1491-5. 


Microtubules also play a key role in maintaining the internal 
organization of cells. Treatment of cells with microtubule-disrupting 
drugs can seriously affect the location of membranous organelles, 
including the ER and Golgi complex. The Golgi complex is typically 
organized as a single ribbon located near the center of a mammalian 
cell (see Figure 8.20c), just outside the nucleus. Treatment of cells 
with nocodazole or colchicine, which promote microtubule disas- 
sembly, can disperse the Golgi elements into separate Golgi stacks 
scattered throughout the cytoplasm. When the drug is removed and 
microtubules reassemble, the Golgi membranes return to their nor- 
mal position in the cell interior. 


Microtubules as Agents 
of Intracellular Motility 


Living cells bristle with activity as macromolecules and organelles 
move in a directed manner from place to place. While this hustle and 
bustle can be appreciated by watching the movement of particulate 
materials in a living cell, the underlying processes are usually diffi- 
cult to study because most cells lack a highly ordered cytoskeleton. 
We know, for example, that the transport of materials from one 
membrane compartment to another depends on the presence of 
microtubules because specific disruption of these cytoskeletal ele- 
ments brings the movements to a halt. In nerve cells intracellular 
movements rely on a highly organized arrangement of microtubules 
and other cytoskeletal filaments. 

The axon of an individual motor neuron may stretch from the 
spinal cord to the tip of a finger or toe. The manufacturing center of 
this neuron is its cell body, a rounded portion of the cell that resides 
within the spinal cord. When labeled amino acids are injected into 
the cell body, they are incorporated into labeled proteins that move 
into the axon and gradually travel down its length. Many different 
materials, including neurotransmitter molecules, are compartmen- 
talized within membranous vesicles in the ER and Golgi complex of 
the cell body and then transported down the length of the axon 
(FIGURE 9.7a). Non-membrane-bound cargo, such as RNAs, ribosomes, 
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FIGURE 9.7 Axonal transport. (a) Schematic drawing of a nerve cell 
showing the movement of vesicles down the length of an axon along tracks of 
microtubules. Vesicles move in both directions within the axon. (b) Schematic 
drawing of the organization of the microtubules and intermediate filaments 
(neurofilaments) within an axon. Vesicles containing transported materials are 
attached to the microtubules by cross-linking proteins, including motor 
proteins, such as kinesin and dynein. (c) Electron micrograph of a portion of 
an axon from a cultured nerve cell, showing the numerous parallel microtu- 
bules that function as tracks for axonal transport. The two membrane-bound 
organelles shown in this micrograph were seen under a light microscope to 
be moving along the axon at the time the nerve cell was fixed. 

Source: From A.C. Breuer, C.N. Christian, M. Henkart, and P.G. Nelson, J. Cell 
Biol. 65:568, 1975, Fig. 3b. Reproduced with permission of Rockefeller 
University Press. 
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FIGURE 9.8 Visualizing axonal transport. (a-c) These video micrographs show the progression of a membranous organelle along a branched axon. The 
cell body is far out of the field to the upper left, while the termini (growth cones, page 379) are out of the field to the bottom right. The position of the 
organelle is indicated by the arrowheads. The organelle being followed (an autophagic vacuole) moves in the retrograde direction across the branch point 


and continues to move toward the cell body. Bar, 10 pm. 


Source: From Peter J. Hollenbeck, J. Cell Biol. 121:307, 1993, Fig. 1. Reproduced with permission of Rockefeller University Press. 


and even cytoskeletal elements are also transported along this vast 
stretch of extended cytoplasm. Different materials move at different 
rates, with the fastest axonal transport occurring at velocities of 5 um 
per second. At this rate, an immature synaptic vesicle pulled by 
nanosized motor proteins can travel 0.4 meter in a single day, or 
about halfway from your spinal cord to the tip of your finger. 
Structures and materials traveling from the cell body toward the 
terminals of a neuron are said to move in an anterograde direction. 
Other structures, including endocytic vesicles that form at the axon 
terminals and carry regulatory factors from target cells, move in the 
opposite, or retrograde, direction—from the synapse toward the cell 
body. Defects in both anterograde and retrograde transport have 
been linked to several neurological diseases including Amyotrophic 
Lateral Sclerosis (ALS), also known as Lou Gehrig’s disease. 

Axons are filled with cytoskeletal structures, including bundles 
of actin filaments, intermediate filaments, and microtubules inter- 
connected in various ways (Figure 9.7b,c). Using video microscopy, 
investigators can follow individual vesicles as they move along the 
microtubules of an axon, either toward or away from the cell body 
(FIGURE 9.8). These movements are mediated primarily by microtu- 
bules (Figure 9.7c), which serve as tracks for a variety of motor 
proteins that generate the forces required to move objects within a 
cell. The study of motor proteins has become a major focus in cell 
and molecular biology, and a great deal of information has been col- 
lected about the nature of these molecules and their mechanism of 
action, which are the subjects of the following section. 
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1. Describe three functions of microtubules. 


9.3 Motor Proteins: Kinesins 
and Dyneins 


The motor proteins of a cell convert chemical energy (stored in ATP) 
into mechanical energy, which is used to generate force, as occurs 
when a muscle cell contracts, or to move cellular cargo attached to 


the motor. Types of cellular cargo transported by these proteins 
include ribonucleoprotein particles, vesicles, mitochondria, lysosomes, 
chromosomes, and other cytoskeletal filaments. A single cell may con- 
tain a hundred different motor proteins, each presumably specialized 
for a distinct activity, such as the movement of particular types of 
cargo in a particular region of the cell. Collectively, motor proteins can 
be grouped into three broad superfamilies: kinesins, dyneins, and 
myosins. Kinesins and dyneins move along microtubules, whereas 
myosins move along actin filaments. No motor protein is known that 
uses intermediate filament tracks. This is not surprising considering 
that intermediate filaments are not polarized and thus would not pro- 
vide directional cues to the motor. 


Motor Proteins Traverse 
the Microtubular Cytoskeleton 


Motor proteins move unidirectionally along their cytoskeletal track in a 
stepwise manner, from one binding site to the next. As the protein 
moves along, it undergoes a series of conformational changes that con- 
stitute a mechanical cycle. The steps of the mechanical cycle are coupled 
to the steps of a chemical (or catalytic) cycle, which provide the energy 
necessary to fuel the motor’s activity (see Figure 9.61 for an example). 
The steps in the chemical cycle include the binding of an ATP molecule 
to the motor, the hydrolysis of the ATP, the release of the products 
(ADP and Pi) from the motor, and the binding of a new molecule of 
ATP. The binding and hydrolysis of a single ATP molecule at the cata- 
lytic site is used to drive a power stroke that moves the motor a precise 
number of nanometers along its track. As the motor protein moves to 
successive sites along the cytoskeletal polymer, the mechanical and 
chemical cycles are repeated over and over again, pulling the cargo over 
considerable distances. Keep in mind that we are describing molecular- 
sized motors, which, unlike human-made machines, are greatly influ- 
enced by their environment. For example, motor proteins have virtually 
no momentum and are subjected to tremendous frictional resistance 
from their viscous environment. As a result, a motor protein comes to a 
stop almost immediately once energy input has ceased. 

We will begin by examining the molecular structure and func- 
tions of kinesins, which are the smallest and best understood micro- 
tubular motors. 
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In 1985, Ronald Vale and colleagues isolated a motor protein from 
the cytoplasm of squid giant axons that used microtubules as its 
track. They named the protein kinesin, which was subsequently 
found in virtually all eukaryotic cells. The kinesin molecule discovered 
in 1985, which is referred to as “conventional kinesin” or kinesin-1, is 
only one member of a superfamily of related proteins, called KRPs 
(kinesin-related proteins). KRPs are classified into 14 different 
families (kinesin-1 to kinesin-14). In the following discussion the term 
kinesin will refer only to members of the kinesin-1 family. 

Each kinesin-1 molecule is a tetramer constructed from two 
identical heavy chains and two identical light chains (FIGURE 9.9a). 
A kinesin molecule has several parts, including a pair of globular 
heads that bind a microtubule and act as ATP-hydrolyzing, force- 
generating “engines.” Each head (or motor domain) is connected to 
a neck, a rodlike stalk, and a fan-shaped tail that binds cargo to be 
hauled (Figure 9.9a). The motor portions of all KRPs have related 
amino acid sequences, reflecting their common evolutionary ances- 
try and their similar role in moving along microtubules. In contrast, 
the tails of KRPs have diverse sequences, reflecting the variety of 
cargo these motors haul. A number of different proteins have been 
identified as potential adaptors that link specific KRPs and their car- 
goes. Surprisingly, the motor domains of kinesins are strikingly sim- 
ilar in structure to that of myosins, despite the fact that kinesins are 
much smaller proteins and the two types of motors operate over dif- 
ferent tracks. Kinesins and myosins almost certainly evolved from a 
common ancestral protein present in a primitive eukaryotic cell. 

The activity of purified kinesin molecules can be observed in an 
in vitro motility assay (See Figure 9.9¢ and the Experimental Walkthrough 
Video: Microtubule gliding assay for this chapter). In such an assay, puri- 
fied kinesin is attached to a glass coverslip, then microtubules are 
added along with media that contains ATP. Because the kinesins are 
anchored to the glass, the microtubules move in the direction opposite 
to the direction that the motor is stepping. When purified kinesin-1 is 
used in such an assay, the motor proteins move along microtubules 
toward their plus end. Kinesin is therefore said to be a plus end- 
directed microtubular motor. In an axon, where all of the microtu- 
bules are oriented with their minus ends facing the cell body, kinesin 
transports vesicles and other cargo toward the synaptic terminals. The 
discovery of kinesin is discussed in the Experimental Pathways, which 
can be found on the Web at www.wiley.com/college/karp. 

A single kinesin molecule moves along a single protofilament of 
a microtubule at a velocity proportional to the ATP concentration 
(up to a maximal velocity of about 1 um per second). At low ATP 
concentrations, kinesin molecules travel slowly enough for observers 
to conclude that the protein moves in distinct steps (Figure 9.9b). 
Each step of the molecule is approximately 8 nm in length, which is 
also the length of one tubulin dimer in a protofilament, and requires 
the hydrolysis of a single ATP molecule. The methods used for meas- 
uring the step size ofkinesin are described in Section 9.4, Experimental 
Pathway: The Step Size of Kinesin. It is now generally accepted that 
kinesin moves by a “hand-over-hand” mechanism depicted in Figure 9.9b 
and the associated animation. According to this model, which is basi- 
cally similar to a person climbing a rope, the two heads alternate in 
taking the leading and lagging positions without an accompanying 
rotation of the stalk and cargo at every step. 

The movement of kinesin molecules, both in vitro and in vivo, is 
highly processive, meaning that the motor protein tends to move 
along an individual microtubule for considerable distances (over 1 um) 
without falling off. A two-headed kinesin molecule can accomplish this 
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FIGURE 9.9 Kinesin. (a) Structure of a kinesin-1 molecule, which consists 
of two heavy chains that wrap around each other to form a single, common 
stalk and two light chains associated with the globular ends of the heavy 
chains. The human genome encodes three different heavy chains and four 
different light chains for kinesin-1. The force-generating heads bind to the 
microtubule, and the tail binds to the cargo being transported. Having a 
molecular mass of approximately 380 kDa, kinesin is considerably smaller 
than the other motor proteins, myosin (muscle myosin, 520 kDa) and dynein 
(over 1000 kDa). (b) Schematic diagram of a kinesin molecule moving along 
a microtubular track. In the alternate hand-over-hand model shown here, 
the two heads carry out equivalent but alternating movements, not unlike 
that of a person walking through a garden on a linear path of stepping 
stones. As with walking, the trailing head (leg) moves 16 nm forward 
alternately on the left and right side of the stalk (body). (c) Schematic of 
kinesin in vitro motility assay. Kinesins are deposited onto a coverslip, then 
microtubules labled with a fluorescent marker (green) are added to the 
coverslip and observed with a microscope. Motor activity of the kinesins 
causes the microtubules to glide over the coverslip. 


feat because at least one of the heads is attached to the microtubule at all 
times (Figure 9.9b). A motor protein with this capability is well adapted 
for independent, long-distance transport of small parcels of cargo. 
The two heads of a kinesin molecule behave in a coordinated 
manner, so that they are always present at different stages in their 
chemical and mechanical cycles at any given time. When one head 
binds to the microtubule, the resulting conformational changes in 
the adjacent neck region of the motor protein cause the other head 
to move forward toward the next binding site on the protofilament. 
The forward-moving head probably finds its precise binding site on 
the protofilament through a rapid, diffusional (random) search. The 


microtubule is not simply a passive track in these events but plays an 
active role in stimulating certain steps in the mechanical and chemi- 
cal cycles of the kinesin molecule. Freely soluble kinesins adopt a 
folded, self-inhibited conformation and require interaction with 
cargo and a microtubule to become activated. 

Like kinesin-1, most KRPs move toward the plus end of the 
microtubule to which they are bound. One small family (called 
kinesin-14), however, including the widely studied Ncd protein of 
Drosophila, moves in the opposite direction, that is, toward the 
minus end of the microtubular track. One might expect that the 
heads of plus end-directed and minus end-directed KRPs would 
have a different structure because they contain the catalytic motor 
domains. But the heads of the two proteins are virtually indistin- 
guishable. Instead, differences in directional movement have been 
traced to differences in the adjacent neck regions of the two proteins. 
When the head of a minus end-directed Ned molecule is joined to 
the neck and stalk portions of a kinesin-1 molecule, the hybrid pro- 
tein moves toward the plus end of the track. Thus, even though the 
hybrid has a catalytic domain that normally would move toward the 
minus end of a microtubule, as long as it is joined to the neck of a 
plus end motor, it moves in the plus direction. 

A third small family (kinesin-13) of kinesin-like proteins are 
incapable of movement. The KRPs of this group bind to either end of 
a microtubule and bring about its depolymerization rather than 
moving along its length. These proteins are often referred to as 
microtubule depolymerases.' The important role of these and other 
KRPs during cell division will be explored in Chapter 14. 


‘Members of the kinesin-8 family also act as depolymerases, but they reach the tip 
of the microtubule by processive movement. 
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saw in Chapter 8 how vesicles move from one membranous com- 
partment, such as the Golgi complex, to another, such as a lysosome. 
The routes followed by cytoplasmic vesicles and organelles are 
largely defined by microtubules (see Figure 9.1), and members of the 
kinesin superfamily are strongly implicated as force-generating 
agents that drive the movement of this membrane-bounded cargo. 
In most cells, as in axons, microtubules are aligned with their plus 
ends pointed away from the center of the cell. Therefore, members of 
the kinesin superfamily tend to move vesicles and organelles (e.g., 
peroxisomes and mitochondria) in an outward direction toward the 
cell’s plasma membrane. This is illustrated in the micrographs in 
FIGURE 9.10. The pair of micrographs labeled a and c shows a cell 
that was isolated from a normal 9.5-day mouse embryo and stained 
to reveal the location of its microtubules (green) and mitochondria 
(orange). The pair of micrographs labeled b and d shows a cell that 
was isolated from a 9.5-day mouse embryo lacking both copies of the 
gene that encodes the KIF5B kinesin heavy chain (a kinesin-1 family 
member). The mitochondria of the KIF5B-deficient cell are absent 
from the peripheral regions of the cell, as would be expected if this 
plus end-directed kinesin is responsible for the outward movement 
of membranous organelles (see also Figure 9.11c). 


Cytoplasmic Dynein 


The first microtubule-associated motor was discovered in 1963 as 
the protein responsible for the movement of cilia and flagella. The 
protein was named dynein. The existence of cytoplasmic forms was 
almost immediately suspected, but it took over 20 years before a 
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FIGURE 9.10 Alteration in the phenotype of a cell lacking a member of the kinesin superfamily. (a,c) Control cell from the extraembryonic tissues of 
a normal 9.5-day mouse embryo stained in a for microtubules (green) and in c for mitochondria (red). A significant fraction of the cell’s mitochondria are 
located along microtubules in the peripheral regions of the cell. (b,d) A corresponding cell obtained from an embryo that lacked both copies of the gene 
encoding KIF5B, which is one of three kinesin-1 isoforms in mice and humans. All of the mitochondria are clustered in the central region of the cell, 


suggesting that KIF5B is responsible for transporting mitochondria in an outward direction. 


Source: From Yosuke Tanaka, et al., Courtesy of Nobutaka Hirokawa, Cell 93:1150, 1998, with permission from Elsevier. 
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brain tissue and called cytoplasmic dynein. Whereas each of us 
has many different kinesins (and myosins), each adapted for specific 
functions, we are able to manage with only two cytoplasmic dyneins, 
one of which appears responsible for most transport operations. 

Cytoplasmic dynein is a huge protein (molecular mass of 
approximately 1.5 million daltons) composed of two identical heavy 
chains and a variety of intermediate and light chains (FIGURE 9.11a). 
Each dynein heavy chain consists of a large globular head with an 
elongated projection (stalk). The dynein head, which is an order of 
magnitude larger than a kinesin head, acts as a force-generating 
engine. Each stalk contains the all-important microtubule-binding site 
situated at its tip. The longer projection, known as the stem (or tail), 
binds the intermediate and light chains, whose functions are not well 
defined. Structural analyses indicate that the dynein motor domain 
consists of a number of distinct modules organized in the shape of a 
wheel (see Figure 9.31), which makes it fundamentally different in 
both architecture and mode of operation from kinesin and myosin. 

In vitro motility assays indicate that cytoplasmic dynein moves 
processively along a microtubule toward the polymer’s minus end— 
opposite that of most kinesins (Figure 9.11b). Cytoplasmic dynein is 
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present throughout the animal kingdom as well as in fungi and pro- 
tists, but appears to have been lost during the evolution of higher 
plants, which instead have an expanded array of minus end-directed 
kinesins (kinesin-14 members). A body of evidence suggests at least 
two well-studied roles for cytoplasmic dynein: 


1. As a force-generating agent in positioning the spindle and mov- 
ing chromosomes during mitosis (discussed in Chapter 14). 


2. Asa minus end-directed microtubular motor with a role in posi- 
tioning the centrosome and Golgi complex and moving orga- 
nelles, vesicles, and particles through the cytoplasm. 


In nerve cells, cytoplasmic dynein has been implicated in the ret- 
rograde movement of membranous organelles and the anterograde 
movement of microtubules. In fibroblasts and other nonneural cells, 
cytoplasmic dynein is thought to transport membranous organelles 
from peripheral locations toward the center of the cell (Figure 9.11c). 
Dynein-driven cargo includes endosomes and lysosomes, ER-derived 
vesicles heading toward the Golgi complex, RNA molecules, and the 
HIV virus, which is transported to the nucleus of an infected cell. 
Cytoplasmic dynein does not interact directly with membrane- 
bounded cargo but requires an intervening adaptor—most often the 
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FIGURE 9.11 Cytoplasmic dynein and organelle transport by microtubule-tracking motor proteins. (a) Structure of a cytoplasmic dynein molecule, 
which contains two dynein heavy chains and a number of smaller intermediate and light chains at the base of the molecule. Each dynein heavy chain 
contains a large, globular, force-generating head, a protruding stalk containing a binding site for the microtubule, and a stem. (b) Schematic diagram of two 
vesicles moving in opposite directions along the same microtubule, one powered by kinesin moving toward the plus end of the track, and the other by 
cytoplasmic dynein moving toward the minus end of the track. In the model shown here, each vesicle contains both types of motor proteins, but the kinesin 
molecules are inactivated in the upper vesicle and the dynein molecules are inactivated in the lower vesicle. Both motor proteins are attached to the vesicle 
membrane by an intermediary: kinesin can be attached to vesicles by a variety of integral and peripheral membrane proteins, and dynein by a soluble 
protein complex called dynactin. (c) Schematic illustration of kinesin-mediated and dynein-mediated transport of vesicles, vesicular-tubular clusters (VTCs), 


and organelles in a nonpolarized, cultured cell. 
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activity by increasing the processivity of dynein. microtubule. As discussed in Figure 9.44, myosin may also be present 
According to the simplified model shown in Figure 9.11c, kine- on some of these organelles, which provides the opportunity for the 
sins and cytoplasmic dynein move similar materials in opposite direc- _ organelles to move along actin filaments as well as microtubules. - 
tions over the same railway network. As indicated in Figure 9.11), ° 
individual organelles may bind kinesin and dynein simultaneously, x 
which endows these organelles with the capability to move in opposite © REVIEW | @ 
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9.4 EXPERIMENTAL PATHWAY 


The Step Size of Kinesin 


Motor proteins are able to generate mechanical force by under- 
going a series of conformational changes that cause the motor to 
take steps along the underlying cytoskeletal substrate (actin fila- 
ments for myosin motors or microtubules for kinesin motors). What 
do these conformational stepping changes look like? The simplest 
assumption is that as the motor walks along the microtubule or actin 
polymer, it might step the distance of a single tubulin dimer or actin 
subunit every time one ATP is hydrolyzed; indeed, this was how 
the stepping was drawn in cartoons soon after the motors proteins 
were discovered. But theoretical considerations showed that it was 
potentially possible for a motor to take much larger steps.' Initial 
attempts to estimate step size used in vitro motility assays to mea- 
sure the speed of motor movement. These assays are performed 
by coating a coverslip with myosin or kinesin, adding either actin 
filaments or microtubules along with a source of ATP in the buffer, 
then imaging the coverslip under a microscope. Motor activity then 
results in the actin or microtubule being translocated across the 
field of view at a speed equal to the speed at which the motor is 
walking along the filament (Figure 9.9). By combined such in vitro 
motility assays with assays for ATP hydrolysis rate carried out under 
similar experimental conditions, one can estimate how far a motor 
moves per ATP hydrolyzed. Attempts at this measurement resulted 
in a surprisingly wide range of estimates for the stepping size. For 
example, in the case of myosin, Uyeda and Spudich at Stanford 
University obtained step sizes consistent with myosin stepping 
from one actin monomer to the next along the filament,” while 
Toshio Yanagida’s group working at Osaka University estimated 
that the stepping size was 10 times larger.? One complication with 
this approach is that it assumes a perfect coupling between ATP 
hydrolysis and stepping, so that the number of steps taken in a 
given length of time can be calculated from the ATP hydrolysis rate. 
The very large step sizes estimated from some of the in vitro assays 
were taken as evidence that the coupling between ATP hydrolysis 
and mechanical stepping might be much looser than was initially 
thought. The debate about step size thus raised serious questions 
about how nucleotide hydrolysis was used to generate motion. 
Resolving the step size question required a way to measure move- 
ments of a protein on a size scale of a few nanometers. 

The first direct measurement of a motor taking single steps 
was an analysis of kinesin stepping performed by Karel Svoboda 
and Steve Block at the Rowland Institute* and took advantage of 
the optical trap technology that Block had invented. Although 
prior work on step size had mostly been focused on myosin, 
Block's group chose to study the stepping of kinesin, rather than 
myosin, for several reasons. First, kinesin was less likely to drift 
away from the substrate than myosin because it spends more of its 


cycle bound, thus eliminating potential problems with the motor 
drifting randomly during the measurement. Second, microtubules 
are more rigid than actin so that there should be less movement of 
the substrate itself, allowing them to focus on the movement of the 
motor itself. Finally, kinesin is able to move beads as a single motor, 
which allows an attached bead to serve as a “handle” for holding 
the kinesin and positioning it over a microtubule. This conscious 
choice to study stepping in kinesin illustrates an important point 
in experimental design: When testing a controversial hypothesis, it 
pays to use the most tractable model system available so that the 
results will be as unambiguous as possible. 

Once beads were prepared that had kinesin molecules 
attached, single beads were held in the laser trap while the attached 
kinesin took steps along the microtubule. Holding the bead in a 
laser trap turned out to be extremely important because it reduced 
random Brownian motion of the bead that otherwise would have 
obscured the underlying stepping motion. The bead also made the 
motion of the kinesin easy to observe, because laser light was 
strongly scattered by the bead. This created a bright signal that 
could be observed and measured. Individual kinesins, without a 
bead attached, would not be visible in a microscope. So the bead 
provided two features: It made the kinesin activity visible, and it 
provided a handle for manipulating and holding the motor using an 
optical trap. 

One obvious question is whether the physics of light put 
any fundamental limit on the ability to observe small stepping 
motions. The distance between adjacent tubulin dimers on a 
microtubule (see Figure 9.3d) is 8 nm, far below the resolution 
limit of light microscopy which under ideal imaging conditions 
is approximately 200 nm. Does this mean that nanometer-scale 
movements cannot be resolved? No! As discussed in Chapter 18, 
resolution only limits how close two objects can get before we 
cannot tell them apart in the microscope. The resolution limit 
does not affect how precisely we can determine the position of an 
object. Instead, what matters is how much light can be collected 
from the object through the objective lens of the microscope. If 
there is too little light, the image will be noisy, and this noise will 
dominate any position measurement. But with a silica bead illumi- 
nated by a laser, the light could be cranked up to a high enough 
level that there was hardly any noise in the image. 

In the experiments of Svoboda et al., position of the bead 
was measured using laser interferometry. In this method, a laser 
beam was split into two parallel beams of polarized light that were 
polarized in different directions. After the two beams were col- 
lected by the objective, they were recombined to regenerate the 
original laser beam. As a bead moved in front of one of the two 
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beams, it changed the recombined beam in a way that could be 
converted into bead position using sensitive electronics. As shown 
in FIGURE 1 when the motor began to pull, the bead moved, but 
rather than moving smoothly, it made sudden jumps from one 
position to another. The average size of these jumps was 8 nm, 
the exact spacing between tubulin dimers along a microtubule 
protofilament (see Figure 9.3d). 

In the first report, these steps were hard to make out over the 
background noise in the measurement, and measurement of the 
steps required careful mathematical analysis of the displacement 
versus time curve. More recent methods, however, have reduced 
the noise and make the steps even clearer. It is now possible to 
visualize single kinesin motors without having to attach them to 
a bead using fluorescent molecules attached to the protein.® The 
ability to visualize a single fluorophore and measure its position 
with nanometer accuracy has been the result of improvements 
in camera sensitivity and has resulted in further insights into the 
conformational changes that kinesin performs as it walks. 

Single molecule analysis of kinesin was critical for showing that 
a two-headed motor moves by a hand-over-hand mechanism, as 
illustrated in Figure 9.9b. In the hand-over-hand model, the two 
heads would take turns being in front as the dimeric motor walked 
along the microtubule. This mode of motion would resemble 
a human walking down a sidewalk, except for one important 
structural difference—humans have an axis of bilateral symmetry, 
so that our left foot is the mirror image of our right foot, whereas 
in motor protein dimers the two halves are identical and, in crystal 
structures, the motor domains are oriented 120 degrees relative 
to each other, close to the ideal 180-degree rotational symmetry 
expected for a homodimeric complex. Assuming that both motor 
domains undergo identical motions when they take a step, the 
dimeric motor would have to rotate by 180 degrees with each step, 
always in the same direction, hence the hand-over-hand mecha- 
nism was predicted to result in the dimer as a whole rotating around 
and around. An alternative type of motion had been proposed in 
which one head always stayed in front and the other always lagged 
behind, like a person limping. This model, known as the inch-worm 
mechanism, would not require the large rotational motions that the 
hand-over-hand model seemed to require. 

These two models were tested in experiments conducted by 
Gelles and co-workers at Brandeis University, who measured the 
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FIGURE 1 Single steps of kinesin as it walks along the microtubule. 
Displacement from the initial position increases as the motor drags the 
bead along. But instead of displacement increasing continuously, it 
jumps in 8 nm increments, which represent the kinesin stepping from 
one tubulin dimer to the next one along the microtubule. 


Source: Courtesy of Dr. Steven Block, Stanford University. 


rotation of the kinesin tail as the motor stepped.’ By anchoring 
the tail of kinesin to a slide and watching the movement of a free 
microtubule as the kinesin stepped along it, it was possible to 
show that the tail did not show anything like the large persis- 
tent rotation that was expected. Based on this result, Gelles and 
co-workers concluded that kinesin must use an inchworm-like 
mechanism for taking steps. To investigate this question more 
directly, Yildiz and Selvin® attached a fluorescent molecule to one 
of the two motor head domains: the part of the protein that actu- 
ally contacts the microtubule. In contrast to the movements of a 
bead or fluorophore attached to the motor tail, which showed 8 nm 
steps, individual heads showed 17 nm steps, which is the expected 
result if the motor walks by a hand-over-hand motion (Figure 9.9b), 
but which is not consistent with an inchworm mechanism. 

Exactly how the heads are able to take hand-over-hand steps 
without causing the dimer to rotate becomes a question of detailed 
conformation change around the linkers that join the heads to the 
tails. Clearly, the two heads must not move in exactly the same way, 
because if they did, the whole dimer would have to rotate. If the two 
heads undergo different conformational changes as they take their 
respective steps, then one would predict that the kinetics of stepping 
by the two heads might be slightly different. In fact, careful measure- 
ments at the single molecule level have dramatically confirmed this 
prediction by showing that the two heads step with reproducibly dif- 
ferent timing, so that even-numbered steps take a different length 
of time than odd-numbered steps.’ Such a measurement of differ- 
ing kinetics within one dimeric motor would have been absolutely 
impossible by bulk measurements and illustrate the power of single 
molecule analysis for understanding motor mechanics. 

These precision measurements of movement by single 
motor proteins during the past 20 years have taken the field of 
cell biology dramatically closer to the field of physics and have 
given direct experimental support to the animated diagrams that 
illustrate the conceptual view of how motor proteins move. 
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9.5 Microtubule-Organizing 
Centers (MTOCs) 


The function ofa microtubule within a living cell depends on its location 
and orientation, which makes it important to understand why a micro- 
tubule assembles in one place as opposed to another. When studied in 
vitro, the assembly of microtubules from o-tubulin dimers occurs in 
two distinct phases: a slow phase of nucleation, in which a small portion 
of the microtubule is initially formed, and a much more rapid phase of 
elongation. Unlike the case in vitro, nucleation of microtubules takes 
place rapidly inside a cell, where it occurs in association with a variety of 
specialized structures called microtubule-organizing centers (or 
MTOCs). The best studied MTOC is the centrosome. 


Centrosomes 


In animal cells, the microtubules of the cytoskeleton are typically nucle- 
ated by the centrosome, a complex structure that contains two barrel- 
shaped centrioles surrounded by amorphous, electron-dense 
pericentriolar material (or PCM) (FIGURE 9.12a,b). Centrioles are 
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cylindrical structures about 0.2 um in diameter and typically about 321 


twice as long. Centrioles contain nine evenly spaced blades, each of 
which contain three microtubules, designated the A, B, and C tubules. 
Only the A tubule is a complete microtubule (Figure 9.12a,b). The 
nine A tubules are connected to a central hub with nine spokes called 
the cartwheel. The nine-fold symmetry of the centriole results from 
the structure of the SAS-6 protein, which self-assembles into a nine- 
fold symmetric disc that forms the core of the cartwheel. New centri- 
oles typically form adjacent to, and at right angles with, pre-existing 
centrioles (Figure 9.12a,c). This process of centriole duplication 
means that most cells contain a pair of connected centrioles in which 
one centriole, termed the mother, is at least one cell cycle older than 
the other, which is termed the daughter. Daughter centrioles initially 
form at right angles to the mother centriole, but this precise spatial 
relation is subsequently lost. Further discussion of centriole duplica- 
tion will be found in Chapter 14. Centrioles recruit PCM to form a 
new centrosome in which the centrioles are embedded in a cloud of 
PCM (Figure 9.12d). As discussed below, the centrosome is the 
major site of microtubule initiation in animal cells and typically 
remains at the center of the cell’s microtubular network (Figure 9.12e). 
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FIGURE 9.12 The centrosome. (a) Schematic diagram of a centrosome showing the paired centrioles; the surrounding pericentriolar material (PCM); and 
microtubules emanating from the PCM, where nucleation occurs. (b) Electron micrograph of a cross section of a centriole showing the pinwheel arrangement 
of the nine peripheral fibrils, each of which consists of one complete microtubule and two incomplete microtubules. (c) Electron micrograph showing two 
pairs of centrioles. Each pair consists of a longer parental centriole and a smaller daughter centriole (arrow), which is undergoing elongation in this phase of 
the cell cycle (discussed in Section 14.2). (d) Electron micrographic reconstruction of a 1.0 M potassium iodide—extracted centrosome, showing the PCM to 
contain a loosely organized fibrous lattice. (e) Fluorescence micrograph of a cultured mammalian cell showing the centrosome (stained yellow by an 
antibody against a centrosomal protein) at the center of an extensive microtubular network. 


Source: (a) S. J. Doxsey et al., Cell 76:643, 1994, by permission of Cell Press. Cell by Cell Press. Reproduced with permission of Cell Press in the format 
journal via copyright clearance center; (b) Courtesy of B.R. Brinkley; (c) From Jerome B. Rattner and Stephanie G. Phillips, J. Cell Biol. 57:363, 1973, Fig. 4. 
Reproduced with permission of Rockefeller University Press; (d) From Bradely J. Schnackenberg et al., Courtesy Robert E. Palazzo, Proc. Nat'l. Acad. Sci. 
U.S.A. 95:9298, 1998. © 1998 National Academy of Sciences, U.S.A.; (e) From Toshiro Ohta, et al., J. Cell Biol. 156:88, 2002, Fig. 1. Reproduced with 


permission of the Rockefeller University Press. Courtesy of Ryoko Kuriyama. 
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FIGURE 9.13 Microtubule nucleation at the centrosome. Human MKN-1 cells were treated with nocodazole to depolymerize microtubules. The drug 


was then washed out to allow microtubules to regrow. Images show fluorescence microscopy of cells stained with antibodies recognizing microtubules at 


5 min and 10 min after nocodazole wash-out. At the five minute time point, microtubules can be see starting to all grow out from the same point in the 


cell—this is the centrosome. 


Source: From Fumoto K, Kadono M, Izumi N, Kikuchi A. 2009. Axin localizes to the centrosome and is involved in microtubule nucleation. EMBO Rep.10, 606-13. 


FIGURE 9.13a shows an early experiment that demonstrates the 
role of the centrosome in the initiation and organization of the 
microtubular cytoskeleton. The microtubules of a cultured animal 
cell were first depolymerized by incubating the cells in colcemid, a 
drug that binds to tubulin subunits and blocks their use by the cell. 
Microtubule reassembly was then monitored by removing the chem- 
ical, fixing cells at various time intervals, and treating the fixed cells 
with fluorescent anti-tubulin antibodies. Within a few minutes after 
removal of the inhibiting conditions, one or two bright fluorescent 
spots are seen in the cytoplasm of each cell. Within 15 to 30 minutes 
(Figure 9.13a), the number of labeled filaments radiating out of 
these foci increases dramatically. When these same cells are sec- 
tioned and examined in the electron microscope, the newly formed 
microtubules are found to radiate outward from a centrosome. Close 
examination shows that the microtubules do not actually penetrate 
into the centrosome and make contact with the centrioles, but termi- 
nate in the dense PCM that resides at the centrosome periphery. It is 
this material that initiates the formation of microtubules (see 
FIGURE 9.14c). 

Because centrosomes are sites of microtubule nucleation, the 
microtubules of the cytoskeleton are all polarized the same way: The 
minus end is associated with the centrosome, and the plus (or grow- 
ing) end is situated at the opposite tip (Figure 9.13b). The fraction of 
microtubules that remain associated with the centrosome is highly 
variable from one cell type to another. The centrosome of a nonpo- 
larized cell (e.g., a fibroblast) is typically situated near the center of 
the cell and tends to remain associated with the minus ends ofa large 
number of microtubules (as in Figure 9.12e). In contrast, many of 
the microtubules in a polarized epithelial cell are anchored by their 
minus ends at dispersed sites near the apical end of the cell as their 
plus ends extend toward the cell’s basal surface (Figure 9.1). Similarly, 
the axon of a nerve cell contains large numbers of microtubules that 
have no association with the centrosome, which is located in the 
neuron’s cell body. Many of these axonal microtubules are thought to 
form in conjunction with the centrosome, but are then severed from 
the MTOC and transported into the axon by motor proteins. Certain 
animal cells, including mouse oocytes, lack centrosomes entirely, yet 


they are still capable of forming complex microtubular structures, 
such as the meiotic spindle (as discussed in Chapter 14). Human 
patients with genetic defects in centrosome-associated proteins dis- 
play microcephaly, a reduction in brain size, presumably because the 
neuronal proliferation and migration are particularly sensitive to 
reductions in centrosome function. 


Basal Bodies and Other MTOCs 


Centrosomes are not the only MTOCs in cells. The outer microtu- 
bules in a cilium or flagellum are generated as outgrowths from the 
microtubules in a structure called a basal body, which resides at 
the base of the cilium or flagellum (see Figure 9.28). Basal bodies are 
identical in structure to centrioles, and in fact, basal bodies can turn 
into centrioles and vice versa. For example, the basal body that gives 
rise to the flagellum of a sperm cell is derived from a centriole that 
had been part of the meiotic spindle of the spermatocyte from which 
the sperm arose. Conversely, the sperm basal body typically becomes 
a centriole during the first mitotic division of the fertilized egg. Plant 
cells lack both centrosomes and centrioles, or any other type of obvi- 
ous MTOC. Instead, microtubules in a plant cell are nucleated 
around the surface of the nucleus and widely throughout the cortex 
(see FIGURE 9.15). 


Microtubule Nucleation 


Regardless of their diverse appearance, all MTOCs play similar roles 
in all cells: They control the number of microtubules, their polarity, 
the number of protofilaments that make up their walls, and the time 
and location of their assembly. In addition, all MTOCs share a com- 
mon protein component—a type of tubulin called y-tubulin, which 
was first discovered in the late 1980s by Berl Oakley and co-workers 
using genetic screens in the fungus Aspergillus. Unlike the a- and 
6-tubulins, which make up about 2.5 percent of the protein of a non- 
neural cell, y-tubulin constitutes only about 0.005 percent of the 
cell’s total protein. Fluorescent antibodies to y-tubulin stain all types 
of MTOCs, including the pericentriolar material of centrosomes 
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FIGURE 9.14 The role of y-tubulin in centrosome function. (a) Image shows a mitotic spindle in a human UsOS cell stained for y-tubulin (orange), tubulin 
(green), and DNA (blue). (6) A reconstruction based on electron micrographs of a portion of a centrosome that had been incubated with purified tubulin in 
vitro and then labeled with antibodies against y-tubulin. The antibodies were linked to gold particles to make them visible (as white dots) in the 
reconstruction. During the incubation with tubulin, the centrosome served as an MTOC to nucleate microtubules whose minus ends are seen to be labeled 
with clusters of gold, which are often arranged in semicircles or rings. The accompanying drawing shows the outline of the microtubule seen in the 
reconstruction. (c) A model for y-tubulin function during the assembly of microtubules. Nucleation begins as a-f-tubulin dimers bind to an open ring of 
y-tubulin molecules (brown), which are held in place by a number of non-tubulin proteins (green) that make up the large y-TuRC. Nucleation by the y-TuRC 
defines microtubule polarity with a ring of a-tubulin monomers situated at the minus end of the structure. 


Source: (a) Robert S. McNeil/Baylor College of Medicine/Science Source Images; (b) From Michelle Moritz, et al., Nature 378:639,1995, Fig. 1b. Reprinted by 
permission from Macmillan Publishers, Ltd. Photo provided courtesy of David A. Agard. 


(Figure 9.14a), suggesting that y-tubulin is a critical component in 
microtubule nucleation. This conclusion is supported by other 
studies. For example, mutations in y tubulin lead to reduced micro- 
tubule nucleation at MTOCs. 

To understand the mechanism of microtubule nucleation, 
researchers have focused on the structure and composition of the 
pericentriolar material (PCM) at the periphery of centrosomes. The 
insoluble fibers of the PCM (Figure 9.12d) are thought to serve as 
attachment sites for ring-shaped structures that have the same diam- 
eter as microtubules (25 nm) and contain y-tubulin. These ring- 
shaped structures were discovered when centrosomes were purified 
and incubated with gold-labeled antibodies that bound to y-tubulin. 
The gold particles were seen in the electron microscope to be clus- 
tered in semicircles or rings situated at the minus ends of microtu- 
bules (Figure 9.146). These are the ends of the microtubules that are 
embedded in the PCM of the centrosome where nucleation occurs. 
Similar y-tubulin ring complexes (or y-TuRCs) have been isolated 
from cell extracts and shown to nucleate microtubule assembly in 
vitro. Structural studies have shown that the y-TuRC is a helical 
array of y-tubulin (brown) subunits, each bound with a set of acces- 
sory proteins. The resulting open, lock-washer-like ring-shaped 
template has 13 gamma tubulins per turn on which the first row of 
aB-tubulin dimers assemble (Figure 9.14c). The number of proto- 
filaments in the microtubule is apparently dictated by the number of 
y tubulins around the ring. In this model, only the a-tubulin of the 
heterodimer can bind to a ring of y-subunits. Thus the y-TuRC 
determines the polarity of the entire microtubule and also forms 
a cap at its minus end, preventing the gain or loss of tubulin 
subunits. 


REVIEW [+ — 


1. What is the centrosome? What component of the 
centrosome nucleates microtubules and how does it 


dictate the number of protofilaments? 
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9.6 Microtubule Dynamics 


The microtubules of the cytoskeleton are dynamic polymers that are 
subject to shortening, lengthening, disassembly, and reassembly. 


The Dynamic Properties of Microtubules 


Although all microtubules appear quite similar morphologically, 
there are marked differences in their stability. Microtubules of the 
mitotic spindle or the cytoskeleton are extremely labile, that is, sensi- 
tive to disassembly. Microtubules of mature neurons are much less 
labile, and those of centrioles, cilia, and flagella are highly stable. 
These differences in microtubule stability are determined by micro- 
tubule interacting proteins including MAPs (Figure 9.4) which sta- 
bilize microtubules, proteins known as +TIPs, which bind to the 
plus-end of growing microtubules, and an enzyme called katanin, 
named after the samurai sword, that severs microtubules into shorter 
pieces. Microtubule stability is also regulated by posttranslational 
modifications to the tubulin subunits such as the covalent attach- 
ment of multiple glutamates onto the C-terminus of tubulin. Living 
cells can be subjected to a variety of artificial treatments that lead to 
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FIGURE 9.15 Four major arrays of microtubules present during the cell cycle of a plant cell. The organization of the microtubules at each stage are 


described in the text. 


Source: R.H. Goddard et al., Plant Physiol. 104:2, 1994. Plant Physiology by American Society of Plant Physiologists copyright 1994. With permission of 
American Society of Plant Biologists in the format textbook via copyright clearance center. 


the disassembly of labile cytoskeletal microtubules without disrupt- 
ing other cellular structures. Disassembly can be induced by cold 
temperature; hydrostatic pressure; elevated Ca”* concentration; and 
a variety of chemicals, including colchicine, vinblastine, vincristine, 
and nocodazole. The drug taxol stops the dynamic activities of 
microtubules by a very different mechanism. Taxol binds to the 
microtubule polymer, inhibiting its disassembly, thereby preventing 
the cell from assembling new microtubular structures as required. 
Many of these compounds, including taxol, are used in chemother- 
apy against cancer because they preferentially kill tumor cells. 

The lability of cytoskeletal microtubules reflects the fact that they 
are polymers formed by the noncovalent association of protein build- 
ing blocks. The microtubules of the cytoskeleton are normally subject 
to depolymerization and repolymerization as the requirements of the 
cell change from one time to another. The dynamic character of the 
microtubular cytoskeleton is well illustrated by plant cells. If a typical 
plant cell is followed from one mitotic division to the next, four dis- 
tinct arrays of microtubules appear, one after another (FIGURE 9.15). 


1. During most of interphase, the microtubules of a plant cell are 
distributed widely throughout the cortex, as depicted in 
Figure 9.15, stage 1. A search for y-tubulin shows this nucleation 
factor to be localized along the lengths of the cortical microtu- 
bules, suggesting that new microtubules might form directly on 
the surface of existing microtubules. This idea is supported by 
studies of tubulin incorporation in living cells (FIGURE 9.16a) 
and by in vitro assays (Figure 9.16b) that show newly formed 
microtubules branching at an angle off the sides of preexisting 
microtubules. Once formed, the daughter microtubules are likely 
severed from the parent microtubule and incorporated into the 
parallel bundles that encircle the cell (Figures 9.6a, 9.15). 


2. As the cell approaches mitosis, the microtubules disappear from 
most of the cortex, leaving only a single transverse band, called 
the preprophase band, that encircles the cell like a belt 
(Figure 9.15, stage 2). The preprophase band marks the site of the 
future division plane. 


3. As the cell progresses into mitosis, the preprophase band is lost 
and microtubules reappear in the form of the mitotic spindle 
(Figure 9.15, stage 3). 


4. After the chromosomes have been separated, the mitotic spindle dis- 
appears and is replaced by a bundle of microtubules called the phrag- 
moplast (Figure 9.15, stage 4), which plays a role in the formation of 
the cell wall that separates the two daughter cells (see Figure 14.38). 
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FIGURE 9.16 Nucleation of plant cortical microtubules. (a) The 
micrographs show a portion of a live cultured tobacco cell that is expressing 
fluorescently labeled GFP-tubulin. During the period of observation, an 
existing microtubule of the cortex nucleates the assembly of a daughter 
microtubule, which grows outward forming a Y-shaped branch. The end of a 
newly formed microtubule is indicated by the arrowhead, the branchpoint 
by the arrow. (b) Electron micrograph of a microtubule with two daughter 
microtubules branching from its surface. The branched microtubules were 
assembled in a cell-free system containing tubulin subunits. Bar, 10m. (c) A 
schematic model showing how new microtubules are nucleated at the sites 
of y-tubulin present on the surface of an existing microtubule. 


Source: (b) From Takashi Murata, et al., Nature Cell Biol. 7:961, 2005. 
Reprinted by permission from Macmillan Publishers, Ltd. 


FIGURE 9.17 Microtubules assembled in the test tube. Electron micro- 


graph of frozen, unfixed microtubules that had polymerized in vitro. The 
individual protofilaments and their globular tubulin subunits are visible. Note 
that the middle of the three microtubules contains only 11 protofilaments. 
Source: Courtesy of R.H. Wade, Institut de Biologie Structurale, Grenoble, 
France. 


These dramatic changes in the spatial organization of microtu- 
bules are thought to be accomplished by a combination of two sepa- 
rate mechanisms: (1) the rearrangement of existing microtubules and 
(2) the disassembly of existing microtubules and reassembly of new 
ones in different regions of the cell. In the latter case, the microtubules 
that make up the preprophase band are formed from the same 
subunits that a few minutes earlier were part of the cortical array or, 
before that, the phragmoplast. 


The Underlying Basis of 
Microtubule Dynamics 


Insight into factors that influence the rates of microtubule assembly 
and disassembly came initially from studies carried out in vitro. The 
first successful approach to the in vitro assembly of microtubules was 
taken in 1972 by Richard Weisenberg of Temple University, who found 
that microtubules could be disassembled and reassembled over and 
over simply by lowering and raising the temperature of the incubation 
mixture. FIGURE 9.17 shows three microtubules that were assembled 
in the test tube from purified tubulin. One of the three microtubules in 
this figure contains only 11 protofilaments (as indicated by its thinner 
diameter). It is not unexpected that microtubules assembled in vitro 
might have abnormal protofilament numbers because they lack a 
proper template (Figure 9.14c) normally provided by the y-tubulin 
ring complexes in vivo. In vitro assembly of microtubules is aided 
greatly by the addition of MAPs or by pieces of microtubules or struc- 
tures that contain microtubules (FIGURE 9.18), which serve as tem- 
plates for the addition of free subunits. In these in vitro studies, tubulin 
subunits are added primarily to the plus end of the existing polymer. 
Early in vitro studies established that GTP is required for microtu- 
bule assembly. Assembly of tubulin dimers requires that a GIP mole- 
cule be bound to the B-tubulin subunit.’ As it turns out, B-tubulin is not 


*A molecule of GTP is also bound to the a-tubulin subunit, but it is not exchangeable 
and it is not hydrolyzed following subunit incorporation. The positions of the guanine 
nucleotides in the af-tubulin heterodimer are shown in Figure 9.3c. 
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FIGURE 9.18 The assembly of tubulin onto an existing microtubular 
structure. Electron micrograph showing the in vitro assembly of brain 
tubulin onto the plus ends of the microtubules of an axoneme from a 
Chlamydomonas flagellum. 


Source: Courtesy of L.l. Binder and Joel L. Rosenbaum. 


only a structural protein, it is also an enzyme, a GTPase. GTP hydrolysis 
is not required for the actual incorporation of the dimer onto the end of 
a microtubule. Rather, the GTP is hydrolyzed to GDP shortly after the 
dimer is incorporated into a microtubule, and the resulting GDP 
remains bound to the assembled polymer. After a dimer is released 
from a microtubule during disassembly and enters the soluble pool, the 
GDP is replaced by a new GTP. This nucleotide exchange “recharges” 
the dimer, allowing it to serve once again as a building block for polym- 
erization. Because it includes GTP hydrolysis, assembly of a microtu- 
bule is not an inexpensive cellular activity. Why has such a costly 
polymerization pathway evolved? To answer this question, it is useful to 
consider the effect of GTP hydrolysis on microtubule structure. When 
a microtubule is growing, the plus end is seen under the electron micro- 
scope as an open sheet to which GTP-dimers are added (step 1, 
FIGURE 9.19a, 9.19b). During times of rapid microtubule growth, tubu- 
lin dimers are added more rapidly than their GTP can be hydrolyzed. 
The presence of a cap of tubulin-GTP dimers at the plus ends of the 
protofilaments is thought to favor the addition of more subunits and the 
growth of the microtubule. However, microtubules with open ends, as 
in step 1, Figure 9.19a, are thought to undergo a spontaneous reaction 
that leads to closure of the tube (steps 2 and 3). In this model, tube clo- 
sure is accompanied by the hydrolysis of the bound GTP, which gener- 
ates subunits that contain GDP-bound tubulin. GDP-tubulin subunits 
have a different conformation from their GTP-tubulin precursors and 
are less suited to fit into a straight protofilament. The resulting mechan- 
ical strain destabilizes the microtubule. The strain energy is released as 
the protofilaments curl outward from the plus end of the tubule and 
undergo catastrophic depolymerization (Figure 9.19a, step 4; 9.19c). 
Thus, it would appear that GTP hydrolysis is a fundamental component 
of the dynamic quality of microtubules. The strain energy stored in a 
microtubule as a result of GTP hydrolysis makes the microtubule inher- 
ently unstable and—in the absence of other stabilizing factors such as 
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FIGURE 9.19 The structural cap model of dynamic instability. According 
to the model, the growth or shrinkage of a microtubule depends on the 
state of the tubulin dimers at the plus end of the microtubule. Tubulin-GTP 
dimers are depicted in red. Tubulin-GDP dimers are depicted in blue. In a 
growing microtubule (step 1), the tip consists of an open sheet containing 
tubulin-GTP subunits. In step 2, the tube has begun to close, forcing the 
hydrolysis of the bound GTP. In step 3, the tube has closed to its end, leaving 
only tubulin-GDP subunits. GDP-tubulin subunits have a curved conformation 
compared to their GTP-bound counterparts, which makes them less able to fit 
into a straight protofilament. The strain resulting from the presence of GDP- 
tubulin subunits at the plus end of the microtubule is released as the 
protofilaments curl outward from the tubule and undergo catastrophic 
shrinkage (step 4). (6) Cryo-EM image of the growing end of a microtubule 
showing a curved, open sheet. (c) Cryo-EM image of the shrinking end of a 
microtubule showing outwardly curved protofilaments. 

Source: (a) A.A. Hyman and E. Karsenti, Cell 84:402, 1996, by permission of 
Cell Press. Cell by Cell Press. Reproduced with permission of Cell Press in the 
format journal via copyright cle center; (b) Courtesy of Thomas Meuller- 
Reichert, T.U. Dresden, Medical Theoretical Center (MTZ) and Anthony A. 
Hyman, Max Planck Institute of Molecular Cell Biology and Genetics. 


MAPs—capable of disassembling soon after its formation. Microtubules 
can shrink remarkably fast, especially in vivo, which allows a cell to 
disassemble its microtubular cytoskeleton very rapidly. 

The dynamic character of the microtubular cytoskeleton inside a 
cell can be revealed by expressing GFP-tagged tubulin in living cells or 
injecting tubulin with chemical tags that allow it to be visualized in 
fixed cells (FIGURE 9.20). If individual microtubules are observed 
under a fluorescence microscope, they appear to grow slowly for a 
period of time and then shrink rapidly and unexpectedly, as illustrated 


by the microtubule being followed in FIGURE 9.21. Because the switch 
from growth to shrinkage (an event termed catastrophe) occurs with 
high frequency in vivo, most microtubules disappear from the cell ina 
matter of minutes and are replaced by new microtubules that grow out 
from the centrosome. If particularly small amounts of fluorescently 
labeled protein are injected, the cytoskeletal filaments are no longer 
uniformly labeled as in Figure 9.4, but instead contain irregularly 
spaced fluorescent speckles as seen in Figure 9.21. These fluorescent 
speckles serve as fixed markers to follow dynamic changes in length or 
orientation of the filament. This technique, known as flourescence 
speckle microscopy allows microtubule dynamics to be measured 
with high precision inside living cells. 

In 1984, Timothy Mitchison and Marc Kirschner of the 
University of California, San Francisco, reported on the properties of 
individual microtubules and proposed that microtubule behavior in 
vivo could be explained by a phenomenon they termed dynamic 
instability. Dynamic instability explains the observation (1) that 
growing and shrinking microtubules can coexist in the same region 
of a cell and (2) that a given microtubule can switch back and forth 
unpredictably (stochastically) between growing and shortening 
phases, as in Figure 9.21. Dynamic instability is an inherent property 
of the microtuble itself and, more specifically, of the plus end of the 
microtubule. As indicated in Figure 9.19, it is the plus end where 
subunits are added during growth and lost during shrinkage. This 
does not mean that dynamic instability cannot be influenced by external 
factors. For example, cells contain a diverse array of proteins (called 
+TIPs) that bind to the dynamic plus ends of microtubules 
(FIGURE 9.22). Some of these +TIPs regulate the rate of the 


FIGURE 9.20 Microtubule dynamics in living cells. This cultured 
fibroblast was injected with a small volume of tubulin that had been 
covalently linked to biotin, a small molecule whose location in the cell is 
readily determined using fluorescent anti-biotin antibodies. Approximately 
one minute following injection, the cells were fixed, and the location of 
biotinylated tubulin that had been incorporated into insoluble microtubules 
was determined. It is evident from this fluorescence micrograph that, even 
during periods as short as one minute, tubulin subunits are widely 
incorporated at the growing ends of cytoskeletal microtubules. 


Source: From Marc Kirschner, J. Cell Biol. 102, Cover of No. 3, 1986; 
Reproduced with permission of the Rockefeller University Press. 


FIGURE 9.21 Dynamic instability. This series of photo- 
graphs shows the changes in length of a single microtubule in 
the growth cone of a neuron. The cell was microinjected with 
fluorescently labeled tubulin at sufficiently low concentration 
to produce green fluorescent speckles along the length of 
microtubules. As discussed on page 326, such speckles 
provide fixed reference points that can be followed over time. 
Each of the yellow horizontal lines connects one of these 
speckles from one time point to another. The blue line 2 
indicates the approximate plus end of the microtubule at 
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various time points. From 0 to about 200 seconds, the microtubule undergoes a gradual addition of tubulin subunits at its plus end. Then, from about 


200 to 240 seconds the microtubule experiences rapid shrinkage. 


Source: From Andrew W. Schaefer, Nurul Kabir and Paul Forscher, Yale University. J. Cell Biol.158:145, 2002, Fig. 5. Reproduced with permission of the 


Rockefeller University Press. 


FIGURE 9.22 Binding of a microtubule plus-end tracking protein (+TIP) 
The micrograph shows a live image of a human lung fibroblast expressing 
mCherry-labeled tubulin (generating red microtubules) and GFP-labeled 
EB3, a +TIP (appearing green). EB3 is seen to bind to the plus ends of the 
microtubules. 


Source: From Anna Akhmanova and Michel O. Steinmetz, J. Cell Science 
123, 3415, 2010; by permission of the Company of Biologists. Courtesy 
Anna Akhmanova and Ilya Grigoriev, Utrecht University, The Netherlands. 


microtubule’s growth or shrinkage or the frequency of interconver- 
sion between the two phases. Other +TIPs mediate the attachment of 
the plus end of the microtubule to a specific cellular structure, such 
as the kinetochore of a mitotic chromosome during cell division or 
the actin cytoskeleton of the cortex during vesicle transport. Once a 
subcellular structure is attached to the plus end of a microtubule, the 
dynamic properties of the microtubule can exert force on that struc- 
ture. Polymerization of a microtubule can push on an attached object, 
whereas depolymerization of a microtubule can pull on an attached 
object. For instance, this type of pulling force plays a major role in 
the segregation of chromosomes during cell division (Section 14.2). 
Dynamic instability provides a mechanism by which the plus 
ends of microtubules can rapidly explore the cytoplasm for appro- 
priate sites of attachment. Attachment temporarily stabilizes micro- 
tubules and allows the cell to build the complex cytoskeletal networks 
discussed in this chapter. Dynamic instability also allows cells to 
respond rapidly to changing conditions that require remodeling of 
the microtubular cytoskeleton. One of the most dramatic examples 
of such remodeling occurs at mitosis when the microtubules of the 
cytoskeleton are disassembled and remodeled into a bipolar mitotic 


spindle. This reorganization is associated with a marked change in 
microtubule stability; microtubules in interphase cells have half-lives 
that are 5 to 10 times longer than microtubules in mitotic cells. 
Unlike the microtubules of the mitotic spindle or cytoskeleton, the 
microtubules of the organelles to be discussed below lack dynamic 
activity and, instead, are highly stable. 


REVIEW | 


1. What is the role of GTP in the assembly of 
microtubules? What is meant by the term dynamic 


instability? What role does it play in cellular activity? 
eo 


9.7 Structure and Function 
of Cilia and Flagella 


Anyone who has placed a drop of pond water under the lens of a 
microscope and tried to keep a protozoan from swimming out of the 
field of view can appreciate the activity of cilia and flagella. Cilia and 
flagella are hairlike, sometimes motile organelles that project from the 
surface of a variety of eukaryotic cells. Many cells in the human body 
contain a single non-motile cilium (FIGURE 9.23a). For example inside 
the kidney, non-motile cilia act as flow sensors for fluid flow (Figure 
9.23a). In other tissues, large tufts of motile cilia create flows of fluid 
(Figure 9.23c). Bacteria also possess structures referred to as flagella, 
but prokaryotic flagella are simple filaments that bear no evolution- 
ary relationship to their eukaryotic counterparts (see Figure 1.14). 
The discussion that follows pertains only to the eukaryotic organelles. 

Cilia and flagella are two words that refer to exactly the same 
structure in different contexts, much like the terms “evening star” 
and “morning star,’ which both refer to the planet Venus but at dif- 
ferent times of day. Based largely on historical convention, most 
biologists use one or the other term based on the type of cell from 
which the organelle projects and its pattern of movement. According 
to this distinction, a cilium can be likened to an oar as it moves the 
cell in a direction perpendicular to the cilium itself. In its power 
stroke, the cilium is maintained in a rigid state (FIGURE 9.24a) as it 
pushes against the surrounding medium. In its recovery stroke, the 
cilium becomes flexible, offering little resistance to the medium. 
Cilia tend to occur in large numbers on a cell’s surface, and their 
beating activity is usually coordinated (Figure 9.24b). In multicellu- 
lar organisms, cilia move fluid and particulate material through vari- 
ous tracts. In humans, for example, the ciliated epithelium lining the 
respiratory tract propels mucus and trapped debris away from the 
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FIGURE 9.23 Cilia in the human body. (a) Immunofluorescence image 
showing a single primary cilium (green) projecting from the apical surface of 
each cell in a culture of kidney collecting duct cells. The intercellular 
junctions between the epithelial cells are shown in red. (b) Cilia visualized 

in a developing mouse embryo by scanning electron microscopy. 

c) Tufts of motile cilia lining the trachea. 

Source: (a) Courtesy of Wallace Marshall; (b) Courtesy of Wallace Marshall; 
c) Charles Daghlian/Dartmouth College Electron Microscope Facility. 


FIGURE 9.24 Ciliary beat. (a) The various stages in the beat of a cilium. 

b) The cilia on the surface of a ciliated protozoan beat in metachronal waves 
in which the cilia in a given row are in the same stage of the beat cycle, but 
those in adjacent rows are in a different stage. RS, cilia in recovery stroke; ES, 
cilia in effective power stroke. (c) Human bronchial epithelial cells in which the 
cilia at the luminal surface are labeled red and the nuclei are labeled blue. 
Source: (a) Courtesy G.A. Horridge and S.L. Tamm, Science163:818, 1969. 
Reprinted with permission from AAAS; (b) From G.A. Horridge and S.L. Tamm, 
Science 163:818, 1969. Reprinted with permission from AAAS; (c) From Seiko F. 
Okada, et al., J. Biol. Chem. 281:22995, 2006. Fig. 2c, © 2006. The American 
Society for Biochemistry and Molecular Biology. Photo courtesy Seiko F. Okada. 


lungs (Figure 9.24c). Not all cilia are motile: Almost all cells of the 
human body contain a single nonmotile cilium (called a primary 
cilium) that is thought to have a sensory function, monitoring the 
mechanical and chemical properties of extracellular fluids (Figure 
9.23a). Primary cilia contain receptors and channels that allow them 
to serve as sensory antennas. Specialized sensory cilia include cilia 
that sense fluid flow in the kidney, olfactory cilia in the nose that 
sense smell, and the outer segments of rod and cone cells in the ret- 
ina, which are in fact highly specialized primary cilia. Given the 
ubiquity of cilia and their many roles in sensation and motility, 
defects in cilia can lead to a range of human diseases, as discussed in 
the accompanying Human Perspective. 

Flagella typically occur singly or in pairs and exhibit a variety of 
different beating patterns (waveforms), depending on the cell type. 
For example, the single-celled alga pictured in FIGURE 9.25a pulls 
itself forward by waving its two flagella in an asymmetric manner 
that resembles the breaststroke of a human swimmer (Figure 9.25b). 
This same algal cell can also push itself through the medium using a 
symmetric beat resembling that of a sperm. The degree of asymme- 
try in the pattern of the beat in the algal cell is regulated by the inter- 
nal calcium ion concentration. 


Structure of Cilia and Flagella 


An electron micrograph of a cross section of a cilium or flagellum is 
one of the most familiar images in cell biology (FIGURE 9.26a). The 
entire ciliary or flagellar projection is covered by a membrane that is 
continuous with the plasma membrane of the cell. The core of the 
cilium, called the axoneme, contains an array of microtubules that 
runs longitudinally through the entire organelle. With rare excep- 
tions, the axoneme of a motile cilium or flagellum consists of nine 
peripheral doublet microtubules surrounding a central pair of single 


microtubules. This same microtubular structure, which is known as 329 


the 9 + 2 array, is seen in axonemes from protists to mammals and 
serves as another of the many reminders that all living eukaryotes 
have evolved from a common ancestor. All microtubules of the axo- | 
neme have the same polarity: Their plus ends are at the tip of the | 
projection and their minus ends at the base. Each peripheral doublet | 
consists of one complete microtubule, the A tubule, and one incom- | 
plete microtubule, the B tubule, the latter containing 10 or 11 subu- 
nits rather than the usual 13. 

The basic structure of the axoneme was first described in 1952— 
in lower plants by Irene Manton and in animals by Don Fawcett and 
Keith Porter. (Flagella were lost during the evolution of higher plants 
but still are present in the sperm of lower plants such as moss and 
ferns.) As the resolving power of electron microscopes improved, cer- | 
tain of the less obvious components became visible (Figure 9.26)). | 
The central tubules were seen to be enclosed by the central sheath, | 
which is connected to the A tubules of the peripheral doublets by a set | 
of radial spokes. The doublets are connected to one another by an 
interdoublet bridge composed of an elastic protein-based linkage | 
called the nexin link. The nexin link is now known to be part of a 
biochemical complex containing proteins that regulate dynein activity, 
called the nexin-dynein regulatory complex or N-DRC. Of particular 
importance was the observation that a pair of “arms’—an inner arm 
and an outer arm—project from the A tubule (Figure 9.26a,b). These 
arms are composed of axonemal dyneins, protein complexes related to 
cytoplasmic dynein that are responsible for generating the bending 
motion in cilia and flagella. A longitudinal section, that is, one cut 
through the axoneme parallel to its long axis, reveals the continuous 
nature of the microtubules and the discontinuous nature of the other 
elements (FIGURE 9.27a,b). 

As noted earlier, a cilium or flagellum emerges from a basal body 
(FIGURE 9.28a) similar in structure to the centriole of Figure 9.12a. If 
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FIGURE 9.25 Eukaryotic flagella. (a) The unicellular alga Chlamydomonas reinhardtii. The two flagella appear green after binding a fluorescent antibody 
directed against a major flagellar membrane protein. The red color results from autofluorescence of the cell’s chlorophyll. Unlike many flagellated organisms, 
Chlamydomonas does not need its flagella to survive and reproduce, so that mutant strains exhibiting various types of flagellar defects can be cultured. 

(b) The forward movement of Chlamydomonas is accomplished by an asymmetric waveform that resembles the breast stroke. A different type of flagellar 


waveform is shown in Figure 9.35. 


Source: Biophoto Associates / Science Source Images. 
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FIGURE 9.26 The structure of a ciliary or flagellar axoneme. (a) Cross section through a sperm axoneme. The peripheral doublets are seen to consist of 
a complete and an incomplete microtubule, whereas the two central microtubules are complete. The dynein arms are seen as “fuzzy” projections from the 
wall of the complete microtubule. The molecular structure of these arms is discussed in a later section. (b) Schematic diagram of an axoneme of a protist 
showing the structure of the microtubular fibers, the two types of dynein arms (three-headed outer arms and two-headed inner arms), the nexin links 
between the doublets, the central sheath surrounding the central microtubules, and the radial spokes projecting from the outer doublets toward the central 
sheath. A more detailed and complex view of the molecular architecture of the axoneme has been obtained with the application of cryoelectron 
tomography (Section 18.2); see PNAS 102:15889, 2005, Science 313:944, 2006, and J. Cell Biol. 187: 921, 2009. (Note: The cilia and flagella of animals 
typically contain two-headed outer dynein arms.) Source: (a) Courtesy Lewis Tilney and K. Fujiwara 
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FIGURE 9.27 Longitudinal view of an axoneme. (a) Electron micrograph of a median longitudinal section of a straight region of a cilium. The radial spokes 
are seen joining the central sheath with the A microtubule of the doublet. (b) Schematic diagram of a longitudinal section of a flagellar doublet. Radial spokes 
emerge in groups of three, which repeat (at 96 nm in this case) along the length of the microtubule. Outer dynein arms are spaced at intervals of 24 nm. 


Source: From Fred D. Warner and Peter Satir, J. Cell Biol. 63:41, 1974, Fig. 4. Reproduced with permission of the Rockefeller University Press. 
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FIGURE 9.28 Basal bodies and axonemes. (a) Electron micrograph of a longitudinal section through the basal bodies of a number of cilia at the apical 
surface of epithelial cells of a rabbit oviduct. These basal bodies arise from centrioles that are generated in the cytoplasm and migrate to sites beneath the 
plasma membrane. (b) Schematic diagram of the structural relationship between the microtubules of the basal body and the axoneme of a cilium or 


flagellum. 
Source: (a) Courtesy of R. G. W. Anderson. 


the basal body arises from a centriole that had already been present 
during the preceding cell division, the cilium that it makes is called the 
“primary cilium.” These are usually non-motile as discussed above. In 
some cells that make large numbers of cilia, the additional cilia are 
nucleated by new basal bodies that form after division, and hence are 
called secondary cilia. In such cases, the basal bodies do not form by 
the typical process of centriole duplication, but rather assemble in 
large amorphous structures called deuterosomes. The A and B tubules 
of the basal body elongate to form the doublets of the cilium or flagel- 
lum (Figure 9.280). Ifa cilium or flagellum is sheared from the surface 
of a living cell, a new organelle is regenerated as an outgrowth of the 
basal body. As with other microtubular structures, the growth of an 
axoneme occurs at the plus (outer) ends of its microtubules. How does 
a cell organize and maintain a construction site at the outer tip of an 
axoneme situated micrometers from the body of the cell where the 
synthesis of building materials takes place? 


Growth by Intraflagellar Transport 


When cilia and flagella assemble, all growth is restricted to their dis- 
tal tips, and this requires a way for proteins made inside the cell to 
get out to the tip where assembly is taking place. Biologists had been 
watching the flagella of living cells for over a hundred years, but it 
wasnt until 1993 that Joel Rosenbaum and colleagues at Yale 
University observed the movement of particles in the space between 
the peripheral doublets and the surrounding plasma membrane and 
named this movement intraflagellar transport (IFT, FIGURE 9.29). 
The moving particles seen in IFT are complexes of 20 proteins, 
known as IFT proteins, which assemble into a protein complex 
called the IFT particle. These IFT particles then line up to form lin- 
ear arrays called IFT trains, and the trains are moved out to the tip of 
the cilium by a plus-end directed kinesin motor, kinesin-2. IFT 


trains are then brought back to the cell body by a cytoplasmic dynein 
complex. The IFT particles carry cargo proteins such as tubulin, 
required for building cilia, out to the tip for assembly, and inhibition 
of IFT prevents assembly of cilia and flagella. Mutations in genes that 
encode IFT proteins have been important tools in testing the 
function of cilia in development and physiology because they result 
in complete loss of cilia. 


The Mechanism of Ciliary 
and Flagellar Locomotion 


As will be discussed later in this chapter, the contraction of muscle 
results from the sliding of actin filaments over adjacent myosin fila- 
ments. The sliding force is generated by ratchetlike cross-bridges 
that reside in the head of the myosin molecule. With the muscle 
system as a model, it was proposed that ciliary movement could be 
explained by the sliding of adjacent microtubular doublets relative to 
one another. According to this proposal, the dynein arms pictured in 
FIGURE 9.30 act as swinging cross-bridges that generate the forces 
required for ciliary or flagellar movement. The sequence of events is 
shown in FIGURE 9.31. 

In the intact axoneme, the stem of each dynein molecule (with 
its associated intermediate and light chains) is tightly anchored to 
the outer surface of the A tubule, with the globular heads and stalks 
projecting toward the B tubule of the neighboring doublet. In step 1 
of Figure 9.31, the dynein arms anchored along the A tubule of the 
lower doublet attach to binding sites on the B tubule of the upper 
doublet. In step 2, the dynein molecules undergo a conformational 
change, or power stroke, that causes the lower doublet to slide toward. 
the basal end of the upper doublet. This conformational change in a 
dynein heavy chain is depicted in Figure 9.30b-d. In step 3 of 
Figure 9.31, the dynein arms have detached from the B tubule of the 
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FIGURE 9.29 Intraflagellar transport (IFT). Electron micrograph of a 
longitudinal section of a Chlamydomonas flagellum showing two rows of 
protein particles (bounded by arrowheads) situated between the outer 
doublet microtubules and the flagellar plasma membrane. As illustrated in 
the insets, each row of particles and its associated cargo of axonemal 
proteins are moved along the outer doublet microtubule by a motor 
protein, either cytoplasmic dynein 2 if they are moving toward the flagellar 
base or kinesin 2 if the particles are moving toward the tip of the flagellum. 
Source: Micrograph from Keith G. Kozminski, et al., J. Cell Biol. 131:1520, 
1995, Fig. 1. Reproduced with permission of the Rockefeller University 
Press. Courtesy of Joel L. Rosenbaum. 


upper doublet. In step 4, the arms have reattached to the upper dou- 
blet so that another cycle can begin. (An electron micrograph of an 
axoneme showing the dynein arms from one doublet attached to the 
adjacent doublet is shown in Figure 18.21.) 

The nexin link component of the N-DRC forms an elastic link- 
age that connects adjacent doublets (Figure 9.26b). These nexin 
bridges play an important role in ciliary/flagellar movement by lim- 
iting the extent that adjacent doublets can slide over one another. 
The resistance to sliding provided by the nexin bridges causes the 
axoneme to bend in response to the sliding force. Sliding on one side 
of the axoneme alternates with sliding on the other side so that a part 
of the cilium or flagellum bends first in one direction and then in the 
opposite direction (FIGURE 9.32). This requires that, at any given 
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FIGURE 9.30 A model of the structure and function of flagellar/ciliary 
dynein. (a) Platinum replica of a rotary-shadowed, flagellar outer-arm dynein 
molecule prepared by rapid-freeze, deep-etch electron microscopy. Each of 
the three heavy chains forms a prominent globular head with an extension 
(stalk) that functions in linking the dynein arm to the neighboring doublet. An 
interpretive drawing is shown on the right. (b-d) High-resolution micrographs 
(with b’, c’ interpretive models below) of a flagellar dynein heavy chain 
before (b) and after (c) its power stroke. The motor domain, which consists of 
a number of modules arranged in a wheel, is seen to have rotated, which has 
caused the stalk to move in a leftward direction. The image shown in (d) is a 
composite of molecules showing the position of the stalk before and after the 
power stroke. The power stroke causes the microtubule bound to the tip of 
the stalk to slide 15 nm in a leftward direction relative to the motor domain. In 
vitro motility assays suggest that dynein may be capable of “shifting gears” 
so that it can take shorter, more powerful steps as it moves a load of 
increasing mass. (Note: An alternate model of dynein function is discussed in 
Science 322:1647, 2008.) 


Source: (a) Courtesy of John E. Heuser and Ursula W. Goodenough; 
(b) From Stan A. Burgess, M.L. Walker, M. Sakakiara, PJ. Knight,and 
K. Oiwa, Nature 421:717. 2003, Fig 4. Reprinted by permission from 
Macmillan Publishers, Ltd. 
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FIGURE 9.31 Schematic representation of the forces that drive ciliary 
or flagellar motility. The steps are described in the text. A true representa- 
tion of the power stroke is shown in the previous figure. 


time, the dynein arms on one side of the axoneme are active, while 
those on the other side are inactive. As a result of this difference in 
dynein activity, the doublets on the inner side of the bend (at the top 
and bottom of Figure 9.32) extend beyond those on the opposite side 
of the axoneme. A major open question in the cilia field is what 
mechanism coordinates the motion of thousands of dynein arms 
during the ciliary beating cycle. The central pair contributes to this 
coordination by signaling to the dynein arms through the radial 
spokes; however, motility is still possible in mutant flagella that lack 
the central pair, arguing that the dynein arm activities can self- 
organize in some manner which remains be to determined. 


FIGURE 9.32 The sliding-microtubule mechanism of ciliary or flagellar 
motility. Schematic diagram of the sliding of neighboring microtubules 
relative to one another. When the cilium is straight, all the outer doublets 
end at the same level (center). Ciliurn bending occurs when the doublets 

on the inner side of the bend slide beyond those on the outer side (top and 
bottom). The movement of the dynein arms responsible for the sliding of 
neighboring microtubules was shown in the previous figures. 

Source: From D. Voet and J.G. Voet, Biochemistry, 2E, p.1258; Copyright 
1995 John Wiley and Sons, Inc. Reprinted by permission of John Wiley 

and Sons, Inc. 
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1. Contrast the sensory and motile functions of cilia and 
flagella. Describe how failures of these different ciliary 
functions contribute to human diseases. 
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9.8 THE HUMAN PERSPECTIVE 


The Role of Cilia in Develooment and Disease 


We take it for granted that diseases result from failure of specific 
organs within our body, but what about failure of specific organelles 
within our cells? A recently discovered class of diseases caused by 
organelle dysfunction is the ciliopathies—a set of seemingly unre- 
lated disease symptoms that all result from defects in cilia. Many 
of these diseases were known for a long time, but nobody had 
guessed that they might have anything to do with cilia. In fact, for 
a long time it was thought that most cilia in the body did not play 


any important functions and were perhaps purely vestigial. It may 
seem surprising that an organelle present on almost all cells of 
the body could be thought of as unimportant or vestigial, but this 
idea arose because the cilia on most cells are non-motile, and it 
was assumed that motility was the key function of cilia. 

The key breakthrough that unlocked the field of cilia dis- 
ease came from a basic discovery in cell biology, namely the 
identification of the intraflagellar transport system as discussed 
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in the main text. When researchers studying flagellar assembly in 
Chlamydomonas began to determine the protein sequence of 
IFT proteins isolated from Chlamydomonas flagella, they made 
the startling discovery that one of the key IFT proteins, IFT88, 
was encoded by a gene already well known in the kidney disease 
world. This gene, originally named Tg/37, was found in arandom 
mouse mutagenesis project, and mutations in Tg/37 caused 
autosomal recessive polycystic kidney diseases (PKD). PKD is a 
major cause of kidney failure in adults, affecting millions of peo- 
ple each year worldwide. The main features of PKD are greatly 
enlarged kidneys packed with fluid-filled cysts (see FIGURE 1). 
Although the Tg737 gene had been identified many years ear- 
lier, nothing was known about the function of its protein prod- 
uct until basic studies of flagella in a unicellular model organism 
yielded the first molecular information about the IFT system. The 
fact that a gene involved in assembling cilia could play a role 
in kidney disease suggested that cilia might not be so vestigial 
after all. Further studies indicated that cilia in the collecting ducts 
of the kidney (Figure 9.23a) act as flow sensors to monitor urine 
flow, and when this flow signaling is defective, cysts develop that 
lead eventually to kidney failure. The importance of these cilia 
was reinforced when it was discovered that a pair of membrane 
proteins called polycystin 1 and 2 (PC1 and PC2) are localized on 
the surface of these kidney cilia. PC1 and PC2 form a complex in 
which PC1 serves as a sensor for fluid flow in the kidney tubules 
and PC2 serves as an associated Ca** ion channel. Mutations in 
PKD1 and PKD2, the genes that encode PC1 and PC2, lead to 
polycystic kidney disease similar to that seen when IFT88/Tg/37 
is defective. Based on these studies, it is now recognized that 
polycystic kidney disease is ultimately a disease of cilia. 

But PKD was just the tip of the iceberg. A large and still- 
growing number of diseases appear to involve defects in cilia 


FIGURE 1 Defective primary cilia in the kidney lead to polycystic 
kidney disease. Image shows a normal kidney on the left and a polycystic 


kidney on the right. 
Source: Image courtesy of the PKD (Polycystic Kidney Disease) 
Foundation, pkdcure.org 


in different tissues of the body. A good example is Bardet-Bied| 
syndrome (BBS), which is caused by mutations in any one of a 
number of genes that affect protein trafficking into cilia. Persons 
afflicted with BBS exhibit a remarkable range of abnormalities, 
including polydactyly (extra fingers or toes,), obesity, kidney dis- 
ease, mental retardation, and genital abnormalities. These dis- 
parate symptoms are thought to reflect the function of cilia as 
antennas to receive signals that regulate development and phys- 
iology. For example, the polydactyly seen in BBS and other cil- 
iopathies occurs because cilia are required for signaling by the 
hedgehog developmental signaling pathway. The fact that all of 
these dysfunctions can be traced to abnormalities in cilia illus- 
trates the widespread importance of these cellular structures in 
organ development and function. 

Many ciliopathies such as PKD or BBS involve sensory 
functions of non-motile cilia, but there are also serious health 
consequences related to the motile cilia. Cilia move mucus in the 
airway as well as the oviduct and cerebrospinal fluid in the brain 
ventricles. Defects in the motile functions of these cilia, caused 
for example by mutations in ciliary dyneins or radial spokes, can 
lead to bronchiectasis, chronic sinusitis in the case of the airway, 
and hydrocephalus in the case of the brain. All of these symptoms 
apparently result from loss of cilia-driven fluid flow. One surprising 
symptom that occurs in roughly half of all patients with non-motile 
cilia is situs inversus, a completely reversal of left-right asymmetry 
of the internal organs. When you look into a mirror, you see a 
relatively symmetrical organism, one whose left half is essentially 
a mirror image of the right half. Surgeons, on the other hand, 
see a strikingly asymmetric organism when they open a person's 
thoracic or abdominal cavity. The stomach, heart, and spleen, for 
example, are shifted to the left side of the body, whereas the liver 
is largely on the right side. On occasion, a physician will see a 
patient in which the left-right asymmetry of the visceral organs is 
reversed (a condition called situs inversus). Situs inversus appears 
to result from defects in motile cilia found on the embryo at 
the time of gastrulation. These embryonic cilia drive a leftward 
directed flow of extraembryonic fluid, and this directed flow 
determines which side of the embryo will be the left side versus 
the right side. When flow is absent, the left-right decision is made 
at random, so half of such patients develop situs inversus. The 
importance of flow was demonstrated by taking mouse embryos 
with non-motile cilia and using a pump to drive an artificially cre- 
ated flow over their surface. When the flow was leftward directed, 
the mice developed normal organ positions, and when the flow 
was rightward directed, the mice developed situs inversus. This 
explains why lack of motility leads to situs inversus but raises the 
question of how the direction of flow is detected by developing 
tissues to determine left versus right. Remarkably, it turns out that 
the cilia-driven flow in the embryo that breaks left-right symmetry 
is detected by the same polycystin proteins that sense fluid flow in 
the kidney. In the case of the embryo, these flow-sensing cilia are 
located in a ring around the patch of cells containing motile cilia, 
and the direction of flow across this patch determines which cells 
surrounding the patch become activated through the polycystin 
channels. 

The discovery of the ciliopathies has opened a new door 
to understanding the functional role of cilia in development 
and physiology. Many of the genes responsible for these var- 
ious ciliopathies were first identified in model organisms such as 
Chlamydomonas or C. elegans, which provides another example 
of the importance of basic research on nonvertebrate organisms 
in furthering our understanding of human disease. 


9.9 Intermediate Filaments 


The second of the three major cytoskeletal elements to be discussed 
was seen in the electron microscope as solid, unbranched filaments 
with a diameter of 10-12 nm. They were named intermediate fila- 
ments (or IFs). To date, intermediate filaments have only been identi- 
fied in animal cells. Intermediate filaments are strong, flexible, ropelike 
fibers that provide mechanical strength to cells that are subjected to 
physical stress, including neurons, muscle cells, and the epithelial cells 
that line the body’s cavities. Unlike actin filaments and microtubules, 
IFs area chemically heterogeneous group of structures that, in humans, 
are encoded by approximately 70 different genes. The polypeptide 
subunits of IFs can be divided into five major classes based on the type 
of cell in which they are found (Table 9.2) as well as biochemical, 
genetic, and immunologic criteria. We will restrict the present discussion 
to classes I-IV, which are found in the construction of cytoplasmic 
filaments, and consider type V IFs (the lamins), which are present as 
part of the inner lining of the nuclear envelope, in Section 12.2. IFs 
radiate through the cytoplasm of a wide variety of animal cells and are 
often interconnected to other cytoskeletal filaments by thin, wispy 
cross-bridges (FIGURE 9.33). In many cells, these cross-bridges consist 
of an elongated dimeric protein called plectin that can exist in numer- 
ous isoforms. Each plectin molecule has a binding site for an interme- 
diate filament at one end and, depending on the isoform, a binding site 
for another intermediate filament, microfilament, or microtubule at 
the other end. 

Although IF polypeptides have diverse amino acid sequences, all 
share a similar structural organization that allows them to form 
similar-looking filaments. Most notably, the polypeptides of IFs all 
contain a central, rod-shaped, «-helical domain of similar length and 
homologous amino acid sequence. This long fibrous domain makes 
the subunits of intermediate filaments very different from the globular 
tubulin and actin subunits of microtubules and actin filaments. The 
central fibrous domain is flanked on each side by globular domains of 
variable size and sequence (step 1, FIGURE 9.34). Two such polypep- 
tides spontaneously interact as their or-helical rods wrap around each 


TABLE 9.2 Properties and Distribution of the Major Mammalian 
Intermediate Filament Proteins 


Sequence 


type 


Primary tissue 


IF protein distribution 


Keratin (acidic) Epithelia 
(28 different polypeptides) 
Keratin (basic) 
(26 different polypeptides) 


Vimentin 


Epithelia 


Mesenchymal cells 

Muscle 

Astrocytes 

Peripheral neurons 

Neurons of central 
and peripheral 
nerves 


Desmin 
Glial fibrillary acidic protein (GFAP) 


Peripherin 
Neurofilament proteins 
NF-L 
NF-M 
NF-H 
Nestin 
Lamin proteins 
Lamin A 


Neuroepithelia 

All cell types 
(Nuclear envelopes) 
Lamin B 

Lamin C 


More detailed tables can be found in Trends Biochem Sci. 31:384, 2006, Genes and 
Development 21:1582, 2007, and Trends Cell Biol. 18:29, 2008. 


FIGURE 9.33 Cytoskeletal elements are connected to one another by 
protein cross-bridges. Electron micrograph of a replica of a small portion of 
the cytoskeleton of a fibroblast after selective removal of actin filaments. 
Individual components have been digitally colorized to assist visualization. 
Intermediate filaments (blue) are seen to be connected to microtubules (red) 
by long wispy cross-bridges consisting of the fibrous protein plectin (green). 
Plectin is localized by antibodies linked to colloidal gold particles (yellow). 


Source: Courtesy of Tatyana Svitkina and Gary Borisy. 


other to form a ropelike dimer approximately 45 nm in length (step 2). 
Because the two polypeptides are aligned parallel to one another in the 
same orientation, the dimer has polarity, with one end defined by the 
C-termini of the polypeptides and the opposite end by their N-termini. 


Intermediate Filament Assembly 
and Disassembly 


The basic building block of IF assembly is thought to be a rodlike 
tetramer formed by two dimers that become aligned side by side in a 
staggered fashion with their N- and C-termini pointing in opposite 
(antiparallel) directions, as shown in Figure 9.34, step 3 and in 
Figure 9.34b. Because the dimers point in opposite directions, the 
tetramer itself lacks polarity. Recent studies of the self-assembly of 
IFs in vitro suggest that eight tetramers associate with one another in 
a side-by-side (lateral) arrangement to form a filament that is one 
unit in length (about 60 nm) (step 4). Subsequent growth of the pol- 
ymer is accomplished as these unit lengths of filaments associate 
with one another in an end-to-end fashion to form the highly elon- 
gated intermediate filament (step 5). None of these assembly steps is 
thought to require the direct involvement of either ATP or GTP. 
Because the tetrameric building blocks lack polarity, so too does the 
assembled filament, which is another important feature that distin- 
guishes IFs from other cytoskeletal elements. 

Intermediate filaments tend to be less sensitive to chemical 
agents than other types of cytoskeletal elements and more difficult to 
solubilize. In fact, treatment of a cell with ionic detergents extracts 
just about everything except the cell’s intermediate filaments. 
Because of their insolubility, IFs were initially thought to be perma- 
nent, unchanging structures, so it came as a surprise to find that they 
behave dynamically in vivo. When labeled keratin subunits are 
injected into cultured skin cells, they are rapidly incorporated into 
existing IFs. Surprisingly, the subunits are not incorporated at the 
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FIGURE 9.34 A model of intermediate filament assembly and architec- 
ture. Each monomer has a pair of globular terminal domains (red or yellow) 
separated by a long a-helical region (step 1). Pairs of monomers associate in 
parallel orientation with their ends aligned to form dimers (step 2). 
Depending on the type of intermediate filament, the dimers may be 
composed of identical monomers (homodimers) or nonidentical monomers 
(heterodimers). Dimers in turn associate in an antiparallel, staggered fashion 
to form tetramers (step 3), which are thought to be the basic subunit in the 
assembly of intermediate filaments. In the model shown here, 8 tetramers 
associate laterally to form a unit length of the intermediate filament (step 4). 
Highly elongated intermediate filaments are then formed from the end-to- 
end association of these unit lengths (step 5). Once formed, intermediate 
filaments undergo a process of dynamic remodeling that is thought to 
involve the intercalation of unit lengths of filament into the body of an 
existing filament (step 6). (b) A model of a tetramer of the IF protein vimentin. 


Source: From Anna A. Sokolova, et al., Courtesy of Sergei V. Strelkov, PNAS 
103:16209, 2006, Fig. 3a. © 2006 National Academy of Sciences, U.S.A. 


ends of the filament, as might have been expected by analogy with 
microtubule and actin filament assembly, but rather into the fila- 
ment’s interior (FIGURE 9.35a). The results depicted in Figure 9.35 


FIGURE 9.35 Experimental demonstration of the dynamic character 
of intermediate filaments. These photographs show the results of an 
experiment in which biotin-labeled type | keratin was microinjected into 
cultured epithelial cells and localized 20 minutes later using immunofluores- 
cence. The photograph in a shows the localization of the injected bioti- 
nylated keratin (as revealed by fluorescent anti-biotin antibodies) that had 
become incorporated into filaments during the 20-minute period following 
injection. The photograph in b shows the distribution of intermediate 
filaments in the cell as revealed by anti-keratin antibodies. The dotlike 
pattern of fluorescence in a indicates that the injected subunits are 
incorporated into the existing filaments at sites throughout their length, 
rather than at their ends. (Compare with a similar experiment with labeled 
tubulin in Figure 9.26.) Bar, 10m. 

Source: From Rita K. Miller, Karen Vikstrom, and Robert D. Goldman, J. Cell 
Biol. 113:848, 1991, Fig. 4. Reproduced with permission of Rockefeller 
University Press. 


might reflect the exchange of unit lengths of filament directly into an 
existing IF network (as shown in step 6, Figure 9.34a). Unlike the 
other two major cytoskeletal elements, assembly and disassembly of 
IFs are controlled primarily by phosphorylation and dephosphoryla- 
tion of the subunits. For example, phosphorylation of vimentin fila- 
ments by protein kinase A leads to their disassembly. 


Types and Functions 
of Intermediate Filaments 
Keratin filaments constitute the primary structural proteins of epithe- 


lial cells (including epidermal cells, liver hepatocytes, and pancreatic 
acinar cells). FIGURE 9.36a shows a schematic view of the spatial 
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FIGURE 9.36 The organization of intermediate filaments (IFs) within an epithelial cell. (a) In this schematic 
drawing, IFs are seen to radiate throughout the cell, being anchored at both the outer surface of the nucleus 
and the inner surface of the plasma membrane. Connections to the nucleus are made via proteins that span 
both membranes of the nuclear envelope and to the plasma membrane via specialized sites of adhesion such 
as desmosomes and hemidesmosomes. IFs are also seen to be interconnected to both of the other types of 


cytoskeletal fibers. Connections to microtubules (MTs) and actin filaments (AFs) are made primarily by members 
of the plakin family of proteins, such as the dimeric plectin molecule shown in Figure 9.41. (b) Distribution of 
keratin-containing intermediate filaments in cultured skin cells (keratinocytes). The filaments are seen to form a 
cagelike network around the nucleus and also extend to the cell periphery. 

Source: (a) Reprinted with permission from H. Herrmann et al., Nature Revs. Mol. Cell Biol. 8:564, 2007; 
Copyright 2007, Macmillan Magazines Ltd. Nature Reviews Molecular Cell Biology by Nature Publishing Group. 


(b) 


arrangement of the keratin filaments of a generalized epithelial cell, and 
Figure 9.36b shows the actual arrangement within a pair of cultured 
epidermal cells. Keratin-containing IFs radiate through the cytoplasm, 
tethered to the nuclear envelope in the center of the cell and anchored 
at the outer edge of the cell by connections to the cytoplasmic plaques 
of desmosomes and hemidesmosomes (Section 7.9). Figure 9.36a also 


Reproduced with permission of Nature Publishing Group in the format journal via copyright clearance center; 
(b) From Pierre A. Coulombe and M. Bishr Omary, Curr. Opin. Cell Biol. 14:111, 2002, with permission from Elsevier. 


depicts the interconnections between IFs and the cell’s microtubules 
and actin filaments, which transforms these otherwise separate ele- 
ments into an integrated cytoskeleton. Because of these various physi- 
cal connections, the IF network is able to serve as a scaffold for 
organizing and maintaining cellular architecture and for absorbing 
mechanical stresses applied by the extracellular environment. 
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The cytoplasm of neurons contains loosely packed bundles of 
intermediate filaments whose long axes are oriented parallel to that 
of the axon (see Figure 9.7b). These IFs, or neurofilaments, as they 
are called, are composed of three distinct proteins: NF-L, NF-H, and 
NF-M, all of the type IV group of Table 9.2. Unlike the polypeptides 
of other IFs, NF-H and NF-M have sidearms that project outward 
from the neurofilament. These sidearms are thought to maintain the 
proper spacing between the parallel neurofilaments of the axon (see 
Figure 9.7b). In the early stages of differentiation when the axon is 
growing toward a target cell, it contains very few neurofilaments but 
large numbers of supporting microtubules. Once the nerve cell has 
become fully extended, it becomes filled with neurofilaments that 
provide support as the axon increases dramatically in diameter. 
Aggregation of NFs is seen in several human neurodegenerative 
disorders, including ALS and Parkinson's disease. These NF aggre- 
gates may block axonal transport, leading to the death of neurons. 

Efforts to probe IF function have relied largely on genetically 
engineered mice that fail to produce a particular IF polypeptide 
(a gene knockout) or produce an altered IF polypeptide. These stud- 
ies have revealed the importance of intermediate filaments in particu- 
lar cell types. For example, mice carrying deletions in the gene 
encoding K14, a type I keratin polypeptide normally synthesized by 
cells of the basal epidermal layer, have serious health problems. These 
mice are so sensitive to mechanical pressure that even mild trauma, 
such as that occurring during passage through the birth canal or dur- 
ing nursing by the newborn, can cause severe blistering of the skin or 
tongue. This phenotype bears strong resemblance to a rare skin- 
blistering disease in humans, called epidermolysis bullosa simplex 
(EBS).° Subsequent analysis of EBS patients has shown that they carry 
mutations in the gene that encodes the homologous K14 polypeptide 
(or the K5 polypeptide, which forms dimers with K14). These studies 
confirm the role of IFs in imparting mechanical strength to cells situ- 
ated in epithelial layers. Similarly, knockout mice that fail to produce 
the desmin polypeptide exhibit serious cardiac and skeletal muscle 
abnormalities. Desmin plays a key structural role in maintaining the 
alignment of the myofibrils of a muscle cell, and the absence of these 
IFs makes the cells extremely fragile. An inherited human disease, 
named desmin-related myopathy, is caused by mutations in the gene 
that encodes desmin. Persons with this disorder suffer from skeletal 
muscle weakness, cardiac arrhythmias, and eventual congestive heart 
failure. Not all IF polypeptides have such essential functions. For 
example, mice that lack the vimentin gene, which is expressed in 
fibroblasts, macrophages, and white blood cells, show relatively minor 
abnormalities, even though the affected cells lack cytoplasmic IFs. It 
is evident from these studies that IFs have tissue-specific functions, 
which are more important in some cells than in others. 


As mentioned in Chapter 7, similar types of blistering diseases can be caused by defects 
in proteins of the hemidesmosome, which holds the basal layer of the epidermis to 
the basement membrane. EBS can also be caused by mutations in the gene encoding the 
IF crosslinking protein plectin, but such patients also suffer from late onset muscular 
dystrophy, showing the importance of IF networks in muscle as well as skin. 


REVIEW | 


1. Give some examples that reinforce the suggestion 
that intermediate filaments are important primarily in 
tissue-specific functions rather than in basic activities 
that are common to all cells. 


2. Compare and contrast microtubule assembly and 
intermediate filament assembly. 


oe 


9.10 Actin 


Cells are capable of remarkable motility. The neural crest cells in a 
vertebrate embryo leave the developing nervous system and migrate 
across the entire width of the embryo, forming such diverse products 
as the pigment cells of the skin, teeth, and the cartilage of the jaws 
(see Figure 7.11). Legions of white blood cells patrol the tissues of 
the body searching for debris and microorganisms. Certain parts of 
cells can also be motile; broad projections of epithelial cells at the 
edge of a wound act as motile devices that pull the sheet of cells over 
the damaged area, sealing the wound. Similarly, the leading edge of a 
growing axon sends out microscopic processes that survey the sub- 
stratum and guide the cell toward a synaptic target. All of these vari- 
ous examples of motility share at least one component: They all 
depend on actin, the third major type of cytoskeletal element. Actin 
is also involved in intracellular motile processes, such as the move- 
ment of vesicles, phagocytosis, and cytokinesis. In fact, plant cells 
rely primarily on actin, rather than microtubules, to serve as tracks 
for the long-distance transport of cytoplasmic vesicles and orga- 
nelles. This bias toward actin-based motility reflects the rather 
restricted distribution of microtubules in many plant cells (see 
Figure 9.6). Actin also plays an important role in determining the 
shapes of cells and can provide structural support for various types 
of cellular projections (as in Figure 9.66). 


Actin Structure 


Actin filaments are approximately 8 nm in diameter and composed 
of globular subunits of the protein actin, which is the most abundant 
protein in most cells. In the presence of ATP, actin monomers 
polymerize to form a flexible, helical filament. As a result of its 
subunit organization (FIGURE 9.37a), an actin filament is essentially 
a two-stranded structure with two helical grooves running along its 
length (Figure 9.37b). The terms actin filament, F-actin, and micro- 
filament are basically synonyms for this type of filament. Depending 
on the type of cell and the activity in which it is engaged, actin fila- 
ments can be organized into ordered arrays, highly branched net- 
works, or tightly anchored bundles. 

All of the monomers within an actin filament are pointed in the 
same direction, resulting in a polar filament with so-called “barbed” 
and “pointed” ends. This naming convention originated from a tech- 
nique used to identify and label actin filaments in preparation for 
electron microscopy. This method utilized the ability of a proteolytic 
fragment of myosin, called S1 (see Figure 9.41), to bind tightly and 
“decorate” the sides of actin filaments. When S1 fragments are bound, 
one end of the actin filament appears pointed like an arrowhead (the 
“pointed” end), while the other end looks barbed. An example of this 
arrowhead “decoration” is shown in the microvilli of intestinal epi- 
thelial cells of FIGURE 9.38. S1 has proven useful for the identification 
and characterization of actin filaments using the electron micro- 
graph. Using light microscopy, actin can be readily identified and 
localized using fluorescently labeled phalloidin (Figure 9.66a), a 
chemical that binds to the sides of actin filaments or through the use 
of fluorescently labeled anti-actin antibodies. 

Actin was identified more than 50 years ago as one of the major 
contractile proteins of muscle cells. Since then, it has been identified 
as a major protein in virtually every type of eukaryotic cell that has 
been examined. Higher plant and animal species possess a number of 
actin-coding genes whose encoded products are specialized for differ- 
ent types of processes and structures. Actins have been remarkably 


(a) (b) 


100 nm 


FIGURE 9.37 Actin filament structure. (a) A model of an actin filament. 
The actin subunits are represented in three colors to distinguish the 
consecutive subunits more easily. The subdomains in one of the actin 
subunits are labeled 1, 2, 3, and 4 and the ATP-binding cleft in each 
subunit is evident. Actin filaments have polarity, which is denoted as a plus 
and minus end. The cleft (in the upper red subunit) is present at the minus 
end of the filament. (b) Electron micrograph of a replica of an actin filament 
showing its double-helical architecture. 

Source: (a) From Michael F. Schmid, et al., Courtesy of Wah Chiu, J. Cell 
Biol. 124:346, 1994, Fig. 4. Reproduced with permission of Rockefeller 
University Press; (b) From Robert H. Depue, Jr. and Robert V. Rice, J. Mol. 
Biol. 12:302, 1965. Reproduced with permission of Elsevier. 


conserved during the evolution of eukaryotes. For example, the 
amino acid sequences of actin molecules from a yeast cell and from 
rabbit skeletal muscle are 88 percent identical. In fact, actin mole- 
cules from diverse sources can copolymerize to form hybrid fila- 
ments. Although actin filaments can generate forces on their own 
(Section 9.15), most processes involving actin require the activity of 
motor proteins, specifically those of the myosin superfamily. 


Actin Filament Assembly and Disassembly 


Before it is incorporated into a filament, an actin monomer binds a 
molecule of ATP. Actin is an ATPase, just as tubulin isa GTPase, and 
the role of ATP in actin assembly is similar to that of GTP in micro- 
tubule assembly (page 325). The ATP associated with the actin mon- 
omer is hydrolyzed to ADP at some time after it is incorporated into 
the end of a growing actin filament. As a consequence, the bulk of an 
actin filament consists of ADP-actin subunits. 


200 nm 


FIGURE 9.38 Determining the location and polarity of actin filaments 
using the $1 subunit of myosin. Electron micrograph of a replica showing 
the actin filament bundles in the core of the microvilli of an intestinal 
epithelial cell. The cell had been fixed, treated with S1 myosin fragments, 
freeze fractured, and deep etched to expose the filamentous components 
of the cytoplasm. The intermediate filaments (IFs) at the bottom of the 
micrograph do not contain actin and therefore do not bind the $1 myosin 
fragments. These intermediate filaments originate at the desmosomes of 
the lateral surfaces of the cell. 

Source: From N. Hirokawa, L.G. Tilney, K. Fujiwara, and J. E. Heuser, J. Cell 
Biol. 94:430, 1982, fig. 3; Repoduced with permission of the Rockefeller 
University Press. 


Actin polymerization is readily demonstrated in vitro in solutions 
containing ATP-actin monomers. As in the case of microtubules, 
the initial nucleation event in filament formation occurs slowly in 
vitro, whereas the subsequent stage of filament elongation occurs 
much more rapidly. The slow nucleation stage of filament formation 
can be bypassed by including preformed actin filaments in the reac- 
tion mixture. When preformed actin filaments are incubated with a 
high concentration of labeled ATP-actin monomers, both ends of 
the filament become labeled, but the fast-growing barbed end incor- 
porates the monomers at a rate approximately 10 times that of the 
pointed end (FIGURE 9.39a). 

Figure 9.39b illustrates how the events that occur during actin 
assembly/disassembly in vitro depend on the concentration of actin 
monomers and on the elongation dynamics of the filament ends. 
The barbed and pointed ends require different minimal concentra- 
tions of ATP-actin monomers in order to elongate, a measure known 
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FIGURE 9.39 Actin assembly in vitro. (a) Electron micrograph of a short actin filament that was labeled with $1 myosin and then used to nucleate actin 
polymerization. The addition of actin subunits occurs much more rapidly at the barbed (plus) end than at the pointed (minus) end of the existing filament. 
(b) Schematic diagram of the kinetics of actin filament assembly in vitro. All of the orange subunits are part of the original seed; red subunits were present in 
solution at the beginning of the incubation. The steps are described in the text. Once a steady-state concentration of monomers is reached, subunits are 
added to the plus end at the same rate they are released from the minus end. As a result, subunits treadmill through the filament in vitro. Note: No attempt 


is made to distinguish between subunits with a bound ATP versus ADP. 
Source: Courtesy of M.S. Runge and T. D. Pollard. 


as the critical concentration. The critical concentration of the barbed 
end is much lower than the pointed end, meaning that the barbed 
end can continue to elongate at lower ATP-actin concentrations 
than the pointed end can. Suppose we were to begin by adding pre- 
formed actin filaments (seeds) to a solution of actin in the presence 
of ATP (step 1). As long as the concentration of ATP-actin mono- 
mers remains high, subunits will continue to be added at both ends 
of the filament (step 2, Figure 9.39b). As the monomers in the reac- 
tion mixture are consumed by addition to the ends of the filaments, 
the concentration of free ATP-actin continues to drop until a point 
is reached where net addition of monomers continues at the barbed 
end, which has a lower critical concentration of ATP-actin, but stops 
at the pointed end, which has a higher critical concentration for 
ATP-actin (step 3). As filament elongation continues, the free mon- 
omer concentration drops further. At this point, monomers con- 
tinue to be added to the barbed ends of the filaments, but a net loss 
of subunits occurs at their pointed end. As the free monomer 
concentration falls, a point is reached where the two reactions at 
opposite ends of the filaments are balanced so that both the lengths 
of the filaments and the concentration of free monomers remain 
constant (step 4). This type of balance between two opposing activi- 
ties is an example of steady state (Section 3.4) and occurs when the 
ATP-actin concentration is approximately 0.3 1M. Because subunits 
are being added to the barbed ends and removed from the pointed 
ends of each filament at steady state, the relative position of indi- 
vidual subunits within each filament is continually moving—a pro- 
cess known as “treadmilling” (steps 4-5). Studies on living cells 
containing fluorescently labeled actin subunits have demonstrated 
the occurrence of treadmilling in vivo (see Figure 9.46c). 


As discussed in Section 9.13, the rate of assembly and disassem- 
bly of actin filaments in the cell can be influenced by a number of 
different accessory proteins. Changes in the local conditions in a 
particular part of the cell can push events there toward either the 
assembly or the disassembly of microfilaments. By controlling this 
dynamic behavior, the cell can reorganize its actin cytoskeleton. 
Such reorganization is required for dynamic processes such as cell 
locomotion, changes in cell shape, phagocytosis, and cytokinesis. 

As noted above, actin filaments play a role in nearly all of a cell’s 
motile processes. The involvement of these filaments is most readily 
demonstrated by treating the cells with one of the following drugs 
that disrupt dynamic actin-based activities: cytochalasin, derived 
from a mold, which blocks the barbed ends of actin filaments and 
allows depolymerization at the pointed end; phalloidin, obtained 
from a poisonous mushroom, which binds to intact actin filaments 
and prevents their turnover; and latrunculin, obtained from a 
sponge, which binds to free monomers and blocks their incorpora- 
tion into the polymer. Actin-mediated processes rapidly grind to a 
halt when cells contain one of these compounds. 


REVIEW | 
1. Compare and contrast the characteristics of 
microtubule assembly versus actin filament assembly. 


2. Compare the structure of a fully assembled 
microtubule, actin filament, and intermediate filament. 


3. Describe three functions of actin filaments. 


oa 


9.11 Myosin: The Molecular 
Motor of Actin 


We have previously examined the structure and actions of two molecu- 
lar motors—kinesin and dynein—that operate in opposite directions 
over tracks of microtubules. To date, all of the motors known to operate 
in conjunction with actin filaments are members of the myosin super- 
family. Myosins—with the major exception of myosin VI, which is dis- 
cussed below—move toward the barbed end of an actin filament. 

Myosin was first isolated from mammalian skeletal muscle tissue 
and has subsequently been found in virtually all eukaryotic cells. All 
myosins share a characteristic motor (head) domain. The head con- 
tains a site that binds an actin filament and a site that binds and hydro- 
lyzes ATP to drive the myosin motor. Whereas the head domains of 
various myosins are similar, the tail domains are highly divergent. 
Myosins also contain a variety of low-molecular weight (light) chains. 
Myosins are generally divided into two broad groups: the conven- 
tional (or type II) myosins, which were first identified in muscle 
tissue, and the unconventional myosins. The unconventional 
myosins are subdivided on the basis of amino acid sequence into at 
least 17 different classes (type I and types III-XVIII). Some of these 
classes are expressed widely among eukaryotes, whereas others are 
restricted. Myosin X, for example, is found only in vertebrates, and 
myosins VIII and XI are present only in plants. Humans contain about 
40 different myosins from at least 12 classes, each presumed to have its 
own specialized function(s). Of the various myosins, those in the con- 
ventional (type II) class are best understood. 


Conventional (Type II) Myosins 


Proteins of the myosin II class are the primary motors for muscle con- 
traction but are also found in a variety of non-muscle cells. The human 
genome encodes 16 different myosin II heavy chains, three of which 
function in non-muscle cells. All myosin IIs move toward the barbed 
end of an actin filament. Among their non-muscle activities, type II 
myosins are required for splitting a cell in two during cell division, 
generating tension at focal adhesions, cell migration, and the turning 
behavior of growth cones (see Figure 9.67). The effect of inhibiting 


(a) 


myosin II activity in an advancing growth cone is demonstrated in 
FIGURE 9.40. 

An electron micrograph of a pair of myosin II molecules is 
shown in FIGURE 9.41a. Each myosin II molecule is composed of six 
polypeptide chains—one pair of heavy chains and two pairs of light 
chains—organized in such a way as to produce a highly asymmetric 
protein (Figure 9.41a). Examination of the molecule in Figure 9.41b 
shows it to consist of (1) a pair of globular heads that contain the 
catalytic site of the molecule; (2) a pair of necks, each consisting of a 
single, uninterrupted « helix and two associated light chains; and 
(3) a single, long, rod-shaped tail formed by the intertwining of long 
a-helical sections of the two heavy chains. Isolated myosin heads 
(S1 fragments of Figure 9.41b) that have been immobilized on the 
surface of a glass coverslip are capable of sliding attached actin fila- 
ments in an in vitro assay such as that shown in FIGURE 9.42. Thus, 
all of the machinery required for motor activity is contained in a 
single head. The mechanism of action of the myosin head, and the 
key role played by the neck, are discussed in Figure 9.52. The fibrous 
tail portion of a myosin II molecule plays a structural role, allowing 
the protein to form filaments. Myosin II molecules assemble so that 
the ends of the tails point toward the center of the filament and the 
globular heads point away from the center (FIGURES 9.43 and 9.49). 
As a result, the filament is described as bipolar, indicating a reversal 
of polarity at the filament’s center. Because they are bipolar, the myo- 
sin heads at the opposite ends of a myosin filament have the ability 
to pull actin filaments toward one another, as occurs in a muscle cell. 
As described in the following section, the some myosin II filaments 
assemble in transient construction, assembling when and where 
they are needed and then disassembling after they have acted. 


Unconventional Myosins 


In 1973, Thomas Pollard and Edward Korn of the National Institutes of 
Health described a unique myosin-like protein that was extracted from 
the protist Acanthamoeba. Unlike muscle myosin, this smaller uncon- 
ventional myosin had only a single head and was unable to assemble into 
filaments in vitro; the protein became known as myosin I. As reflected in 


(b) 


FIGURE 9.40 Experimental demonstration of a role for myosin Il in the directional movement of growth cones. (a) Fluorescence micrograph showing fine 
processes (neurites) growing out from a microscopic fragment of mouse embryonic nervous tissue. The neurites (stained green) are growing outward on a glass 


coverslip coated with strips of laminin (stained red). Laminin is a common component of the extracellular matrix (page 229). The tip of each nerve process contains 


a motile growth cone. When the growth cones reach the border of the laminin strip (indicated by the line with arrowheads), they turn sharply and continue to grow 


over the laminin-coated surface. Bar, 500 pm. (b) The tissue in this micrograph was obtained from a mouse embryo lacking myosin IIB. The growth cones no 


longer turn when they reach the edge of the laminin-coated surface, causing the neurites to grow forward onto a surface (black) lacking laminin. Bar, 80 pm. 


Source: (a) From Stephen G. Turney and Paul C. Bridgman, Nature Neurosci. 8:717 2005; © 2005, Reprinted by permission of Macmillan Publishers, Ltd; 
(b) From Stephen G. Turney and Paul C. Bridgman, Nature Neurosci. 8:717, 2005; © 2005, Reprinted by permission of Macmillan Publishers, Ltd. 
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(a) 20 nm (b) 


FIGURE 9.41 Structure of a myosin II molecule. (a) Electron micrograph of negatively stained myosin molecules. The two heads and tail of each molecule 
are clearly visible. (b) A highly schematic drawing of a myosin II molecule (molecular mass of 520,000 daltons). The molecule consists of one pair of heavy 
chains (purple) and two pairs of light chains, which are named as indicated. The paired heavy chains consist of a rod-shaped tail in which portions of the two 
polypeptide chains wrap around one another to form a coiled coil and a pair of globular heads. When treated with a protease under mild conditions, the 
molecule is cleaved at the junction between the neck and tail, which generates the S1 fragment. 

Source: (a) From S. A. Burgess, M. L. Walker, H. D. White, and J. Trinick, J. Cell Biol. 139:676,1997, Fig 1. Reproduced with permission of the Rockefeller 


University Press. 


Actin Myosin 


Heavy chains 


(a) 


(b) 


FIGURE 9.43 Structure of a bipolar myosin II filament. (a) Schematic 
diagram of the staggered arrangement of the individual myosin molecules 
in a myosin II filament. (b) Electron micrograph of a bipolar myosin filament 
formed in vitro. The heads of the filament are seen at both ends, leaving a 
smooth section in the center of the filament. 


Source: (6) Courtesy Hugh Huxley 


(b) 5 um 


FIGURE 9.42 In vitro motility assay for myosin. (a) Schematic drawing in which myosin heads are 
bound to a silicone-coated coverslip, which is then incubated with a preparation of actin filaments. 

(b) Results of the experiment depicted in a. Two images were taken 1.5 seconds apart and photographed 
as a double exposure on the same frame of film. The dashed lines with arrowheads show the sliding 
movement of the actin filaments over the myosin heads during the brief period between exposures. 


Source: (6) Based on work by T. Yanagida. 
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FIGURE 9.44 Myosin V—a two-headed unconventional myosin involved in organelle transport. (a) Direct visualization of a single myosin V molecule (one 


that is lacking its normal tail domain) as it moves along an actin filament in vitro in the presence of ATP. Using high-speed atomic force microscopy (HS-AFM), 


this series of images shows the movement of the molecule over a period of about one second. (b) Succesive HS-AFM images showing the hand-over-hand 


movement of a single myosin V molecule as it passes through a cluster of obstacles (consisting of streptavidin protein molecules). The swinging neck (or lever 


arm) is highlighted with a thin white line. Interpretive drawings (b) depict the motor protein as seen in several of the corresponding HS-AFM images. A 


continuous movie of the protein's movement can be seen in the supplement to this article. (c) Schematic drawing of a complete dimeric myosin V molecule, 
including its numerous light chains, with both heads bound to an actin filament and its tail domain bound to a vesicle. Rab27a and melanophilin serve as 


adaptors that link the globular ends of the tail to the vesicle membrane. The long neck of myosin V binds 6 light chains. 


Source: From Noriyuki Kodera, et al., Nature, 468:74, 2010, © 2010, reprinted by permission from Macmillan Publsihers, Ltd. Courtesy of Toshio Ando. 


the drawing of a microvillus in Figure 9.58, myosin I often serves as a 
cross-link between actin filaments of the cytoskeleton and the lipid 
bilayer of the plasma membrane. It has been suggested that myosin I can 
exert tension on the plasma membrane, which could play a role in pro- 
cesses that require movement or deformation of the membrane. 

None of the unconventional myosins are capable of filament for- 
mation and instead appear to operate primarily as individual protein 
molecules. The best-studied unconventional myosins are capable of 
moving processively along actin filaments in a manner analogous to 
the way that kinesins and cytoplasmic dynein move along microtu- 
bules. The steps taken by one of these myosins—myosin V—have 
been revealed in a remarkable series of atomic force micrographs 
that capture a single molecule as it moves rapidly along an actin fila- 
ment in vitro (FIGURE 9.44a). In order to visualize the various steps in 
the protein’s mechanical cycle, these researchers placed molecular 
obstacles in the path of the moving motor protein, which slowed the 
rate at which the protein was able to travel. Figure 9.44b shows a 
series of images that reveal the movements of the molecule during a 
single mechanical cycle. (Keep in mind that myosin V normally has 
a sizable tail segment, as depicted in Figure 9.44c, that has been 
removed for these particular experiments.) 

In the first frame of Figure 9.44b, both heads of the dimeric pro- 
tein are seen to be attached to the underlying filament, and a cluster 
of obstacles are seen in the path of the motor. In the second frame, 
the trailing head of the myosin V molecule has detached from the 
filament and is in the process of moving forward through the road- 
blocks in its path. By the fourth frame, this swinging head has made 
contact with the filament at a forward position along the filament 


and thus has become the new leading head. These images provide 
visual confirmation that myosin V moves in a hand-over-hand 
fashion similar to that of the kinesin molecule in the drawing of 
Figure 9.9b. To accomplish this type of movement, at least one of the 
two heads must be bound to its polarized track at all times. 
Figure 9.44c shows an illustration of an intact myosin V molecule. 
A seen in this illustration, myosin V is noteworthy for the length of 
each neck, which at 23 nm is about three times that of myosin II. 
Because of its long necks, which act as a swinging arm (or lever) 
during movement, myosin V can take very large steps. This is very 
important for a motor protein that moves processively along an actin 
filament made up of helical strands of subunits. The actin helix 
repeats itself about every 13 subunits (36 nm), which is just about the 
step size of a myosin V molecule (Figure 9.44b). To accomplish this 
type of movement, each myosin head must move a distance of 72 nm, 
twice the distance between two successive binding sites on the actin 
filament (Figure 9.44). As a result of its giant strides, myosin V can 
walk along the filament in a straight path even though the underlying 
“roadway” spirals 360 oo between its “feet?” Several unconventional 
myosins (including myosin I, V, and VI) are associated with various 
types of cytoplasmic vesicles and organelles. In some cases, these 
myosins may act primarily as organelle tethers and in other cases as 
organelle transporters. Some vesicles have been shown to contain 
microtubule-based motors (kinesins and/or cytoplasmic dynein) and 
microfilament-based motors (unconventional myosins), and, in fact, 
the two types of motors may be physically linked to one another. The 
movement of vesicles and other membranous carriers over long dis- 
tances within animal cells occurs on microtubules, as previously 
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described. However, once they approach the end of the microtubule, 
these membranous vesicles are often thought to switch over to micro- 
filament tracks for the local movement through the actin-rich 
periphery of the cell, which is mediated by myosins (FIGURE 9.45). 
Cooperation between microtubules and microfilaments has 
been best studied in pigment cells (Figure 9.45). In mammals, pig- 
ment granules (melanosomes) are transported into fine peripheral 
processes of the pigment cell by one of the myosin V isoforms called 
Va. Melanosomes are then transferred to hair follicles where they 
become incorporated into a developing hair. Mice lacking myosin Va 
activity are unable to transfer melanosomes into hair follicles, caus- 
ing their coat to have a much lighter color. Humans lacking a normal 
gene encoding myosin Va suffer from a rare disorder called Griscelli 
syndrome; these individuals exhibit partial albinism (lack of skin col- 
oration) and suffer other symptoms thought to be related to defects 
in vesicle transport. In 2000 it was discovered that a subset of Griscelli 
patients had a normal myosin Va gene, but lacked a functional gene 
encoding a peripheral membrane protein called Rab27a. The Rab 
family of proteins was discussed on page 288 as molecules that regu- 
late vesicle trafficking. Rabs are also involved in the attachment of 
myosin (and kinesin) motors to membrane surfaces (Figure 9.44c). 
Inner ear hair cells, whose structure is shown in FIGURE 9.46a, 
have been a particularly good system for studying the functions of 
unconventional myosins. Hair cells are named for the bundle of stiff, 
hairlike stereocilia that project from the apical surface of the cell into 
the fluid-filled cavity of the inner ear. Displacement of the stereocilia 
by mechanical stimuli leads to the opening of Ca”* channels in the 
plasma membrane and the subsequent generation of nerve impulses 
that we perceive as sound. Stereocilia have no relation to the true cilia 
discussed earlier. Instead of containing microtubules, each stereocil- 
ium is supported by a bundle of parallel actin filaments (Figure 9.46b) 
whose barbed ends are located at the outer tip of the projection and 
pointed ends at the base. Stereocilia have provided some of the most 
striking images of the dynamic nature of the actin cytoskeleton. While 
the stereocilia themselves are permanent structures, the actin bundles 
are in constant flux. Actin monomers continually bind to the barbed 
end of each filament, treadmill through the body of the filament, and 
dissociate from the pointed end. This process is captured in the fluo- 
rescence micrograph of Figure 9.46c, which shows the incorporation 
of GFP-labeled actin subunits at the barbed end of each filament. 
Several unconventional myosins (I, IH, V, VI, VII, and XV) 


FIGURE 9.45 The contrasting roles of microtubule- and microfilament- 
based motors in intracellular transport. Most vesicle transport is thought 
to be mediated by members of the kinesin and dynein families, which carry 
their cargo over relatively long distances. It is thought that some vesicles 
also carry myosin motors, such as myosin Va, which transport their cargo 
over microfilaments, including those present in the peripheral (cortical) 
regions of the cell. The two types of motors may act in a cooperative 
manner, as illustrated here in the case of a frog pigment cell in which 
pigment granules move back and forth within extended cellular processes. 


Source: After X, Wu, et al., Copyright 1998, Rockefeller University Press. 
Originally published in the Journal of Cell Biology 143:1915. 


are localized at various sites within the hair cells of the inner ear; two 
of these are shown in Figure 9.46d and e. Mutations in myosin VIa 
are the cause of Usher 1B syndrome, which is characterized by both 
deafness and blindness. The morphologic effects of mutations in the 
myosin VIIa gene on the hair cells of the inner ear of mice are shown 
in Figure 9.46fg. As in humans, mice that are homozygous for the 
mutant allele encoding this motor protein are deaf. 

Myosin VI, a processive organelle transporter in the cytoplasm of 
many cells, is distinguished by its movement in a “reverse direction,” 
that is, toward the pointed end of an actin filament. Myosin VI is 
located at the base of the stereocilia where it might connect the 
cytoskeleton to the overlying plasma membrane. In other types of cells, 
myosin V1 is thought to be involved in the formation of clathrin-coated 
vesicles at the plaama membrane, the movement of uncoated vesicles 
to early endosomes, and the maintenance of Golgi morphology. 
Mutations in myosin VI are the cause of several inherited diseases. 

Now that we have described the basics of actin and myosin 
structure and function, we can see how these two proteins interact to 
mediate a complex cellular activity. 


REVIEW | 


1. Contrast the structure and function of conventional 
myosin Il and the unconventional myosin V. 


2. How is it possible for the same vesicle to be 
transported along both microtubules and 
microfilaments? 


9.12 Muscle Organization 
and Contraction 


Skeletal muscles derive their name from the fact that most of them are 
anchored to the bones that they move. They are under voluntary con- 
trol and can be consciously commanded to contract. Skeletal muscle 
cells have a highly unorthodox structure. A single, cylindrically shaped 
muscle cell is typically 10 to 100 um thick, over 100 mm long, and con- 
tains hundreds of nuclei. Because of these properties, a skeletal muscle 
cell is more appropriately called a muscle fiber. Muscle fibers have 
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FIGURE 9.46 Hair cells, actin bundles, and unconventional myosins. (a) Drawing of a hair cell of the cochlea. The inset shows a portion of several of the 
stereocilia, which are composed of a tightly grouped bundle of actin filaments. (b) Transmission electron micrograph of a cross section of a stereocilium 
showing it is composed of a dense bundle of actin filaments. (c) Fluorescence micrograph of a hair cell from the vestibule of a rat inner ear. The tips of the 
stereocilia are labeled green due to the incorporation of GFP-actin monomers at their barbed ends. Taller stereocilia contain a longer column of GFP- 
labeled subunits, which reflects a more rapid incorporation of actin monomers. The stereocilia appear red due to labeling by rhodamine-labeled phalloidin, 
which binds to actin filaments. (d) A hair cell from the bullfrog inner ear. The localization of myosin Vila is indicated in green. The orange bands near the 
bases of the stereocilia (due to red and green overlap) indicate a concentration of myosin Vlla. (e) Myosin XVa (green) is localized at the tips of the stereocilia 
of a rat auditory hair cell. (f) Scanning electron micrograph of the hair cells of the cochlea of a control mouse. The stereocilia are arranged in V-shaped rows. 
(g) A corresponding micrograph of the hair cells of a mouse with mutations in the gene encoding myosin VI. The stereocilia of the hair cells exhibit a 
disorganized arrangement. Both myosin VI and myosin Vila are required for proper hair cell development and maintenance. 

Source: (a) T. Hasson, Curr Biology 9:22, 1999; with permission from Elsevier Science; (b) Courtesy of A.J. Hudspeth, R.A. Jacobs and P.G. Gillespie; (c) From 
A. K. Rzadzinska, et al., J. Cell Biol., vol. 164, 891, 892, 2004, Figure 4, Reproduced with permission of the Rockefeller University Press. Courtesy of B. 
Kachar; (d) From Peter Gillespie and T. Hasson et al., J. Cell Biol. vol. 137, cover #6, 1997. Reproduced with permission of Rockefeller University Press; 

(e) From A. K. Rzadzinska, et al., J. Cell Biol., vol. 164, 891, 892, 2004, Figure 5, Reproduced with permission of the Rockefeller University Press. Courtesy 
of B. Kachar; (f) From Tim Self, et al., courtesy of Karen P. Steel, Develop. 125:560, 1998; reproduced by permission of The Company of Biologists, Ltd. 
http://dev.biologists.org/content/125/4/557. full.odf+html?sid=1524ec24-e6c0-4 1 3f-9c05-ac2111083a85. 
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346 multiple nuclei because each fiber is a product of the fusion of large 
numbers of mononucleated myoblasts (premuscle cells) in the embryo. 


Organization of Sarcomeres 


Skeletal muscle cells may have the most highly ordered internal struc- 
ture of any cell in the body. A longitudinal section of a muscle fiber 
(FIGURE 9.47) reveals a cable made up of hundreds of thinner, cylindri- 
cal strands, called myofibrils. Each myofibril consists of a repeating 
linear array of contractile units, called sarcomeres. Each sarcomere 
in turn exhibits a characteristic banding pattern, which gives the mus- 
cle fiber a striped or striated appearance. Examination of stained mus- 
cle fibers in the electron microscope shows the banding pattern to be 
the result of the partial overlap of two distinct types of filaments, thin 
filaments and thick filaments (FIGURE 9.48a). Each sarcomere 
extends from one Z line to the next Z line and contains several dark 
bands and light zones. A sarcomere has a pair of lightly staining I bands 
located at its outer edges, a more densely staining A band located 
between the outer I bands, and a lightly staining H zone located in the 
center of the A band. A densely staining M line lies in the center of the 
H zone. The I band contains only thin filaments, the H zone only thick 
filaments, and that part of the A band on either side of the H zone 
represents the region of overlap and contains both types of filaments. 
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FIGURE 9.47 The structure of skeletal muscle. (a) Levels of organization 
of a skeletal muscle. (6) Light micrograph of a multinucleated muscle fiber. 
(c) Electron micrograph of a sarcomere with the bands lettered. 

Source: (b) Eric Grave/Photo Researchers, Inc.; (c) Don W. Fawcett/Photo 
Researchers, Inc. 


Cross sections through the region of overlap show that the thin 
filaments are organized in a hexagonal array around each thick fila- 
ment and that each thin filament is situated between two thick fila- 
ments (Figure 9.48b). Longitudinal sections show the presence of 
projections from the thick filaments at regularly spaced intervals. 
The projections represent cross-bridges capable of forming attach- 
ments with neighboring thin filaments. 


The Sliding Filament Model 
of Muscle Contraction 


All skeletal muscles operate by shortening; there is no other way they 
can perform work. The units of shortening are the sarcomeres, whose 
combined decrease in length accounts for the decrease in length of 
the entire muscle. The most important clue to the mechanism under- 
lying muscle contraction came from observations of the banding pat- 
tern of the sarcomeres at different stages in the contractile process. 
Asa muscle fiber shortened, the A band in each sarcomere remained 
essentially constant in length, while the H band and I bands decreased 
in width and then disappeared altogether. As shortening progressed, 
the Z lines on both ends of the sarcomere moved inward until they 
contacted the outer edges of the A band (FIGURE 9.49). 
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FIGURE 9.48 The contractile machinery of a sarcomere. (a) Diagram 


Thin filament 


of a sarcomere showing the overlapping array of thin actin-containing (orange) and 


thick myosin-containing (purple) filaments. The small transverse projections on the myosin fiber represent the myosin heads (cross-bridges). (b) Electron 
micrograph of a cross section through an insect flight muscle showing the hexagonal arrangement of the thin filaments around each thick filament. 


Source: (b) J. Auber/Photo Researchers, Inc. 
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FIGURE 9.49 The shortening of the sarcomere during muscle contraction. (a) Schematic diagram showing the difference in the structure of the 
sarcomere in a relaxed and contracted muscle. During contraction, the myosin cross-bridges make contact with the surrounding thin filaments, and the thin 
filaments are forced to slide toward the center of the sarcomere. Cross-bridges work asynchronously, so that only a fraction are active at any given instant. 


(b) Electron micrographs of longitudinal sections through a relaxed (top) 


and contracted (bottom) sarcomere. The micrographs show the disappearance of 


the H zone as the result of the sliding of the thin filaments toward the center of the sarcomere. 


Source: (b) James E. Dennis/Phototake 
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Based on these observations, two groups of British investigators, 
Andrew Huxley and Rolf Niedergerke, and Hugh Huxley and Jean 
Hanson, proposed a far-reaching model in 1954 to account for mus- 
cle contraction. They proposed that the shortening of individual sar- 
comeres did not result from the shortening of the filaments, but 
rather from their sliding over one another. The sliding of the thin fila- 
ments toward the center of the sarcomere would result in the observed 
increase in overlap between the filaments and the decreased width of 
the I and H bands (Figure 9.49). The sliding-filament model was rap- 
idly accepted, and evidence in its favor continues to accumulate. 


THE COMPOSITION AND ORGANIZATION OF THICK 
AND THIN FILAMENTS In addition to actin, the thin fila- 
ments of a skeletal muscle contain two other proteins, tropomyosin 
and troponin (FIGURE 9.50). Tropomyosin is an elongated molecule 
(approximately 40 nm long) that fits securely into the grooves within 
the thin filament. Each rod-shaped tropomyosin molecule is associated 
with seven actin subunits along the thin filament (Figure 9.50). 
Troponin is a globular protein complex composed of three subunits, 
each having an important and distinct role in the overall function of 
the molecule. Troponin molecules are spaced approximately 40 nm 
apart along the thin filament and contact both the actin and tropo- 
myosin components of the filament. The actin filaments of each half 
sarcomere are aligned with their barbed ends linked to the Z line. 
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FIGURE 9.50 The molecular organization of the thin filaments. Each 
thin filament consists of a helical array of actin subunits with rod-shaped 
tropomyosin molecules situated in the grooves and troponin molecules 
spaced at defined intervals, as described in the text. The positional changes 
in these proteins that trigger contraction are shown in Figure 9.63. 


Each thick filament is composed of several hundred myosin II 
molecules together with small amounts of other proteins. Like the 
filaments that form in vitro (see Figure 9.43), the polarity of the 
thick filaments of muscle cells is reversed at the center of the sar- 
comere. The center of the filament is composed of the opposing tail 
regions of the myosin molecules and is devoid of heads. The myosin 
heads project from each thick filament along the remainder of its 
length due to the staggered position of the myosin molecules that 
make up the body of the filament (Figure 9.43). 

The third most abundant protein of vertebrate skeletal muscles is 
titin, which is the largest protein yet to be discovered in any organism. 
The full-length titin gene (which can give rise to isoforms of different 
length) encodes a polypeptide more than 3.5 million Daltons in 
molecular mass and containing more than 38,000 amino acids. Titin 
molecules originate at the center line of each sarcomere, extend along 
the myosin filament and terminate at the Z line (FIGURE 9.51). Titin is 
a highly elastic protein that stretches like a molecular spring as certain 
regions within the molecule become uncoiled. Titin is thought to 
prevent the sarcomere from becoming pulled apart during muscle 
stretching. Titin also maintains myosin filaments in their proper posi- 
tion within the center of the sarcomere during muscle contraction. 


THE MOLECULAR BASIS OF CONTRACTION Following 
the formulation of the sliding-filament hypothesis, attention turned to 
the heads of the myosin molecules as the force-generating components 
of the muscle fiber. During a contraction, each myosin head extends 
outward and binds tightly to a thin filament, forming the cross-bridges 
seen between the two types of filaments (Figure 9.49). The heads from 
a single myosin filament interact with six surrounding actin filaments. 
While it is bound tightly to the actin filament, the myosin head under- 
goes a conformational change (described below) that moves the thin 
actin filament approximately 10 nm toward the center of the sarcomere. 
Unlike myosin V (depicted in Figure 9.44), muscle myosin (i.e., myosin 
II) is a nonprocessive motor. Muscle myosin remains in contact with its 
track, in this case a thin filament, for only a small fraction (less than 5 
percent) of the overall cycle. However, each thin filament is contacted. 
by a team of a hundred or so myosin heads that beat out of synchrony 
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FIGURE 9.51 The arrangement of titin molecules within the sarcomere. These huge elastic molecules stretch from the end of the sarcomere at the Z 
line to the M band in the sarcomere center. Titin molecules are thought to maintain the thick filaments in the center of the sarcomere during contraction. The 
|-band portion of the titin molecule contains spring-like domains and is capable of great elasticity. The nebulin molecules (which are not discussed in the 


text) are thought to act like a “molecular ruler” by regulating the number of actin monomers that are allowed to assemble into a thin filament. 


Source: T.C.S. Keller, Curr. Opin Cell Biol. 7:33, 1995. Current Opinion in Cell Biology by Elsevier Ltd., Current Opinion Journals. Reproduced with 
permission of Elsevier Ltd., Current Opinion Journals in the format journal via copyright clearance center. 


with one another (Figure 9.49a). Consequently, the thin filament 
undergoes continuous motion during each contractile cycle. It is esti- 
mated that a single thin filament in a muscle cell can be moved several 
hundred nanometers during a period as short as 50 milliseconds. 
Muscle physiologists have long sought to understand how a sin- 
gle myosin molecule can move an actin filament approximately 10 
nm ina single power stroke. The publication of the first atomic struc- 
ture of the S1 myosin II fragment in 1993 by Ivan Rayment, Hazel 
Holden, and their colleagues at the University of Wisconsin focused 
attention on a proposal to explain its mechanism of action. In this 
proposal, the energy released by ATP hydrolysis induces a small (0.5 
nm) conformational change within the head while the head is tightly 
bound to the actin filament. The small movement within the head is 
then amplified approximately 20-fold by the swinging movement of 
an adjoining a-helical neck (FIGURE 9.52). According to this hypoth- 
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FIGURE 9.52 Model of the swinging lever arm of a myosin II molecule. 
(a) During the power stroke, the neck of the myosin molecule moves 
through a rotation of approximately 70°, which would produce a movement 
of the actin filament of approximately 10 nm. (b) A model of the power 
stroke of the myosin motor domain consisting of the motor domain (or 
head) and adjoining neck (or lever arm). An attached actin filament is shown 
in gray/brown. The long helical neck is shown in two positions, both before 
and after the power stroke (depicted as the upper dark blue and lower light 
blue necks, respectively). It is this displacement of the neck region that is 
thought to power muscle movement. The essential and regulatory light 
chains, which wrap around the neck, are shown in yellow and magenta, 
respectively. 

Source: From Malcolm Irving, et al., Nature Struct. Biol. 7:482. 2000; 
Reprinted by permission from Macmillan Publishers, Ltd. Courtesy Malcolm 
Irving, lvan Rayment, and Carolyn Cohen. 


esis, the elongated myosin II neck acts as a rigid “lever arm’ causing 349 


the attached actin filament to slide a much greater distance than 
would otherwise be possible. The two light chains, which are wrapped 
around the neck, are thought to provide rigidity for the lever. 

Support for the “lever-arm” proposal was initially obtained 
through a series of experiments by James Spudich and colleagues at 
Stanford University. These researchers constructed genes that encoded. 
altered versions of the myosin II molecule, which contained necks of 
different length. The genetically engineered myosin molecules were 
then tested in an in vitro motility assay with actin filaments, similar 
to that depicted in Figure 9.42. As predicted by the lever-arm hypoth- 
esis, the apparent length of the power stroke of the myosin molecules 
was directly proportional to the length of their necks. Myosin mole- 
cules with shorter necks generated smaller displacements, whereas 
those with longer necks generated greater displacements. Not all stud- 
ies have supported the correlation between step size and neck length, 
so the role of the lever-arm remains a matter of controversy. 


THE ENERGETICS OF FILAMENT SLIDING Like the 
other motor proteins kinesin and dynein, myosin converts the chemical 
energy of ATP into the mechanical energy of filament sliding. Each 
cycle of mechanical activity of the myosin cross-bridge takes about 50 
msec and is accompanied by a cycle of ATPase activity, as illustrated in 
the model shown in FIGURE 9.53. According to this model, the cycle 
begins as a molecule of ATP binds to the myosin head, an event that 
induces the dissociation of the cross-bridge from the actin filament 
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FIGURE 9.53 A schematic model of the actinomyosin contractile cycle. 
The movement of the thin filament by the force-generating myosin head 
occurs as the result of a linkage between a mechanical cycle involving the 
attachment, movement, and detachment of the head and a chemical cycle 
involving the binding, hydrolysis, and release of ATP, ADP, and P.. In this 
model, the two cycles begin in step 1 with the binding of ATP to a cleft in 
the myosin head, causing the detachment of the head from the actin 
filament. The hydrolysis of the bound ATP (step 2) energizes the head, 
causing it to bind weakly to the actin filament (step 3). The release of P, 
causes a tighter attachment of the myosin head to the thin filament and the 
power stroke (step 4) that moves the thin filament toward the center of the 
sarcomere. The release of the ADP (step 5) sets the stage for another cycle. 


Source: M. Y. Jiang & M. P. Scheetz, Bioessays 16:532, 1994. 
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occurs before the myosin head makes contact with the actin filament. 
The products of ATP hydrolysis, namely, ADP and P,, remain bound to 
the active site of the enzyme, while the energy released by hydrolysis is 
absorbed by the protein as a whole (Figure 9.53, step 2). At this point, 
the cross-bridge is in an energized state, analogous to a stretched spring 
capable of spontaneous movement. The energized myosin then attaches 
to the actin molecule (step 3) and releases its bound phosphate, which 
triggers a large conformational change driven by the stored free energy 
(step 4). This conformational change shifts the actin filament toward 
the center of the sarcomere. This movement represents the power stroke 
of the myosin head as shown in Figure 9.52. The release of the bound 
ADP (step 5) is followed by the binding of a new ATP molecule so that 
a new cycle can begin. In the absence of ATP, the myosin head remains 
tightly bound to the actin filament. The inability of myosin cross- 
bridges to detach in the absence of ATP is the basis for the condition of 
rigor mortis, the stiffening of muscles that ensues following death. 


EXCITATION-CONTRACTION COUPLING Muscle fib- 
ers are organized into groups termed motor units. All the fibers of a 
motor unit are jointly innervated by branches from a single motor 
neuron and contract simultaneously when stimulated by an impulse 
transmitted along that neuron. The point of contact of a terminus of 
an axon with a muscle fiber is called a neuromuscular junction 
(FIGURE 9.54; see also Figure 4.56 for a closer view of the structure of 
the synapse). The neuromuscular junction is a site of transmission of 
the nerve impulse from the axon across a synaptic cleft to the muscle 
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fiber, whose plasma membrane is also excitable and capable of con- 
ducting an action potential. 

The steps that link the arrival of a nerve impulse at the muscle 
plasma membrane to the shortening of the sarcomeres deep within 
the muscle fiber constitute a process referred to as excitation- 
contraction coupling. Unlike a neuron, where an action potential 
remains at the cell surface, the impulse generated in a skeletal muscle 
cell is propagated into the interior of the cell along membranous 
folds called transverse (T) tubules (Figure 9.54). The T tubules 
terminate in very close proximity to a system of cytoplasmic mem- 
branes that make up the sarcoplasmic reticulum (SR), which 
forms a membranous sleeve around the myofibril. Approximately 80 
percent of the integral protein of the SR membrane consists of Ca’*- 
ATPase pump, whose function is to transport Ca’* out of the cytosol 
and into the lumen of the SR, where it is stored until needed. 

The importance of calcium in muscle contraction was first 
shown in 1882 by Sydney Ringer, an English physician. Ringer found 
that an isolated frog heart would contract in a saline solution made 
with London tap water, but failed to contract in a solution made with 
distilled water. Ringer determined that calcium ions present in tap 
water were an essential factor in muscle contraction. In the relaxed 
state, the Ca”* levels within the cytoplasm of a muscle fiber are very 
low (approximately 2 x 10-’ M)—below the threshold concentration 
required for contraction. With the arrival of an action potential by 
way of the transverse tubules, calcium channels in the SR membrane 
are opened, and calcium diffuses out of the SR compartment and 
over the short distance to the myofibrils. As a result, the intracellular 
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FIGURE 9.54 The functional anatomy of a muscle fiber. Calcium is housed in the elaborate network of internal membranes that make up the sarcoplas- 
mic reticulum (SR). When an impulse arrives by means of a motor neuron, it is carried into the interior of the fiber along the membrane of the transverse 
tubule to the SR. The calcium gates of the SR open, releasing calcium into the cytosol. The binding of calcium ions to troponin molecules of the thin 


filaments leads to the events described in the following figure and the contraction of the fiber. 
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FIGURE 9.55 The role of tropomyosin in muscle contraction. Schematic 
diagram of the steric hindrance model in which the myosin-binding site on 
the thin actin filaments is controlled by the position of the tropomyosin molecule. 
When calcium levels rise, the interaction between calcium and troponin (not 
shown) leads to a movement of the tropomyosin from position b to position a, 
which exposes the myosin-binding site on the thin filament to the myosin head. 


Ca** levels rise to about 5 x 10-° M. To understand how elevated 
calcium levels trigger contraction in a skeletal muscle fiber, it is 
necessary to reconsider the protein makeup of the thin filaments. 

When the sarcomere is relaxed, the tropomyosin molecules of 
the thin filaments (see Figure 9.50) block the myosin-binding sites 
on the actin molecules. The position of the tropomyosin within the 
groove is under the control of the attached troponin molecule. When 
Ca** levels rise, these ions bind to one of the subunits of troponin 
(troponin C), causing a conformational change in another subunit of 
the troponin molecule. Like the collapse of a row of dominoes, the 
movement of troponin is transmitted to the adjacent tropomyosin, 
which moves approximately 1.5 nm closer to the center of the fila- 
ment’s groove (from position b to a in FIGURE 9.55). This shift in 
position of the tropomyosin exposes the myosin-binding sites on the 
adjacent actin molecules, allowing the myosin heads to attach to the 
thin filaments. Each troponin molecule controls the position of one 
tropomyosin molecule, which in turn controls the binding capacity 
of seven actin subunits in the thin filament. 

Once stimulation from the innervating motor nerve fiber 
ceases, the Ca”* channels in the SR membrane close, and the Ca?*- 
ATPase molecules in that membrane remove excess calcium from 
the cytosol. As the Ca”* concentration decreases, these ions dissoci- 
ate from their binding sites on troponin, which causes the tropo- 
myosin molecules to move back to a position where they block the 
actin-myosin interaction. The process of relaxation can be thought 
of as a competition for calcium between the transport protein of the 
SR membrane and troponin. The transport protein has a greater 
affinity for the ion, so it preferentially removes it from the cytosol, 
leaving the troponin molecules without bound calcium. 
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1. Describe the structure of the sarcomere of a skeletal 
muscle myofibril and the changes that occur during its 
contraction. 


2. Describe the steps that occur between the time that a 
nerve impulse is transmitted across a neuromuscular 
junction to the time that the muscle fiber begins to 


shorten. What is the role of calcium ions in this process? 
go 


9.13 Actin-Binding Proteins 


The highly ordered structures seen in muscle cells represent one way 
that actin networks can be organized in cells to orchestrate contrac- 
tility and movement. Actin networks can also form more labile, tran- 
sient arrangements that can power cells to crawl, amoeba-like, along 
a substratum. These structures are typically seen in the cell cortex, a 
thin region directly beneath the plasma membrane. 

The cortex is an active region of the cell, responsible for such 
processes as the ingestion of extracellular materials, the extension of 
processes during cell movement, and the constriction of a single 
animal cell into two cells during cell division. All of these processes 
are dependent on the assembly of actin filaments in the cortex. In the 
following pages, we will consider a number of examples of actin- 
based cell motility. First, however, it is important to survey the fac- 
tors that regulate nonmuscle actin networks. 

Purified actin can polymerize in vitro to form actin filaments, but 
such filaments cannot interact with one another or perform useful 
activities. Under the microscope, they resemble the pieces of straw 
strewn on a barn floor. In contrast, actin filaments in living cells are 
organized into a variety of patterns, including various types of bundles 
as well as crosslinked and branched networks (FIGURE 9.56). The 
organization and behavior of actin filaments inside cells are deter- 
mined by interaction of actin with a remarkable variety of actin- 
binding proteins. These proteins affect the localized assembly or 
disassembly of the actin filaments, their physical properties, and their 
interactions with one another and with cellular organelles. More than 
100 different actin-binding proteins belonging to numerous families 


FIGURE 9.56 Two different arrangements of actin filaments within a 
cell. This electron micrograph of the lamellipod of a motile cell shows actin 
filaments arranged into a dense parallel bundle, called the filopodium (top), 
as well as a crosslinked dendritic network in the lamellipod (bottom half). 
Source: Courtesy of Tatyana Svitkina, University of Pennsylvania 


351 


sula}01g BuIpUlg-UlDY © E16 


352 


AYWNOW []2D Pue UOI]axSOMD EYL e 6 YILdWHO 


t3) 


End-blocking (capping) 


2) 
Monomer- 
sequestering 


Filament 
nucleating 


Monomers 


Membrane-binding 


FIGURE 9.57 The roles of actin-binding proteins. 


have been isolated from different cell types. Actin-binding proteins 
can be divided into several categories based on their presumed 
function in the cell (FIGURE 9.57).* 


1. Nucleating proteins. The slowest step in the formation of an actin 
filament is the first step, nucleation, which requires that at least 
three actin monomers come together in the proper orientation to 
begin formation of the polymer. This is a very unfavorable pro- 
cess for actin molecules left on their own. As noted earlier, the 
formation of an actin filament is accelerated by the presence of a 
preexisting seed or nucleus to which monomers can be added (as 
in Figure 9.39a). In the cell, several actin nucleators have been 
found that can greatly enhance the rate at which polymers form 
from actin monomers. The best studied is the Arp2/3 complex, 
which contains two “actin-related proteins, (or Arps)—proteins 
that share considerable sequence homology with actins but are 
not considered “true” actins. Arp2/3 is activated by binding to the 
side of an actin filament and to an activating protein, which causes 
the two Arps to adopt a conformation that provides a template to 
which actin monomers can be added, analogous to the way that 
y-tubulin is proposed to form a template for microtubule nuclea- 
tion (Figure 9.14c). As shown in Figure 9.62, the Arp2/3 complex 
generates networks of branched actin filaments. Another family 
of nucleating proteins, the formins, generate unbranched fila- 
ments, such as those found at focal adhesions (Section 7.6) and the 
contractile rings of dividing cells (Section 14.11). Unlike Arp2/3, 
which remains at the pointed end of the newly formed filament, 
formins track with the barbed end even as new subunits are 


“It should be noted that some of these proteins can carry out more than one of the 
types of activities listed, depending on the concentration of the actin-binding protein 
and the prevailing conditions (e.g., the concentration of Ca** and H'*). Most studies of 
these proteins are conducted in vitro, and it is often difficult to extend the results to 
activities within the cell. 
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inserted at that site. Thus not only do formins nucleate actin fila- 
ments, they can promote very rapid elongation of the filaments 
that they helped create. Another class of actin nucleators, whose 
founding member is the protein Spire, contains a cluster of actin- 
monomer binding domains that are thought to bring together 
multiple actin monomers in order to form an actin nucleus. 


. Monomer-sequestering proteins. Thymosins are proteins that bind 


to actin-ATP monomers and prevent them from polymerizing. 
Proteins with this activity are described as actin monomer- 
sequestering proteins. These proteins are believed responsible for 
the relatively high concentration of monomeric actin in nonmus- 
cle cells (50-200 mM). Without monomer-sequestering proteins, 
conditions within the cytoplasm would favor the near complete 
polymerization of soluble actin monomers into filaments. 
Because of their ability to bind to monomeric actin and stabilize 
the monomer pool, changes in the concentration or activity of 
monomer-sequestering proteins can shift the monomer-polymer 
equilibrium in a certain region of a cell and determine whether 
polymerization or depolymerization is favored at the time. 


. Capping proteins. Proteins of this group regulate the length of actin 


filaments by binding to one or the other end of the filaments, 
forming a cap that blocks both loss and gain of subunits. If the fast- 
growing, barbed end of a filament is capped, depolymerization 
may proceed at the opposite end, resulting in the disassembly of the 
filament. If the pointed end is also capped, depolymerization is 
blocked. The thin filaments of striated muscle are capped at their 
barbed end at the Z line by a protein called capZ and at their 
pointed end by the protein tropomodulin. If the tropomodulin cap 
is disturbed by microinjection of antibodies into a muscle cell, the 
thin filaments add more actin subunits at their newly exposed 
pointed end and undergo dramatic elongation into the middle of 
the sarcomere. 


4. Monomer-binding proteins. Profilin is a small abundant protein 
that binds to the same site on an actin monomer as does thymo- 
sin. However, rather than inhibiting polymerization, profilin 
promotes the growth of actin filaments. Profilin does this by 
attaching to an actin monomer and catalyzing the dissociation of 
its bound ADP, which is rapidly replaced with ATP. The profilin- 
ATP-actin monomer can then assemble onto the free barbed end 
of a growing actin filament, which leads to the release of profilin. 
Filament elongation at the barbed end is further promoted by 
members of the Ena/VASP protein family. 


5. Depolymerizing proteins. Members of the cofilin family of pro- 
teins (including cofilin, ADE, and depactin) bind to actin-ADP 
subunits present within the body and at the pointed end of actin 
filaments (see Figure 18.33). Cofilin has two apparent activities: 
it can fragment actin filaments, and it can promote their depo- 
lymerization at the pointed end. These proteins play a role in the 
rapid turnover of actin filaments at sites of dynamic changes in 
cytoskeletal structure. They are essential for cell locomotion, 
phagocytosis, and cytokinesis. 


6. Cross-linking proteins. Proteins of this group are able to alter the 
three-dimensional organization of a population of actin filaments. 
Each of these proteins has two or more actin-binding sites and 
therefore can cross-link two or more separate actin filaments. 
Some of these proteins (e.g., filamin) have the shape of a long, 
flexible rod and promote the formation of loose networks of fila- 
ments interconnected at near right angles to one another (as in 
Figure 9.56). Regions of the cytoplasm containing such networks 
have the properties of a three-dimensional elastic gel that resists 
local mechanical pressures. Other crosslinking proteins (e.g., vil- 
lin and fimbrin) have a more globular shape and promote the 
bundling of actin filaments into tightly knit, parallel arrays. Such 
arrays are found in the microvilli that project from certain epithelial 
cells (FIGURE 9.58) and in the hair-like stereocilia (Figure 9.46) 
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FIGURE 9.58 Actin filaments and actin-binding proteins in a microvillus. 
Microvilli are present on the apical surface of epithelia that function in 
absorption of solutes, such as the lining of the intestine and wall of the 
kidney tubule. Actin filaments are maintained in a highly ordered arrange- 
ment by the bundling proteins villin and fimbrin. The role of myosin |, which 
is present between the plasma membrane of the microvillus and the 
peripheral actin filaments, remains unclear. 


that project from receptor cells of the inner ear. Bundling fila- 353 


ments together adds to their rigidity, allowing them to act as a 
supportive internal skeleton for these cytoplasmic projections. 


7. Filament-severing proteins. Proteins of this class have the ability 
to bind to the side of an existing filament and break it in two. 
Severing proteins (e.g., gelsolin) may also promote the incorpo- 
ration of actin monomers by creating additional free barbed 
ends, or they may cap the fragments they generate. As indicated 
in Figure 9.63, cofilin is also capable of severing filaments. 


8. Membrane-binding proteins. Much of the contractile machinery 
of non-muscle cells lies just beneath the plasma membrane. 
During numerous activities, the forces generated by the contrac- 
tile proteins act on the plasma membrane, causing it to protrude 
outward (as occurs, for example, during cell locomotion) or to 
invaginate inward (as occurs, for example, during phagocytosis 
or cytokinesis). These activities are generally facilitated by link- 
ing the actin filaments to the plasma membrane indirectly, by 
means of attachment to a peripheral membrane protein. Two 
examples were described in previous chapters: the inclusion of 
short actin polymers into the membrane skeleton of erythrocytes 
(see Figure 4.32d), and the attachment of actin filaments to the 
membrane at focal adhesions and adherens junctions (see Figures 
7.17 and 7.26). Proteins that link membranes to actin include 
vinculin, members of the ERM family (ezrin, radixin, and 
moesin), and members of the spectrin family (including dystro- 
phin, the protein responsible for muscular dystrophy). 


REVIEW | 


1. List the various types of actin-binding proteins and 
one function of each type. 


e 


9.14 Cellular Motility 


Actin filaments, often working in conjunction with myosin motors 
and the actin-binding proteins described above, are responsible for a 
variety of dynamic activities in non-muscle cells, including cytoki- 
nesis, phagocytosis, cytoplasmic streaming (the directed bulk flow 
of cytoplasm that occurs in certain large plant cells), vesicle traffick- 
ing, blood platelet activation, lateral movements of integral proteins 
within membranes, cell-substratum interactions, cell locomotion, 
axonal outgrowth, and changes in cell shape. Cellular motility is 
illustrated by the following examples. 

Cell locomotion is required for many activities in higher verte- 
brates, including tissue and organ development, formation of blood 
vessels, development of axons, wound healing, and protection 
against infection. Cell locomotion also contributes to the spread of 
cancerous tumors. In the following discussion, we will concentrate 
on studies of cultured cells moving over a flat (i.e., two-dimensional) 
substrate because these are the experimental conditions that have 
dominated this field of research. Keep in mind that cells in the body 
don’t move over bare, flat substrates, and there is increasing evidence 
that some of the findings from these studies may not apply to cells 
traversing more complicated terrain. Researchers have recently 
begun to develop more complex substrata, including various types of 
three-dimensional extracellular matrices (see Figure 18.22), which 
may lead to new insights into some of the aspects of the mechanism 
of cell locomotion discussed below. 
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FIGURE 9.59 Scanning electron micrograph of a mouse fibroblast 
crawling over the surface of a culture dish. The leading edge of the cell is 
spread into a flattened lamellipodium whose structure and function are 
discussed later in the chapter. 


Source: From Guenter Albrecht-Buehler, Int. Rev. Cytol. 120:194, 1990. 


FIGURE 9.59 shows a single fibroblast that was in the process of 
moving toward the lower right corner of the field when it was pre- 
pared for microscopy. Cell locomotion, as exhibited by the fibroblast 
in Figure 9.59, shares properties with other types of locomotion, for 
example, walking. As you walk, your body performs a series of repeti- 
tive activities: first, a leg is extended in the direction of locomotion; 
second, the bottom of your foot makes contact with the ground, 
which acts as a point of temporary adhesion; third, force is generated 
by the muscles of your legs that moves your entire body forward past 
the stationary foot, all the while generating traction against the point 
of adhesion; fourth, your foot—which is now behind your body rather 
than in front of it—is lifted from the ground in anticipation of your 
next step. Even though motile cells may assume very different shapes 
as they crawl over a substratum, they display a similar sequence of 
activities (FIGURE 9.60). (1) Movement is initiated by the protrusion 
ofa part of the cell surface in the direction in which the cell is to move. 
(2) A portion of the lower surface of the protrusion attaches to the 
substratum, forming temporary sites of anchorage. (3) The bulk of the 
cell is pulled forward over the adhesive contacts, which eventually 
become part of the rear of the cell. (4) The cell breaks its rear contacts 
with the substratum, causing retraction of the trailing edge, or “tail” 

When a small piece of living tissue, such as skin or liver, is placed 
in a culture dish in an appropriate culture medium, individual cells 
migrate out of the specimen and onto the surface of the dish. 
Examination of these cells under the microscope typically shows them 
to be fibroblasts, which are the predominant cells present in connec- 
tive tissue (see Figure 7.1). As it moves, a fibroblast flattens itself close 
to the substratum and becomes fan-shaped, with a broadened frontal 
end anda narrow “tail” (as in Figure 9.59). Its movement is erratic and 
jerky, sometimes advancing and other times withdrawing. On a good 
day, a fibroblast may move a distance of about 1 mm. The key to the 
fibroblast’s locomotion is seen by examining its leading edge, which is 
extended out from the cell as a broad, flattened, veil-like protrusion, 
called a lamellipodium (FIGURE 9.61a). Lamellipodia are typically 
devoid of cytoplasmic vesicles and other particulate structures, and 
the outer edge often exhibits an undulating motion, giving it a ruffled 
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FIGURE 9.60 The repetitive sequence of activities that occurs as a cell 
crawls over the substratum. Step 1 shows the protrusion of the leading 
edge of the cell in the form of a lamellipodium. Step 2 shows the adhesion 
of the lower surface of the lamellipodium to the substratum, an attachment 
that is mediated by integrins residing in the plasma membrane. The cell 
uses this attachment to grip the substratum. Step 3 shows the movement of 
the bulk of the cell forward over the site of attachment, which remains 
relatively stationary. This movement is accomplished by a contractile 
(traction) force exerted against the substratum. Step 4 shows the cell after 
the rear attachments with the substratum have been severed and the 
trailing portion of the cell has been pulled forward. 


appearance (Figure 9.61b). As a lamellipodium is extended from the 
cell, it adheres to the underlying substratum at specific points, provid- 
ing temporary sites of anchorage for the cell to pull itself forward. 

We will see in Section 9.15 how the polymerization of actin 
monomers can provide the force that propels a Listeria bacterium 
through the cytoplasm. This type of intracellular movement is 
accomplished without the involvement of molecular motors. A simi- 
lar type of actin-polymerization mechanism is thought to provide 
the motile force required for the protrusion of the leading edge of a 
lamellipodium. This type of motility also demonstrates the impor- 
tance of actin-binding proteins (depicted in Figure 9.57) in orches- 
trating the assembly and disassembly of actin-filament networks at a 
particular site within the cell at a particular time. 

Suppose we begin with a rounded white blood cell that receives 
a chemical signal coming from one particular direction where the 
body has been infected. Once the stimulus is received at the plasma 
membrane, it triggers the localized polymerization of actin, which 
leads to the polarization of the cell and its movement toward the 
source of the stimulus (FIGURE 9.62a).° Just as Listeria has a protein 
(ActA) that activates polymerization at the bacterial cell surface, 
mammalian cells have a family of proteins (the WASP/WAVE family) 
that activates the Arp2/3 complex at the site of stimulation near the 
plasma membrane. WASP, the founding member of the family, was 
discovered as the product of a gene responsible for Wiskott-Aldrich 
syndrome. Patients with this disorder have a crippled immune sys- 
tem because their white blood cells lack a functional WASP protein 
and consequently fail to respond to chemotactic signals. 


>This type of response can be seen in a remarkable film on the Web showing a neutrophil 
chasing a bacterium. It can be found using the search words: “neutrophil crawling” 
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FIGURE 9.61 The leading edge of a motile cell. (a) The leading edge of this motile fibroblast is flattened against the substratum and spread out into a 
veil-like lamellipodium. (b) Scanning electron micrograph of the leading edge of a cultured cell, showing the ruffled membranes of the lamellipodium. 


Source: (a) Courtesy of J. Victor. Small; (b) From Jean Paul Revel, Symp. Soc. Exp. Biol.28:4. 
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FIGURE 9.62 Directed cell motility. (a) Micrograph of a white blood cell (a neutrophil) that has responded to a chemoattractant being delivered by a 
pipette (seen on the right). The cell has become polarized and is moving toward the source of the stimulus. (6) A proposed mechanism for the 

movement of a cell in a directed manner. A stimulus is received at the cell surface (step 1) that leads to the activation of Aro2/3 complex by a member 

of the WASP/WAVE family (step 2). Activated Aro2/3 complexes serve as nucleating sites for the formation of new actin filaments (step 3). Once filaments 
have formed, Arp2/3 complexes attach to the sides of the filaments (step 4), which stimulates their nucleating activity. As a result, the bound Arp2/3 
complexes initiate side branches that extend outward (step 5) at an angle of about 70° relative to the existing filaments to which they are anchored. As these 
filaments polymerize, they are thought to push the plasma membrane outward, resulting in extension of the leading edge of the lamellipodium. Meanwhile, 
the barbed end of previously formed filaments are bound by a capping protein, which prevents further growth of these filaments, 

keeping them short and rigid. Eventually, the pointed ends of the older, preexisting actin filaments undergo depolymerization, releasing ADP-actin 

subunits (step 6). Depolymerization is promoted by cofilin, which binds to ADP-actin subunits within the filament and stimulates dissociation of the 

subunits from the pointed end of the filament. Released subunits bind profilin and become recharged by ATP/ADP exchange, which makes them 

ready to engage in actin polymerization (as in step 3). (A discussion of the validity of this model can be found in Trends Cell Biol. 21:2, 2011 and 


Nature Cell Biol, 13: 1012, 2011.) 
Source: (a) From Carole A. Parent, Curr. Opin, Cell Biol, 16:5, 2004. © 2004, with permission from Elsevier. 
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Figure 9.62b depicts a model for the major steps in formation 
of a lamellipodium that would guide a cell in a particular direction. 
A stimulus is received at one end of the cell (step 1, Figure 9.620), 
which leads to the activation of Arp2/3 protein complexes by acti- 
vated WASP proteins (step 2). In its activated state, the Arp2/3 
complex binds to the side of an existing actin filament and also 
adopts a conformation that resembles the barbed end of an actin 
filament. As a result, free ATP-actin monomers bind to the Arp2/3 
template, leading to the formation of a new branched actin filament 
(step 3). Polymerization of ATP-bound actin monomers onto the 
free barbed ends of the growing filaments is promoted by profilin 
molecules (page 353). Other Arp2/3 complexes can then bind to 
the sides of these new filaments (step 4) and nucleate the formation 
of additional actin branches (step 5). The Arp2/3 complexes remain 
at the pointed ends, which are situated at the branchpoints. 
Meanwhile, growth of the barbed ends of older filaments is blocked 
by the addition of capping protein (step 5). In contrast, addition of 
actin subunits to the barbed ends of more recently formed fila- 
ments of the network pushes the membrane of the lamellipodium 
outward in the direction of the attractive stimulus (steps 5 and 6). 
As newer filaments are growing by addition of subunits to their 
barbed ends, the older capped filaments undergo disassembly from 
their pointed ends (step 6). Disassembly is promoted by cofilin, 
which binds to actin-ADP subunits along the filaments (step 6). 
Actin-ADP subunits released from the disassembling filaments are 
recharged into ATP-actin monomers through the action of profilin, 
and can then be reutilized in the assembly of new actin filaments at 
the leading edge. 

FIGURE 9.63 illustrates some of the major structural features of 
cell locomotion. The electron micrograph of Figure 9.63 shows the 
branched, cross-linked nature of the filamentous actin network that 


FIGURE 9.63 The structural basis of lamellipodial extension. Electron 
micrograph of a replica of the cytoskeleton at the leading edge of a motile 
mouse fibroblast. The actin filaments are seen to be arranged in a branched 
network, which has been colorized to indicate individual “trees.” The 


circular insets show a succession of Y-shaped junctions between branched 
actin filaments. Aro2/3 complexes are localized at the base of each branch 
by antibodies linked to colloidal gold particles (yellow). 


Source: From Tatyana M. Svitkina and Gary G. Borisy, fromJ. Cell Biol., 
Vol. 145, #5, 1999; Reproduced withpermission of the Rockefeller 
University Press. 


resides just beneath the plasma membrane of an advancing lamel- 
lipodium. The circular insets in Figure 9.63 show a succession of 
short actin-filament branches, with Arp2/3 complexes highlighted 
by immunogold labeling. The Arp2/3 complexes are seen to reside at 
the Y-shaped junctions where the newly polymerized filaments have 
branched off of preexisting filaments. 

Lamellipodial movement is a dynamic process. As actin fila- 
ment polymerization and branching continue at the very front edge 
of the lamellipodium, assembled actin filaments flow in a rearward 
direction and then depolymerize toward the back of the lamellipo- 
dium (step 6, Figure 9.62). Thus taken as a whole, the entire actin- 
filament array undergoes a type of treadmilling (page 340) in which 
actin subunits are added to barbed ends of the array at its front end 
and lost from pointed ends of the array toward the rear. 

According to the sequence of events depicted in Figure 9.62, 
protrusion of the leading edge is followed by movement of the bulk 
of the cell. The major forces involved in cell locomotion are those 
generated at sites of adhesion that are required to pull or “tow” the 
main body of the cell forward (step 3 of Figure 9.60). They are 
often described as “traction forces” because they occur at sites 
where the cell grips the substrate. When cells are allowed to migrate 
over a thin sheet of elastic material, movements of the cell are 
accompanied by deformation of the underlying substratum (see 
Figure 7.18). The magnitude of the traction forces exerted at vari- 
ous locations within a live migrating cell can be calculated from 
the dynamic patterns of substrate deformation and portrayed as 
shown in FIGURE 9.64a. As seen by examination of this computer- 
ized image of a migrating fibroblast, the greatest traction forces are 
exerted just behind the cell’s leading edge where the cell adheres 
strongly to the underlying substratum. The presence of these sites 
of attachment is best revealed by following the localization of fluo- 
rescently labeled vinculin, a major component of focal adhesions, 
within a living cell. This allows investigators to specifically visual- 
ize structures where the cell makes contact with the underlying 
substratum. Figure 9.64 is a fluorescence micrograph showing the 
presence of red fluorescent vinculin molecules concentrated 
just behind the leading edge of a migrating cell. The vinculin- 
containing sites where the leading edge of a migrating cell adheres 
to the substratum tend to be smaller and simpler than the mature 
focal adhesions seen in highly spread, stationary cultured cells and 
are often referred to as focal complexes. Traction forces are thought 
to be generated in the actin cytoskeleton associated with these 
adhesion sites and then transmitted to the extracellular substrate 
by way of the transmembrane integrin molecules that connect the 
inside and outside of the cell. The focal complexes that form near 
the leading edge of a motile cell either disassemble as the cell moves 
forward or mature into larger, more contractile focal adhesions. 
Maturation of focal complexes is probably stimulated by tension 
exerted on these adhesion sites. 

A large body of evidence indicates that actin polymerization is 
responsible for pushing the leading edge of a cell outward (step 1, 
Figure 9.60), whereas myosin (in conjunction with actin filaments) 
is responsible for pulling the remainder of the cell forward (step 3, 
Figure 9.60). These contrasting roles of actin and myosin are well 
illustrated in studies on fish keratocytes, which are cells derived 
from the epidermis that covers the fish’s scales. Keratocytes have 
been a favored system for studying locomotion because their rapid 
gliding movement depends on the formation of a very broad, thin 
lamellipodium. FIGURE 9.65 shows a moving keratocyte that has 
been fixed and stained for actin (Figure 9.65a) and myosin II 


(a) 20 um 
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FIGURE 9.64 Distribution of traction forces within a migrating fibroblast. 
(a) As a cell migrates it generates traction (pulling) forces against its 
substrate. The present image shows the traction forces generated per unit 
area by the surface of a migrating fibroblast. Traction forces were calculated 
at different sites on the surface based on the degree of substrate 
deformation (see Figure 7.17). The magnitude of the traction forces are 
expressed by varying colors with red representing the strongest forces. The 
largest forces are generated at sites of small focal complexes that form 
transiently behind the leading edge of the cell where the lamellipodium is 
being extended (arrow). Deformation at the rear of the cell (shown in red) 
occurs as the front end actively pulls against the tail, which is passively 
anchored. (b) A living, migrating fibroblast exhibiting a well-developed 
lamellipodium that is adhering to the underlying substratum at numerous 
sites (red). This cell is expressing GFP-actin (green) and had been injected 
with rhodamine-tagged vinculin (red). The fluorescently labeled vinculin is 
incorporated into dot-like focal complexes near the leading edge of the cell. 
Some of these focal complexes disassemble, whereas others mature into focal 
adhesions, which are situated farther from the advancing edge. 

Source: (a) From Karen A. Beningo, Micah Dembo, and Yu-Li Wang, J. Cell 
Biol, 153:885, 2001, Fig. 3. Reproduced with permission of the Rockefeller 
University Press; (b) From J. Victor Small, et al., Image courtesy of Olga 
Krylyshkina, Nature Revs.Mol. Cell Biol. 3:957, 2002. © 2002, reprinted with 
permission from Macmillan Publishers, Ltd. 
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FIGURE 9.65 The roles of actin and myosin in the lamellipodial-based 
movement of fish keratocytes. (a,b) Fluorescence micrographs of a fish 
keratocyte moving over a culture dish by means of a broad, flattened 
lamellipodium. The arrow shows the direction of movement, which can 
occur at rates of 10 um/min. The distribution of filamentous actin is revealed 
in part a, which shows the localization of fluorescently labeled phalloidin, 
which binds only to actin filaments. The distribution of myosin in the same 
cell is revealed in part b, which shows the localization of fluorescent 
antimyosin antibodies. It is evident that the body of the lamellipodium 
contains actin filaments but is virtually devoid of myosin. Myosin is 
concentrated, instead, in a band that lies just behind the lamellipodium, 
where it merges with the body of the cell. (c) A schematic drawing depicting 
the filamentous actin network of the lamellipodium and the actin-myosin 
interactions toward the rear of the lamellipodium. The actin network is 
shown in red, myosin molecules in blue. 


Source: (a) By Alexander B. Verkhovsky, from Tatyana M. Svitkina et al., 
J.Cell Biol. 139:397, 1977, Fig. 1. Reproduced with permission of The 
Rockefeller University Press. 
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advancing edge of the lamellipodium is filled with actin. Myosin, on 
the other hand, is concentrated in a band where the rear of the lamel- 
lipodium joins the remainder of the cell. Micrographs of this region 
show the presence of clusters of small, bipolar myosin II filaments 
bound to the actin network (Figure 9.65c). Contractile forces gener- 
ated by these myosin molecules are presumed to pull the bulk of the 
cell along behind the advancing lamellipodium. Myosin I and other 


unconventional myosins are also thought to generate forces for cell 
locomotion in some organisms. 


REVIEW | 


1. Describe the steps taken by a mammalian cell crawling 
over a substratum. 


9.15 EXPERIMENTAL PATHWAY 


Studying Actin-Based Motility without Cells 


The leading edge of a motile cell is a complex, crowded, and 
dynamic compartment, filled with a multitude of different types 
of proteins. Parsing out the roles of each of these proteins is of 
great interest to cell biologists, but difficult to accomplish within 
the context of a living cell. It is for this reason that for many years, 
research has focused on a simplified and pared-down system 
for cell motility—intracellular movement of the Gram-positive 
bacterium Listeria monocytogenes. The discovery and refinement 
of this model system has been an important experimental tool 
that has greatly augmented our understanding of how actin net- 
works are used to generate force in vivo. 

Listeria monocytogenes infection was first described in a 
report in 1926 by E.G.D. Murray and colleagues, who were study- 
ing the cause of an epidemic observed in rabbits and guinea 
pigs.' Listeriosis was later observed in humans, but it was not until 
the early 1980s that listeriosis was linked to the ingestion of con- 
taminated food products. Foodborne illness caused by Listeria 
can sometimes be severe, particularly in immunocompromised 
individuals, but it more often causes fever and gastrointestinal 
symptoms in healthy adults. 

Research into the cause of listeriosis began in earnest in the 
mid-1980s, leading multiple labs to discover that Listeria could 
enter and multiply within the cytosol of cells. Interestingly, some 
researchers, including Ed Havell at the Trudeau Institute and Daniel 
Portnoy at Washington University, observed that the infection could 
spread from one cell to its neighboring cells on a petri dish even 
in the presence of extracellular antibiotic, suggesting that the 
bacteria were somehow able to infect new cells directly from their 
previous host cell.2 Portnoy suspected that the mode of infection 
might be actin-based from his observations and that the addition 
of cytochalasin, an inhibitor of actin polymerization, could block the 
bacteria from spreading to new cells. To get a better view of what 
Listeria was doing in the cell cytosol, Portnoy, then at the University 
of Pennsylvania, began a collaboration with Lew Tilney, a prominent 
electron microscopist. Using a macrophage-like cell line, Portnoy 
infected one set of petri dishes with wild-type Listeria, and, as a 
negative control, infected another set with a Listeria mutant that 
was unable to enter cells. At specific timepoints, the cells were fixed 
and prepared for electron microscopy. Two hours after infection, 
Portnoy and Tilney observed that the wild-type Listeria in the cyto- 
plasm were surrounded by a cloud of actin, which was confirmed 
through the decoration of the filaments with myosin S1 fragments. 
At the four hour timepoint, numerous bacteria were observed 


with long “tails” of short randomly oriented actin filaments that 
appeared to have extended from one pole of the Listeria, giving 
them a comet-like appearance (FIGURE 1b). At the edges of infected 
cells, some Listeria appeared to have pushed finger-like protru- 
sions outward from the cell membrane. Based on these snapshots, 
Portnoy and Tilney postulated in 1989 that the actin comet tails 
were used to propel Listeria through the cytoplasm, and were ulti- 
mately used to gain access to a new host cell. 

The description of the comet tails behind Listeria was 
intriguing to cell biologists studying the actin cytoskeleton. Julie 
Theriot, a graduate student in the laboratory of Tim Mitchison 
at the University of California, San Francisco, saw parallels bet- 
ween her studies of actin filament dynamics in locomoting cells 
and the postulated propulsion of Listeria cells in the cytosol. 
During the course of her graduate studies, Theriot described, 
for the first time, the dynamics of actin networks in living loco- 
moting cells using fluorescently tagged actin and a homemade 
video microscopy system.’ To follow a specific population of 
actin monomers over time, Theriot coupled a photoactivatable 
fluorescent compound to actin monomers and injected the 
labeled actin into live cells. She then allowed the labeled actin to 
be incorporated into actin polymers in the cell. By shining light of 
a specific wavelength in a small rectangular-shaped region of the 
lamellipod, Theriot could turn on the fluorescence of the labeled 
actin and observe how the labeled filamentous actin behaved 
in motile cells (Figure 1a). By activating a region at the front of 
the lamellipod, she observed that the labeled actin filaments 
remained organized and stationary relative to the substrate as 
the cell moved forward. This was the case for all of the crawl- 
ing cells she observed, no matter how quickly they moved across 
the substrate. The bar of fluorescence eventually dispersed and 
disappeared as the bar approached the rear of the cell, suggest- 
ing that filaments were eventually completely dismantled and 
the monomers recycled. From these observations, Theriot and 
Mitchison proposed that extension of the lamellipodium during 
cell motility was caused by the polymerization of new actin fila- 
ments at the leading edge (rather than from the rearrangement 
of existing filaments). 

In collaboration with Portnoy and Tilney, Theriot and Mitchison 
then utilized the experimental system they developed to study 
the movement of Listeria. In this case, Theriot injected photoac- 
tivatable actin monomers into cells that had been infected with 
Listeria and selected actin tails to study in more detail using 


FIGURE 1 Cell motility can be driven by actin 
polymerization. (a) Fluorescence micrograph of a 
portion of a cell infected with the bacterium L. 
monocytogenes. The bacteria appear as red-stained 
objects just in front of the green-stained filamen- 
tous actin tails. (b) Electron micrograph of a cell 
infected with the same bacterium as in a, showing 
the actin filaments that form behind the bacterial 
cell and push it through the cytoplasm. The actin 
filaments have a bristly appearance because they 
have been decorated with myosin heads. Bar at the 
upper left, 0.1 m. htto://biochem.web.utah.edu/ 
iwasa/projects/Karp/9_14.html 


Source: (a) Courtesy of Pascale Cossart; (b) From 
Lewis G. Tilney, et al., J. Cell Biol. 118:77, 1992, 
Fig. 8. Reproduced with permission of the 
Rockefeller University Press. 


fluorescence video microscopy.’ After photoactivating fluorescent 
actin in a small region of the tail, Theriot observed that while the 
bacteria shot away from the fluorescent mark, the mark remained 
stationary relative to the cytoplasm. Using phase-contrast micros- 
copy, Theriot observed that the faster-moving bacteria had 
longer tails and demonstrated that the rate of movement and 
the length of the tail were linearly correlated. From labeling fixed 
cells with fluorescently conjugated phalloidin, she also observed 
that actin filament density was highest close to the bacterial 
surface, and that fluorescence intensity (and therefore filament 
density) decreased exponentially further away from the surface. 
From these observations, the authors postulated that new actin 
filaments were being continually polymerized at the surface of the 
bacterium and that this alone served to propel Listeria through 
the cytoplasm. They concluded that the rate of Listeria motility 
was equal to the rate of actin polymerization. In their discussion, 
the authors noted the similarity between the dynamics of actin 
networks in Listeria based motility and at the leading edge of 
motile cells and suggested that Listeria might be used as a sim- 
plified system for studying actin-based motility. 

At around the same time, two labs independently identi- 
fied the ActA protein of Listeria as the surface protein required 
for nucleation of actin filaments.’ Listeria mutants that lacked 
ActA were shown to be unable to recruit actin filaments and 
no longer showed motility in infected cells. It was later demon- 
strated that expression of ActA in previously non-motile strains of 
bacteria allowed for the formation of actin comet tails and motility 
characteristic of pathogenic strains. These experiments demon- 
strated that the protein ActA is necessary and sufficient for actin- 
based motility of Listeria. Researchers in different labs soon began 
to dissect the ActA protein to determine which regions of the pro- 
tein were responsible for its nucleation effect on the actin cytoskel- 
eton. From in vitro studies, it was clear that ActA did not directly 
nucleate new actin filaments and that one or more unidentified 
cellular factors were needed in order to generate comet tails. 

In 1997, Matthew Welch, then a postdoctoral fellow in Tim 
Mitchison’s lab at UCSF, and his colleagues used a biochemical 
approach to identify the Aro2/3 complex as the endogenous 
actin nucleator in cells.? Using human platelet cells, which are 
rich in actin, they created a soluble cell extract that could support 


Listeria motility. They then fractionated the extract using a series 
of chromatographic columns that separate proteins in the extract 
based on characteristics such as size or charge. Fractions were 
tested for their ability to promote the polymerization of fluores- 
cently labeled actin around Listeria in a microscopy screen. The 
final fraction, which contained the purified Arp2/3 complex, was 
able to promote the formation of actin clouds (but notably, not 
comet tails) around Listeria. By creating an antibody against the 
Arp3 subunit, Welch and his colleagues were able to localize the 
Arp2/3 complex to the surface of Listeria, supporting its pro- 
posed role as an actin nucleator. Based on this data, the authors 
proposed a model in which ActA, while on the surface of Listeria, 
transiently interacts with Aro2/3 and stimulates its nucleation 
activity. Acting as a template for a new actin filament, the Arp2/3 
complex dissociates from ActA and gets incorporated into the 
growing actin tail. 

With the finding of the elusive actin nucleator, cell biolo- 
gists were getting ever closer to reconstituting motility in a well- 
defined, in vitro system. In 1999, Thomas Loisel and colleagues in 
the laboratory of Marie-France Carlier were able to finally achieve 
this goal.'!° The key to their success was the development of a 
“motility medium” that consisted of a set of purified proteins from 
a variety of sources. To achieve the optimal motility of Listeria, the 
motility medium required actin, Arp2/3, capping protein, actin 
depolymerizing factor, profilin, aloha-actinin, and VASP, as well as 
a steady supply of ATP. By adding, subtracting, and altering the 
concentration of each protein in the mix, Loisel and colleagues 
were able to determine the conditions needed for optimal 
movement. Only four proteins were strictly required for motility: 
actin, Arp2/3, cofilin, and capping protein. They also found that 
altering the concentration of other proteins in the mix affected 
the velocity of Listeria in a bell-shaped curve, confirming the sus- 
pected roles of some proteins in motility. For example, the actin 
monomer binding protein profilin, which was known biochemically 
to enhance actin polymerization, was shown to increase the rate of 
movement by two- to threefold. Some findings were more unex- 
pected, such as the effect of the protein VASP, whose absence 
caused a tenfold decrease in the rate of movement. 

In other labs, researchers experimented with replacing the 
Listeria bacteria in their motility media with artificial “cargo.” This 


continued 
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allowed them to test the impact of changing the size and shape 
of the cargo, as well as the ActA concentration on the surface. 
Lisa Cameron and colleagues in the laboratory of Julie Theriot at 
Stanford University coated small plastic beads with purified ActA 
protein and observed them after they were placed in cytoplasmic 
extract.'’ They observed that beads smaller than 0.5 um formed 
actin clouds and then comet tails within 1 hour of incubation and 
moved at rates comparable to Listeria cells. Beads larger than 
0.5 um, however, formed symmetric clouds of polymerized actin 
but largely remained immobile. They found that motility could be 
rescued in the larger beads by coating only half of the bead with 
ActA. These results, along with the results of other labs studying 
bead-based motility, led to numerous biophysical and computa- 
tional studies exploring the molecular and physical parameters 
that allow Listeria and Listeria-like beads to “break symmetry”— 
that is, to escape the symmetric actin cloud that they initially gen- 
erate and propel themselves in a specific direction. 

The relative simplicity of the Listeria-based motility system, 
with its well-defined and relatively well-understood component 
list, attracted computational biologists who sought to create 
computational models of motility. Jonathan Alberts and Garrett 
Odell, at the University of Washington at Friday Harbor, success- 
fully recapitulated many of the observed dynamic properties 
of Listeria-based motility using a simulation that they dubbed 


FIGURE 2. Still image from a computational simulation of actin-based 


motility of a Listeria bacterium. Simulated actin filaments appear as yellow 
lines behind the bacterium (white outline). 

Source: Jonathan B. Alberts, Garret M. Odell, PLOS Biology 2(12), 2004. 
Courtesy Jonathan B. Alberts. 


an “in silico reconstitution” experiment in 2004 (FIGURE 2).' 
The simulation they designed was based on well-understood 
and experimentally observed parameters, such as protein 
association and disassociation rates and actin ATPase rates, and 
they described the biochemical and mechanical interactions 
involving hundreds of actin filaments. Satisfyingly, the speed and 
overall dynamics of the in silico Listeria was consistent with those 
observed for in vivo and in vitro models. The simulation also 
revealed saltatory motion, where the simulated bacterium pauses 
before continuing along its trajectory—an observation that was 
also made for bacterial motion using live Listeria. This simulation, 
and other in silico experiments run by other groups, has served 
to test, confirm, and enhance our understanding of actin based 
motility. 
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9.16 Actin-dependent Processes 
During Development 


During the development of a multicellular organism, the actin 
cytoskeleton powers cells through a number of essential processes. 
We will discuss two topics, axonal outgrowth and cellular shape 
changes during development, below. 


Axonal Outgrowth 


In 1907, Ross Harrison of Yale University performed one of the clas- 
sic experiments in biology. Harrison removed a small piece of tissue 
from the developing nervous system of a frog embryo and placed the 
fragment into a tiny drop of lymphatic fluid. Harrison watched 
the tissue under a microscope over the next few days and found that 
the nerve cells not only remained healthy, but many of them sprouted 
processes that grew out into the surrounding medium. Not only was 
this the first time that cells had been kept alive in tissue culture, but 
the experiment also provided strong evidence that axons develop by 
a process of active outgrowth and elongation. 

The tip of an elongating axon is very different from the remainder 
of the cell (see Figure 9.1). Although the bulk of the axon shows little 
outward evidence of motile activity, the tip, or growth cone, resem- 
bles a highly motile, crawling fibroblast. Close examination of a living 
growth cone reveals several types of locomotor protrusions: a broad, 
flattened lamellipodium that creeps outward over the substratum; 
short, stiff microspikes (FIGURE 9.66a) that point outward to the edge 
of the lamellipodium; and highly elongated filopodia that extend and 
retract in a continuous display of motile activity. Fluorescence micros- 


(a) (b) 


copy shows all of these structures in the peripheral domain of the 361 


growth cone to be filled with actin filaments (shown in green, 
Figure 9.66b). These actin filaments are presumed to be responsible 
for the motile activities of the growth cone. Microtubules, on the other 
hand, fill the axon and the central domain of the growth cone, provid- 
ing support for the thin, elongating axon. A number of individual 
microtubules are seen to penetrate into the actin-rich periphery 
(shown in orange, Figure 9.66). These penetrating microtubules are 
highly dynamic and are thought to play an important role in steering 
the growth cone in the appropriate direction. 

The growth cone is a highly motile region of the cell that explores 
its environment and elongates the axon. Within the embryo, the axons 
of developing neurons grow along defined paths, following certain top- 
ographical features of the substratum or responding to the presence of 
certain chemicals that diffuse into their path. The lamellipodia and filo- 
podia from the growth cone respond to the presence of these physical 
and chemical stimuli, causing the pathfinding axons to turn toward 
attractive factors and away from repulsive factors. FIGURE 9.67a shows a 
cultured neuron whose advancing tip has made a direct turn toward a 
diffusible protein called netrin, which acts as an attractant for axons 
growing within the early embryo. Figure 9.67b shows a growth cone 
(green) making contact with a cell that is expressing another protein 
called ephrin (red) that also acts as a neuronal guidance factor. Unlike 
netrin, ephrin is a non-diffusible, integral protein of the plasma mem- 
brane that binds to an ephrin receptor on the surface of the growth cone. 
The growth cone is making contact with the ephrin-expressing cell by 
means of its long filopodia, which serve a sensory function. Ultimately, 
the correct wiring of the entire nervous system depends on the uncanny 
ability of embryonic growth cones to make the proper steering “deci- 
sions” that lead them to the target organ they must innervate. 


FIGURE 9.66 The structure of a growth cone: the motile tip of a growing axon. (a) A video image of a live growth cone. The terminus is spread into a 
flattened lamellipodium that creeps forward over the substratum. Rodlike microspikes (arrows) can be seen within the transparent veil of the lamellipodium, and 
fine processes called filopodia (arrowheads) can be seen projecting ahead of the leading edge of the lamellipodium. Bar, 5 um. (6) Fluorescence micrograph of 
the growth cone of a neuron showing the actin filaments (green) concentrated in the peripheral domain and the microtubules (orange) concentrated in the 


central domain. A number of microtubules can be seen to invade the peripheral domain, where they interact with actin-filament bundles. 


Source: (a) From Paul Forscher and Stephen J. Smith, J. Cell Biol. 107:1508, 1988, Fig. 2. Reproduced with permission of Rockefeller University Press; 
(b) From Feng-Quan Zhou, Clare M. Waterman-Storer, and Christopher S. Cohan, J. Cell Biol. vol. 157, #5 cover, 2002. Reproduced with permission of the 


Rockefeller University Press. 
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(b) 


FIGURE 9.67 The directed movements of a growth cone. (a) A video image of a live growth cone of a Xenopus neuron that has turned toward a 
diffusible protein (netrin-1) released from a pipette whose position is indicated by the arrow. (b) The growth cone (green) at the tip of a motor axon has 
made contact by means of its filopodia with a target cell that is expressing the neuronal guidance factor ephrin (red). 


Source: (a) From Elke Stein and Marc Tessier-Lavigne, Science 291:1929, 2001. Reprinted with permission from AAAS; (b) Courtesy of Irina Dudanova. 


CHANGES IN CELL SHAPE DURING EMBRYONIC 
DEVELOPMENT Each part of the body has a characteristic 
shape and internal architecture that arises during embryonic devel- 
opment: The spinal cord is basically a hollow tube, the kidney con- 
sists of microscopic tubules, each lung is composed of microscopic 
air spaces, and so forth. Numerous cellular activities are necessary 
for the development of the characteristic morphology of an organ, 
including programmed changes in cell shape. Changes in cell shape 
are brought about largely by changes in the orientation of cytoskel- 
etal elements within the cells. One of the best examples of this phe- 
nomenon is seen in the early stages of the development of the 
nervous system. 

Toward the end of gastrulation in vertebrates, the outer (ectoder- 
mal) cells situated along the embryo’s dorsal surface elongate and form 
a tall epithelial layer called the neural plate (FIGURE 9.68a,b). The cells 
of the neural plate elongate as microtubules become oriented with 
their long axes parallel to that of the cell (inset, Figure 9.68). Following 
elongation, the cells of the neural epithelium become constricted at 
one end, causing them to become wedge shaped and the entire layer of 
cells to curve inward (Figure 9.68c). This latter change in cell shape is 
brought about by the contraction of a band of microfilaments that 
assemble in the cortical region of the cells just beneath the apical cell 
membrane (inset, Figure 9.68c). Eventually, the curvature of the neu- 
ral tube causes the outer edges to contact one another, forming a 
cylindrical, hollow tube (Figure 9.68d,e) that will give rise to the 
animal’s entire nervous system. 


REVIEW | 


1. Describe the role of actin filaments in the activities of 
the growth cone of a neuron. 


ee  - 


9.17 The Bacterial Cytoskeleton 


For many years, it was widely held that the cytoskeleton was strictly 
a eukaryotic innovation that was absent from prokaryotic cells. 
Recent research from the past decade has revealed that numerous 
prokaryotes contain tubulin-, actin- and intermediate filament-like 
proteins. Many of these bacterial proteins have been found to be 
structurally and functionally related to eukaryotic cytoskeletal pro- 
teins, suggesting that actin, microtubules and intermediate filaments 
have all evolved from prokaryotic structures (FIGURE 9.69). 

The bacterial cytoskeleton must carry out many of the same 
tasks as the eukaryotic cytoskeleton, albeit at a smaller scale. For 
example, the role of building the cytokinetic ring during cell division 
is carried out by actin in eukaryotes, and by the protein FtsZ in 
prokaryotes. FtsZ polymerizes into numerous short rod-like fila- 
ments at the site of cell division, generating a force-generating con- 
tractile ring and pulling the membrane inwards to carry out cell 
fission (Figure 9.69c). Although it acts analogously to the actin 
cytoskeleton during cytokinesis, FtsZ is actually a tubulin homolog 
that is found in nearly all prokaryotic cells. 

Another prokaryotic cytoskeletal protein, ParM, has been shown 
to play a role in plasmid segregation, analogous to the action of micro- 
tubules during mitosis (FIGURE 9.69a, 9.70). During cell division, 
ParM filaments indirectly bind to specific centromere-like regions of 
low-copy number plasmids and push them to opposite poles of the 
cell through a bidirectional polymerization mechanism. The result is 
that each daughter cell receives a copy of the plasmid, an outcome that 
would be very unlikely without such a segregation system in place. 
Interestingly, biochemical studies of ParM have revealed that it under- 
goes cycles of rapid growth and disassembly, very much akin to the 
dynamic instability seen in microtubules. Interaction with the plasmid 
stabilizes the ends of the ParM filaments, allowing them to continue 
growing, in much the same way that +TIP binding proteins can 
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FIGURE 9.68 Early stages in the development of 
the vertebrate nervous system. (a—d) Schematic 
drawings of the changes in cell shape that cause a layer 
of flattened ectodermal cells at the mid-dorsal region 
of the embryo to roll into a neural tube. The initial 
change in height of the cells is thought to be driven by 
the orientation and elongation of microtubules, 
whereas the rolling of the plate into a tube is thought 
to be driven by contractile forces generated by actin 
filaments at the apical ends of the cells. (e) Scanning 
electron micrograph of the dorsal surface of a chick 
embryo as its neural plate is being folded to form a 
tube. 

Source: (e) Courtesy of Kathryn W. Tosney. 


FIGURE 9.69 Bacterial cytoskeletal filaments carry out functions 

similar to their eukaryotic homologs. (a) ParM polymerizes into filaments 

that separate two sister plasmids, analogous to the role of microtubules. (b) 

Paralleling the role of intermediate filaments, CreS polymerizes proximal to 

the membrane, and plays a key role in cell shape. (c) Like actin during 

cytokinesis, FtsZ polymers form a contractile ring that helps separate the 
(a) (b) (o) two daughter cells. 
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stabilize microtubules. But despite these similarities to microtubules, 
ParM is structurally similar to actin, rather than to tubulin. 

Yet other bacterial cytoskeletal proteins have been implicated in 
regulating cell shape. One example of this class of proteins is 
Crescentin (CreS), an intermediate filament-like protein found in 
the crescent-shaped bacterium Caulobacter crescentus. CreS has 
been found to form bundled filaments at the highly curved concave 
portion of the cell membrane (Figure 9.69b). Deletion of the creS 
gene results in loss of cellulvar curvature, resulting in the cells 


Analytic Questions| 


1. If a myofibril were pulled so that the sarcomeres increased in 
length by approximately 50 percent, what effect would you 
expect this to have on the contractile ability of the myofibril? 
Why? What effects would this have on the H, A, and | bands? 


2. Why do microtubules disassemble at the ends instead of falling 
apart in the middle? Hint: Consider the protein-protein interac- 
tions that a tubulin a8 dimer makes when it assembles into the 
microtubule lattice. 


3. What are two types of cell motility that might be unaffected if 
both myosin | and myosin Il were depleted? Why? 


4. Acentriole contains 
ium contains 


complete microtubules, and a cil- 
complete microtubules. 


5. Microtubules can be formed in vitro from tubulin that is bound to 
GTP analogues that (unlike GTP) cannot be hydrolyzed. What 
properties would you expect these microtubules to possess? 
Could such microtubules still show dynamic instability? 


6. List two things you could do that would shift the dynamic equi- 
librium of an in vitro preparation of tubulin and microtubules 
toward the formation of microtubules. List two treatments that 
would shift the equilibrium in the opposite direction. 


7. It was noted that a demembranated ciliary or flagellar axoneme 
is able to beat with a normal frequency and pattern. Can you 
conclude that the plasma membrane is not important in ciliary or 
flagellar function? 


8. Because cytoplasmic vesicles are seen to move in both direc- 
tions within an axon, can you conclude that some microtubules 
are oriented with their plus end facing the axon terminus and 
others oriented with the opposite polarity? Why or why not? 


FIGURE 9.70 ParM segregates low-copy number plasmids. Still image 
from an animation demonstrating how the actin-like protein ParM is able to 
separate two sister plasmids during cell division. 


appearing as straight rods rather than crescents. Another example of 
a cytoskeletal protein that controls cell shape is the actin-like 
homolog MreB. MreB is expressed in rod-shaped bacteria, such as 
Escherichia coli, as well as helical bacteria. Deletion of mreB results 
in spherical cells, clearly indicating its key role in shape determina- 
tion. The mechanism and structure of MreB in vivo is currently 
unclear, but is an area of active research. 


REVIEW | 


1. How are bacterial cytoskeletal proteins similar to their 
mammalian counterparts? How are they different? 
e 


9. Would you agree with the statement that the centrosome plays 
a key role in determining the rates of lengthening and shorten- 
ing of the microtubules of an animal cell? Why or why not? 


10. If you were comparing the molecular structure of kinesin and 
myosin, which are thought to have been derived from a common 
ancestral protein, which part (heads or tails) would you expect to 
be most similar between them? Why? 


11. Figure 9.25a shows the appearance of a cross section of a ciliary 
axoneme cut at an interior portion of the cilium. How might the 
image of a cross section differ if it had been cut very near the tip 
of a cilium at the beginning of the recovery stroke? 


12. If an individual kinesin molecule can move at a rate of 800 nm/ 
sec in an in vitro motility assay, what is the maximal turnover rate 
(ATP molecules hydrolyzed per second) by one of the molecule’s 
motor domains? (See Quantitative Tutorial Video). 


13. Actin treadmilling (Figure 9.39) requires actin monomers to 
dissociate from the pointed end of the actin filament while at the 
same time actin monomers are associating at the barbed end of 
the filament. If we start with an actin filament and pop a subunit 
off one end or the other, in either case we now have a free actin 
subunit plus a filament that is shorter by one monomer. If the 
initial state is the same (single actin filament) and the final state 
is the same (shorter filament plus one monomer), the free energy 
difference should be the same and, as shown in Chapter 3, 
the dissociation constant should be the same. So how is it 
possible to have a concentration of actin that permits dissocia- 
tion at one end and association at the other? What biochemical 
activity of actin has our simple statement of the problem 
ignored? 


14. 


15. 


16. 


WZ. 


18. 


Which type of vertebrate tissue would you expect to be an excel- 
lent source of tubulin? of actin? of keratin? Which protein would 
you expect to be the least soluble and most difficult to extract? 
What types of protein would you expect as contaminants in a 
preparation of tubulin? Which in a preparation of actin? 


Actin is one of the most evolutionarily conserved proteins. What 
does this tell you about the structure and function of this protein 
in eukaryotic cells? 


How is it possible that higher plant cells can operate without 
cytoplasmic dynein? 

Myosin Il action (Figure 9.53) differs from that of kinesin (Figure 9.9) 
in that one of the kinesin heads is always in contact with a microtu- 
bule, whereas both myosin heads become completely detached 
from the actin filament. How are these differences correlated with 
the two types of motor activities in which these proteins engage? 


The microtubules of an axon are thought to be nucleated at the 
centrosome, then severed from their nucleation site and moved 
into the axon. It was noted in the text that cytoplasmic dynein is 


19. 


20. 


21. 


responsible for the retrograde movement of organelles in axons, 
yet this same motor is thought to mediate the anterograde 
movement of microtubules in these same cellular processes. 
How is it possible that the same minus end-directed motor can 
be involved in both anterograde and retrograde movements? 
Hint: Review in vitro motility assays for motor proteins. 


Motor proteins are often described as mechanoenzymes. Why? 
Could this same term be applied to virtually all enzymes? Why or 
why not? 

In eukaryotes, tubulin is involved in chromosome segregation 
while actin is involved in building the contractile ring that divides 
the cells. Contrast the roles of tubulin and actin homologs in 
bacteria with their roles in eukaryotes. 


Kinesin steps 8 nm per ATP hydrolyzed, and without ATP it can- 
not step. As the concentration of ATP increases, kinesin steps 
more frequently and moves faster. Can kinesin be made to move 
infinitely fast if you add enough ATP? 
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The Nature of the Gene 
and the Genome 


GENOMICS GETS PERSONAL 


- the winter of 2003, a baby girl was born who would 
eventually become a symbol for the emerging field of 
personal genomics. Bea Rienhoff was primarily a healthy 
baby, but displayed a constellation of unusual traits, including 
wide-set eyes, curled hands and feet, and poor muscle tone 
that her father, Hugh Rienhoff, who was trained as a geneti- 
cist, recognized as potential markers for a genetic disorder. 
After two years and numerous visits to medical specialists 
looking for a diagnosis, Bea’s parents were left empty- 
handed. Frustrated, Rienhoff built a makeshift genetics 
laboratory in his home and began a personal project, which 
would eventually be called “The Bea Project,” to identity the 
mutation he was certain existed in his daughter’s genome. 


Based on Bea’s symptoms, some of which appeared similar 
to other known genetic diseases, Rienhoff suspected that 
the mutation existed somewhere in the TGF-beta pathway. 
He sent Bea's DNA off to have key members of the pathway 
sequenced and compared the returned sequences with 
those in the recently published human genome. Finding 
nothing, he expanded his search, enlisting the help of 
experts in academia and industry. 
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In 2009, Rienhoff turned to a company called Illumina that 
was testing new methods that would allow for the sequencing 
of the portion of the genome that encodes for proteins (called 
the exome). He convinced the company to sequence the 
exomes of Bea and members of the immediate family as a 
test case. After analyzing the data, a single point mutant in a 
single gene that encoded the protein TGF-beta3 was found in 
Bea's exome, but not in that of any of the other family mem- 
bers. From experiments in cell culture, researchers found that 


10.1 The Concept of a Gene 
as a Unit of Inheritance 


Our concept of the gene has undergone a remarkable evolution as 
biologists have learned more and more about the nature of inherit- 
ance. The earliest studies revealed genes to be discrete factors that 
were retained throughout the life of an organism and then passed on 
to each of its progeny. Over the following century, these hereditary 
factors were shown to reside on chromosomes and to consist of 
DNA, a macromolecule with extraordinary properties. FIGURE 10.1 
provides an overview of some of the early milestones along this 
remarkable journey of discovery, capped by the description of the 
double helical structure of DNA in 1953. In the decades that fol- 
lowed this turning point, a major branch of molecular biology began 
to focus on the genome, which is the collective body of genetic 
information that is present in a species. A genome contains all of the 
genes required to “build” a particular organism. During the past 
decade or so, collaborations by laboratories around the world have 
uncovered the complete nucleotide sequences of many different 
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the point mutation results in a nonfunctional protein that 
reduces overall signaling through the TGF-beta pathway. So 
far, Bea Rienhoff is the only person in the world who has this 
mutation and the accompanying, as yet unnamed, syndrome. 
Currently, studies are underway to create animal models to 
see whether there are any long-term impacts on health that 
the mutation may have, and Bea’s father is hopeful that they 
will eventually find others with the same mutation that have 
gone on to live long and healthy lives. 


genomes, including that of our own species and the chimpanzee, our 
closest living relative. For the first time in human history, we have 
the means to reconstruct the genetic path of human evolution by 
comparing corresponding regions of the genome in related organ- 
isms. We can learn which regions of our genome have been dupli- 
cated and which have been lost since our split from a common 
ancestor; we can observe which nucleotides in a particular gene or 
regulatory region have undergone change and which have remained 
constant; most important, we can infer which parts of our genome 
have been subject to natural selection and which have been free to 
drift randomly as time has passed. We have also begun to use this 
information to learn more about our history as a species: when and 
where we arose, how we are related to one another, and how we came 
to occupy the regions of the Earth that we have. 

The science of genetics began in the 1860s with the work of 
Gregor Mendel, a friar at the abbey of St. Thomas located in today’s 
Czech Republic. Mendel’s laboratory was a small garden plot on the 
grounds of the abbey. We don’t know exactly what motivated Mendel 
to begin his studies, but he evidently had a clear experimental plan in 
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FIGURE 10.1 An overview depicting several of the most important early discoveries on the nature of the gene. Each of these discoveries is 


discussed in the present chapter. 
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368 TABLE 10.1 Seven Traits of Mendel’s Pea Plants 
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Trait Dominant allele Recessive allele 


Dwarf 

Green 

Angular (wrinkled) 
White 

At stem tips 

Yellow 

Constricted 


Height Tall 
Seed color 
Seed shape 


Yellow 
Round 
Purple 
Along stem 
Green 
Inflated 


Flower color 
Flower position 
Pod color 

Pod shape 


(See www.mendelweb.org for a discussion of Mendel’s work.) 


mind: His goal was to mate, or cross, pea plants having different 
inheritable characteristics and to determine the pattern by which 
these characteristics were transmitted to the offspring. Mendel chose 
the garden pea for a number of practical reasons, not the least of which 
was that he could purchase a variety of seeds that would give rise to 
plants with distinct characteristics. Mendel chose to focus on seven 
clearly definable traits, including plant height and flower color, each of 
which occurred in two alternate and clearly identifiable forms 
(Table 10.1). Mendel crossbred plants through several generations 
and counted the number of individuals having various characteristics. 
After several years of research, Mendel drew the following conclu- 
sions, which are expressed below in modern genetic terminology. 


1. The characteristics of the plants were governed by distinct factors 
(or units) of inheritance, which were later termed genes. An 
individual plant possessed two copies of a gene that controlled 
the development of each trait, one derived from each parent. The 
two copies could be either identical to one another or nonidentical. 
Alternate forms of a gene are called alleles. For each of the seven 
traits studied, one of the two alleles was dominant over the other. 
When both were present together in the same plant, the existence 
of the recessive allele was masked by the dominant one. 


2. Each reproductive cell (or gamete) produced by a plant contained 
only one copy of a gene for each trait. A particular gamete could 
have either the recessive or the dominant allele for a given trait, 
but not both. Each plant arose by the union of a male and female 
gamete. Consequently, one of the alleles that governed each trait 
in a plant was inherited from the female parent, and the other 
allele was inherited from the male parent. 


3. Even though the pair of alleles that governed a trait remained 
together throughout the life of an individual plant, they became 
separated (or segregated) from one another during the formation 
of the gametes. This finding formed the basis of Mendel’s law of 
segregation. 

4. The segregation of the pair of alleles for one trait had no effect on 
the segregation of alleles for another trait. A particular gamete, 
for example, could receive a paternal gene governing seed color 
and a maternal gene governing seed shape. This finding formed 
the basis of Mendel’s law of independent assortment. 


Mendel presented the results of his work to the members of the 
Natural History Society of Briinn; the minutes of the meeting record 
no discussion of his presentation. Mendel’s experiments were pub- 
lished in the Briinn society’s journal in 1866, where they generated 
no interest whatsoever until 1900, 16 years after his death. In that 
year, three European botanists independently reached the same con- 
clusions, and all three rediscovered Mendel’s paper, which had been 
sitting on the shelves of numerous libraries throughout Europe for 
35 years. 


REVIEW | 


1. What is an allele and what is its relationship to a gene? 


2. Describe Mendel’s law of independent assortment. 


10.2 The Discovery 
of Chromosomes 


Although Mendel provided convincing evidence that inherited traits 
were governed by discrete factors, or genes, his studies were totally 
unconcerned with the physical nature of these elements or their 
location within the organism. During the time between Mendel’s 
work and its rediscovery, a number of biologists became concerned 
with this other aspect of heredity—its physical basis within the cell. 

By the 1880s, a number of European biologists were closely fol- 
lowing the activities of cells and using rapidly improving light micro- 
scopes to observe newly discernible cell structures. None of these 
scientists was aware of Mendel’s work, yet they understood that 
whatever it was that governed inherited characteristics would have 
to be passed from cell to cell and from generation to generation. 
This, by itself, was a crucial realization; all the genetic information 
needed to build and maintain a complex plant or animal had to fit 
within the boundaries of a single cell. Observations of dividing cells 
by the German biologist Walther Flemming in the early 1880s 
revealed that the cytoplasmic contents were shuttled into one daugh- 
ter cell or the other as a matter of chance, depending on the plane 
through which the furrow happened to divide the cell. In contrast, 
the cell appeared to go to great lengths to divide the nuclear contents 
equally between the daughters. During cell division, the material of 
the nucleus became organized into visible “threads,” which were 
named chromosomes, meaning “colored bodies.” 

At about this time, the process of fertilization was observed, and 
the roles of the two gametes—the sperm and the egg—were described 
(FIGURE 10.2). Even though the sperm is a tiny cell, it was known to be 
as important genetically as the much larger egg. What was it that these 
two very different cells had in common? The most apparent feature was 
the nucleus and its chromosomes. The importance of the chromosomes 
contributed by the male became evident in a study by the German biol- 
ogist Theodore Boveri on sea urchin eggs that had been fertilized by 
two sperm rather than just one, as is normally the case. This condition, 
referred to as polyspermy, is characterized by disruptive cell divisions 
and the early death of the embryo. Why should the presence of one 
extra tiny sperm nucleus within the very large egg have such drastic 
consequences? The second sperm donates an extra set of chromosomes 
and an extra centriole (Section 9.5). These additional components lead 
to abnormal cell divisions in the embryo during which daughter cells 
receive variable numbers of chromosomes. Boveri concluded that the 
orderly process of normal development is “dependent upon a particular 
combination of chromosomes; and this can only mean that the indi- 
vidual chromosomes must possess different qualities.” This was the first 
evidence of a qualitative difference among chromosomes. 

Events occurring after fertilization were followed most closely 
in the roundworm Ascaris, whose few chromosomes are large and 
were as readily observed in the nineteenth century as in introductory 
biology laboratories today. In 1883, the Belgian biologist Edouard 
van Beneden noted that the cells of the worm’s body had four large 
chromosomes, but that the male and female nuclei present in the egg 
just after fertilization (before the two nuclei fuse with one another) 


Vrouwelijke pronucleus 


Mannelijke pronucleus 


FIGURE 10.2 Events occurring in the 

roundworm Ascaris following fertilization, as 

reported in a classic nineteenth-century investi- 

gation. Both the male and female gamete are 5 
seen to contain two chromosomes. Fusion of the 
sperm and egg nuclei (called pronuclei) in the 
egg cytoplasm (between e and f) produces a 
zygote containing four chromosomes. The 
second polar body shown in a is a product of the 
previous meiosis as described in Section 14.13. 


Source: From T. Boveri, Jenaische Zeit 22:685, 
1888. 


Begin deling nucleus 


had but two chromosomes apiece (Figure 10.2). About this same 
time, the process of meiosis was described, and in 1887 the German 
biologist August Weismann proposed that meiosis included a 
“reduction division” during which the chromosome number was 
reduced by half prior to formation of the gametes. If a reduction 
division did not occur, and each gamete contained the same number 
of chromosomes as an adult cell, then the union of two gametes 
would double the number of chromosomes in the cells of the prog- 
eny. The number of chromosomes would double with each and 
every generation, which obviously does not and cannot occur.’ 


REVIEW | 


1. What evidence led early microscopists to believe that 
the chromosomes contained the cell's genetic 
information? 


10.3 Chromosomes as the Carriers 
of Genetic Information 


The rediscovery of Mendel’s work and its confirmation had an imme- 
diate influence on research in cell biology. Whatever their physical 
nature, the carriers of the hereditary units would have to behave in a 
manner consistent with Mendelian principles. In 1903, Walter Sutton, 
a graduate student at Columbia University, published a paper that 


‘Readers who have forgotten the events that occur during meiosis might want to read 
through Section 14.12, which discusses this pivotal event in the life cycle of eukaryotic 
organisms. 
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pronuclei 


Deling nucleus 
(begin celdeling) 


pointed directly to the chromosomes as the physical carriers of 
Mendel’ genetic factors. Sutton followed the development of sperm 
cells in the grasshopper, which like Ascaris, has large, easily observable 
chromosomes. The cells that give rise to sperm are called spermatogo- 
nia, and they can undergo two very different types of cell division. 
A spermatogonium can divide by mitosis to produce more spermato- 
gonia, or it can divide by meiosis to produce cells that differentiate into 
sperm (see Figure 14.41a). On examination of the mitotic stages of 
grasshopper spermatogonia, Sutton counted 23 chromosomes. Careful 
examination of the shapes and sizes of the 23 chromosomes sug- 
gested that they were present in “look-alike” pairs. Eleven pairs of 
chromosomes could be distinguished together with one additional 
chromosome, termed an accessory chromosome (and later shown to 
be a sex-determining X chromosome), that was not part of an obvi- 
ous pair. Sutton realized that the presence in each cell of pairs of 
chromosomes, or homologous chromosomes, as they were soon 
called, correlated perfectly with the pairs of inheritable factors 
uncovered by Mendel. 

When Sutton examined the chromosomes in cells just beginning 
meiosis, he found that the members of each pair of chromosomes were 
associated with one another, forming a complex called a bivalent. Eleven 
bivalents were visible, each showing a cleft along its length where the 
two homologous chromosomes were associated (FIGURE 10.3). The 
ensuing first meiotic division separated the two homologous chromo- 
somes into separate cells. This was the reduction division proposed 15 
years earlier by Weismann on theoretical grounds. Here also was the 
physical basis for Mendel’s proposal that hereditary factors exist in 
pairs that remain together throughout the life of an individual and 
then separate from one another upon formation of the gametes. The 
reduction division observed by Sutton explained several of Mendel’s 
other findings: Gametes could contain only one version (allele) of each 
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FIGURE 10.3 Homologous chromosomes. Sutton’s drawing of the 
homologous chromosomes of the male grasshopper that have associated 
during meiotic prophase to form bivalents. Eleven pairs of homologous 
chromosomes (a-k) and an unpaired X chromosome were observed. 
Source: W. S. Sutton, Biol. Bulletin 4:24, 1902. Biological Bulletin by Marine 
Biological Laboratory (Woods Hole, Mass.) Copyright 1902. Reproduced 
with permission of Marine Biological Laboratory in the format Textbook via 
Copyright Clearance Center. 


gene; the number of gametes containing one allele was equal to the 
number containing the other allele; and the union of two gametes at 
fertilization would produce an individual with two alleles for each trait. 
But many questions remained unanswered. For instance, how were the 
genes organized within the chromosomes, and could the location of 
specific genes be determined? 

As clearly as Sutton saw the relationship between chromosome 
behavior and Mendel’s alleles, he saw one glaring problem. Mendel 
had examined the inheritance of seven traits and found that each 
was inherited independently of the others. This observation formed 
the basis of Mendel’s law of independent assortment. But if genes are 
packaged together on chromosomes, like beads on a string, then 
genes should be passed from parent to offspring in packages, just as 
intact chromosomes are passed to the next generation. Genes on the 
same chromosome should act as if they are linked to one another; 
that is, they should be part of the same linkage group. 

How is it that all seven of Mendel’s traits assorted indepen- 
dently? Were they all on different linkage groups, that is, different 
chromosomes? As it happens, the garden pea has seven pairs of 
homologous chromosomes. The genes that govern each of Mendel’s 
traits either occur on different chromosomes or are so far apart from 
one another on the same chromosome that they act independently. 
Sutton’s prophecy of linkage groups was soon confirmed. Within a 
couple of years, genes for two traits in sweet peas (flower color and 
pollen shape) were shown to be linked; other evidence for chromo- 
somal linkage rapidly accumulated. 


REVIEW | 


1. What is a linkage group? What is its relationship to a 
chromosome? How can one determine the number 
of linkage groups in a species? 


oo 


10.4 Genetic Analysis 
in Drosophila 


Research in genetics soon came to focus on one particular organism, 
the fruit fly, Drosophila (FIGURE 10.4). Fruit flies have a generation 
time (from egg to sexually mature adult) of about 10 days and can pro- 
duce up to 1000 eggs in a lifetime. In addition, fruit flies are very small, 
so a large number could be kept on hand; they were easy to house and 
breed; and they were very inexpensive to maintain. In 1909, fruit flies 
seemed like the ideal organism to Thomas Hunt Morgan of Columbia 
University, and he began what was to be the start of a new era in genetic 
research. There was one major disadvantage in beginning work with 
this insect—there was only one “strain” of fly available, the wild type. 
Whereas Mendel had simply purchased varieties of pea seeds, Morgan 
had to generate his own varieties of fruit flies. Morgan expected that 
variants from the wild type would appear if he bred sufficient numbers 
of flies. Within a year—after observing thousands of flies—he found 
his first mutant, that is, an individual having an inheritable character- 
istic that distinguished it from the wild type. The mutant had white 
eyes rather than the normal red-colored ones (Figure 10.4). 

By 1915, Morgan and his students had found 85 different 
mutants with a wide variety of affected structures. It was apparent 
that on rare occasions a spontaneous change, or mutation, occurs 
within a gene, altering it permanently, so that the altered trait is 
passed from generation to generation. The demonstration of a spon- 
taneous, inheritable alteration in a gene had consequences far beyond 
the study of Drosophila genetics. It suggested a mechanism for the 
origin of variation that exists within populations—evidence for a 
vital link in the theory of evolution. If variants of genes could arise 
spontaneously, then isolated populations could become genetically 
different from one another and ultimately give rise to new species. 

Whereas mutations are a necessary occurrence for evolution, 
they are a tool for geneticists, because they provide contrast to the 
wild-type condition. As Drosophila mutants were isolated, they were 
bred and crossbred and kept as stocks within the lab. As expected, 
the 85 mutations did not all assort independently; instead, Morgan 
found that they belonged to four different linkage groups, one of 


FIGURE 10.4 The fruit fly Drosophila melanogaster. Photograph of a 
wild-type female fruit fly and a mutant male that carries a mutation leading 
to white eyes. 


Source: Courtesy of Stanley J. P. lyadurai. 


yin 


FIGURE 10.5 Fruit flies have four pairs of homologous chromosomes, 
one of which is very small. The two dissimilar homologues are the 
chromosomes that determine sex. As in humans, male fruit flies are XY and 
females are XX. 


which contained very few mutant genes (only two in 1915). This 
finding correlated perfectly with the observation that Drosophila 
cells have four pairs of homologous chromosomes, one of which is 
very small (FIGURE 10.5). Little doubt remained that genes reside on 
chromosomes. 


Crossing Over and Recombination 


Even though the association of genes into linkage groups was con- 
firmed, linkage between alleles on the same chromosome was found 
to be incomplete. In other words, alleles of two different genes, such as 
short wings and black body (as in Figure 10.7), that were originally 
present together on a given chromosome did not always remain 
together during the production of gametes. Thus maternal and pater- 
nal characteristics that were inherited by an individual on separate 
homologous chromosomes could be reshuffled so that they ended up 
on the same chromosome of a gamete. Conversely, two characteristics 
that were inherited together on the same chromosome could become 
separated from one another and end up in separate gametes. 

In 1911, Morgan offered an explanation for the “breakdown” in 
linkage. Two years earlier, F. A. Janssens had observed that the homol- 
ogous chromosomes of bivalents became wrapped around each other 
during meiosis (FIGURE 10.6). Janssens had proposed that this inter- 
action between maternal and paternal chromosomes resulted in the 
breakage and exchange of pieces. Capitalizing on this proposal, 
Morgan suggested that this phenomenon, which he termed crossing 
over (or genetic recombination), could account for the appear- 
ance of offspring (recombinants) having unexpected combinations of 
genetic traits. An example of crossing over is shown in FIGURE 10.7. 

Analyses of offspring from a large number of crosses between 
adults carrying a variety of alleles on the same chromosome indi- 
cated that (1) the percentage of recombination between a given pair 
of genes on a chromosome, such as eye color and wing length, was 
essentially constant from one experiment to another; and (2) the 
percentages of recombination between different pairs of genes, such 
as between eye color and wing length versus eye color and body 
color, could be very different. 

The fact that a given pair of genes gave the same approximate 
frequency of recombination in every cross strongly suggested that 
the positions of the genes along the chromosome (their loci) were 
fixed and did not vary from one fly to the next. If the locus of each 
gene is fixed, then the frequency of recombination between two 
genes provides a measure of the distance that separates those two 


FIGURE 10.6 Visualizing sites of crossing over. Homologous chromo- 
somes wrap around each other during meiosis, as seen in this micrograph 
of a meiotic cell of a lily. The points at which the homologues are crossed 
are termed chiasmata (arrows) and (as discussed in Chapter 14) are sites at 
which crossing over had occurred at an earlier stage. 

Source: From A. H. Sparrow, Annals of the New York Academy of Sciences, 
1508, 1951. This material is used by permission of John Wiley & Sons. 


genes. The more room between two sites on a chromosome for 
breakage to occur, the more likely it is that a break would occur 
between those two sites and, therefore, the greater the recombination 
frequency. In 1911, Alfred Sturtevant, an undergraduate at Columbia 
University who was working in Morgan's laboratory, conceived the 
idea that recombination frequencies could be used to map the rela- 
tive positions of individual genes along a given chromosome. An 
example of the underlying principles of this mapping procedure is 
illustrated in Figure 10.7. In this example, the genes for wing length 
and body color in Drosophila are situated at a considerable distance 
from one another on the chromosome and, therefore, are likely to 
become unlinked by an intervening breakage and crossover. In con- 
trast, the genes for eye color and body color are situated very close to 
one another on the chromosome and consequently are less likely to 
become unlinked. Using recombination frequencies, Sturtevant— 
who became one of the century’s most prominent geneticists—began 
to construct detailed maps of the serial order of genes along each of 
the four chromosomes of the fruit fly. Recombination frequencies 
have since been used to prepare chromosomal maps from viruses 
and bacteria as well as a large variety of eukaryotic species. 


Mutagenesis and Giant Chromosomes 


During the early period of genetics, the search for mutants was a 
slow, tedious procedure dependent on the spontaneous appearance of 
altered genes. In 1927, using a special strain of fruit flies designed to 
reveal the presence of recessive alleles, H. J. Muller found that flies 
subjected to a sublethal dose of X-rays displayed more than 100 times 
the spontaneous mutation rate exhibited by nonirradiated controls. 
This finding had important consequences. On the practical side, the 
use of mutagenic agents, such as X-rays and ultraviolet radiation, 
greatly increased the number of mutants available for genetic 
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FIGURE 10.7 Crossing over provides the mechanism for reshuffling 
alleles between maternal and paternal chromosomes. (a) Simplified 
representation of a single crossover in a Drosophila heterozygote (BbWw) at 
chromosome number 2 and the resulting gametes. If either of the crossover 
gametes participates in fertilization, the offspring will have a chromosome 
with a combination of alleles that was not present on a single chromosome 
in the cells of either parent. (b) Bivalent (tetrad) formation during meiosis, 
showing three crossover intersections (chiasmata, indicated by black arrows). 


research. The finding also pointed out the hazard of the increasing 
use of radiation in the industrial and medical fields. Today, muta- 
tions in Drosophila are most often generated by adding a chemical 
mutagen (ethyl methane sulfonate) to the insects’ feed. 

The rediscovery in 1933 of giant chromosomes in certain insect 
cells by Theophilus Painter of the University of Texas illustrates a 
basic principle in biology. There is such remarkable variability among 
organisms, not only at the obvious macroscopic levels, but also at the 
cellular and subcellular levels, that often one particular type of cell 
may be much better suited for a particular type of research than 
others. Cells from the larval salivary gland of Drosophila contain 
chromosomes that are about 100 times thicker than the chromo- 
somes found in most other cells of the organism (FIGURE 10.8a). 


(b) 


FIGURE 10.8 Giant polytene chromosomes of larval insects. (a) These 
giant polytene chromosomes from the salivary gland of a larval fruit fly 
show several thousand distinct, darkly staining bands. The bands have 
been identified as the loci of particular genes. The inset shows how 
polytene chromosomes consist of a number of individual DNA molecules. 
The stained bands on the chromosomes correspond to sites where the 
DNA is more tightly compacted. (b) Scanning electron micrograph of a 
giant polytene chromosome from a Chironomus larva showing how specific 
sites are expanded to form a “puff.” Chromosome puffs are sites where 
DNA is being very actively transcribed. 

Source: (a) Andrew Syred/Photo Researchers, Inc. Reproduced with 
permission of the Company of Biologists, Ltd.; (b) From Terry Allen and 
Claus Pelling, J. Cell Science, cover of vol. 93, part 4, 1989. Reproduced 
with permission of the Company of Biologists, Ltd. 


During larval development, these cells cease dividing, but they keep 
growing. DNA replication continues, providing the additional genetic 
material needed to support the high level of secretory activity of these 
enormous cells. The duplicated DNA strands remain attached to each 
other in perfect side-by-side alignment (inset of Figure 10.82), pro- 
ducing giant chromosomes with as many as 1024 times the number 
of DNA strands of normal chromosomes. 

These unusual polytene chromosomes, as they are called, 
are rich in visual detail, displaying about 5000 bands when stained 
and examined microscopically. The banding pattern is essentially 
constant from one individual to the next, but considerable differ- 
ences are observed between the chromosomes from flies of different 
species of the Drosophila genus. Painter soon found that individual 
bands could be correlated with specific genes. The relative positions 
of these genes on the giant chromosomes agreed with those pre- 
dicted on the basis of genetic maps prepared from recombination 
frequencies, thus providing visual confirmation of the validity of the 
entire mapping procedure. 

Giant insect chromosomes have been useful in other ways. 
Comparisons of banding patterns among polytene chromosomes of 
different species have provided an unparalleled opportunity for the 
investigation of evolutionary changes at the chromosome level. In 
addition, these chromosomes are not inert cellular objects but 
dynamic structures in which certain regions become “puffed out” at 
particular stages of development (Figure 10.8). These chromosome 
puffs are sites where DNA is being transcribed at a very high level, 
providing one of the best systems available for the direct visualiza- 
tion of gene expression. 


REVIEW | 


1. How do genetic mutations help in mapping the 
location of genes on a chromosome? 


2. What is meant by incomplete linkage? What does 
this have to do with pairing of homologous 
chromosomes during meiosis? 


3. How do polytene chromosomes of an insect differ 
from normal chromosomes? 


——— 


10.5 The Structure of DNA 


Classical geneticists uncovered the rules governing the transmission 
of genetic characteristics and the relationship between genes and 
chromosomes. In his acceptance speech for the Nobel Prize in 1934, 
T. H. Morgan stated, “At the level at which the genetic experiments lie, 
it does not make the slightest difference whether the gene is a hypo- 
thetical unit or whether the gene is a material particle.” By the 1940s, 
however, a new set of questions was being considered, foremost of 
which was, “What is the chemical nature of the gene?” The experi- 
ments answering this question are outlined in the Experimental 
Pathways (Section 10.6). Once it became evident that genes are made 
of DNA, biologists were faced with a host of new questions. These 
questions will occupy us for the remainder of the chapter. 

To understand the workings of a complex macromolecule— 
whether it is a protein, polysaccharide, lipid, or nucleic acid—it is 
essential to know how that molecule is constructed. The mystery of 
DNA structure was investigated by a number of laboratories in both 


FIGURE 10.9 Model of DNA built by James Watson and Francis Crick 
at Cambridge University, 1953. 


Source: Science and Society/SuperStock. 


the United States and England in the early 1950s and was solved by 
James Watson and Francis Crick at Cambridge University in 1953 
(FIGURE 10.9). Before describing their proposed DNA structure, let 
us consider the facts available at the time. 

The basic building block of DNA was known to be a nucleotide 
(FIGURE 10.10a,b) consisting of the five-carbon sugar deoxyribose to 
which one phosphate is esterified at the 5’ position of the sugar ring and 
one nitrogenous base is attached at the 1’ site.” Two types of nitroge- 
nous bases are present in a nucleic acid: pyrimidines, which contain 
a single ring, and purines, which contain two rings (Figure 10.10c). 
DNA contains two different pyrimidines, thymine (T) and cytosine (C), 
and two different purines, guanine (G) and adenine (A). The nucleo- 
tides are covalently linked to one another to form a linear polymer, 
or strand, with a backbone composed of alternating sugar and phos- 
phate groups joined by 3'- 5'-phosphodiester bonds (Figure 10.10c). 
The bases attached to each sugar were thought to project from the 
backbone like a column of stacked shelves. 


“It is useful to introduce a bit of terminology at this point. A molecule consisting 
simply of one of the four nitrogenous bases of Figure 10.10 linked to a pentose sugar 
moiety is known as a nucleoside. If the sugar is deoxyribose, the nucleoside is a deoxy- 
ribonucleoside. There are four major deoxyribonucleosides distinguished by the 
attached base: deoxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine. 
If the nucleoside has one or more attached phosphate groups (generally at the 5’ position, 
but alternatively at the 3’ position), the molecule is a nucleotide. There are nucleoside 
5'-monophosphates, nucleoside 5’-diphosphates, and nucleoside 5'-triphosphates, 
depending on the number of phosphates in the molecule. Examples of each are deoxy- 
adenosine 5'-monophosphate (dAMP), deoxyguanosine 5’-diphosphate (dGDP), and 
deoxycytidine 5'-triphosphate (dCTP). A similar set of nucleosides and nucleotides 
involved in RNA metabolism contain the sugar ribose rather than deoxyribose. The 
nucleotides employed in energy metabolism, such as adenosine triphosphate (ATP), 
are ribose-containing molecules. 
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A nucleotide has a polarized structure: One end, where the 
phosphate is located, is called the 5’ end (pronounced “five-prime 
end”), while the other end is the 3’ end (Figure 10.10b). Because each 
of the stacked nucleotides in a strand faces the same direction, 
the entire strand has a direction (Figure 10.10c). X-ray diffraction 
analysis indicated that the distance between the nucleotides of the 
stack was 3.4 A (0.34 nm) and suggested the presence of a large 
structural repeat every 3.4 nm. 

As discussed in Section 10.6, DNA was thought for many 
years to consist of a simple repeating tetranucleotide (e.g., 
—ATGCATGCATGC—), which made it unlikely to serve as an 
informational macromolecule. In 1950, Erwin Chargaff of Columbia 
University reported an important finding that delivered the final 
blow to the tetranucleotide theory and provided vital information 
about DNA structure. Chargaff, believing that the sequence of 
nucleotides of the DNA molecule held the key to its importance, 
determined the relative amounts of each base in various samples of 
DNA, that is, the base composition of the samples. Base composi- 
tion analysis of a DNA sample was performed by hydrolyzing the 
bases from their attached sugars, separating the bases in the hydro- 
lysate by paper chromatography, and determining the amount of 
material in each of the four spots to which the bases migrated. 

If the tetranucleotide theory were correct, each of the four bases 
in a DNA sample would be present as 25 percent of the total num- 
ber. Chargaff found that the ratios of the four component bases were 
quite variable from one type of organism to another, often being 
very different from the 1:1:1:1 ratio predicted by the tetranucleotide 
theory. For example, the A:G ratio of the DNA of a tubercle bacillus 
was 0.4, whereas the A:G ratio of human DNA was 1.56. It made no 
difference which plant or animal tissue was used as the source of the 
DNA; the base composition remained constant for that species. 
Amid this great variability in base composition of different species’ 
DNA, an important numerical relationship was discovered. The 
number of purines always equaled the number of pyrimidines in a 
given sample of DNA. More specifically, the number of adenines 
always equaled the number of thymines, and the number of gua- 
nines always equaled the number of cytosines. In other words, 
Chargaff discovered the following rules of DNA base composition: 


[A] =[T], [G] =[C], [A] +[T] #[G] +[C] 


Chargaff’s findings put the DNA molecule in a new light, giving it 
specificity and individuality from one organism to another. The sig- 
nificance of the base equivalencies, however, remained obscure. 


The Watson-Crick Proposal 


When protein structure was discussed in Chapter 2, the importance 
of secondary and tertiary structure as determinants of the protein’s 
activity was stressed. Similarly, information about the three-dimensional 


FIGURE 10.10 The chemical structure of DNA. (a) Model of a DNA 
nucleotide containing the base thymine; the molecule is deoxythymidine 
5'-monophosphate (dTMP). The netlike cage represents the electron density 
of the atoms that make up the molecule. (b) Chemical structure of a DNA 
nucleotide containing the base adenosine; the molecule is deoxyadenosine 
5'-monophosphate (dAMP). A nucleotide is composed of a nucleoside linked 
to a phosphate; the nucleoside portion of the molecule (i.e., deoxyadeno- 
sine) is enclosed by the dashed line. (c) The chemical structure of a small 
segment of a single DNA strand showing all four nucleotides. 

Source: (a) From Arnon Lavie et al., Nature Str. Biol.4:604, 1997, Fig. 2a. 
Reprinted by permission of Macmillan Publishers Limited. 


organization of DNA was needed if its biological activity was to be 
understood. Using X-ray diffraction data (obtained by Rosalind 
Franklin and Maurice Wilkins at King’s College London) and models 
constructed from cutouts of the four types of nucleotides, Watson 
and Crick proposed a structure of DNA that included the following 
elements (Figure 10.10): 


1. The molecule is composed of two chains of nucleotides. This 
conclusion followed on the heels of an erroneous proposal by 
Linus Pauling, who had suggested that DNA was composed of 
three nucleotide strands. 


2. The two chains spiral around each other to form a pair of right- 
handed helices. In a right-handed helix, an observer looking 
down the central axis of the molecule would see that each strand 
follows a clockwise path, as it moves away from the observer. 
The helical nature of DNA was revealed in the pattern of spots 
produced by Franklin’s X-ray diffraction image which was 
shown to Watson during a visit to King’s College. 


3. The two chains comprising one double helix run in opposite 
directions; that is, they are antiparallel. Thus, if one chain is 
aligned in the 5’—> 3’ direction, its partner must be aligned in 
the 3'->5’ direction. 


4. The -sugar-phosphate-sugar-phosphate- backbone of each 
strand is located on the outside of the molecule with the two sets 
of bases projecting toward the center. The phosphate groups 
give the molecule a large negative charge (FIGURE 10.11a). 


5. The bases occupy planes that are approximately perpendicular 
to the long axis of the molecule and are, therefore, stacked one 
on top of another like a pile of plates. Hydrophobic interactions 
and van der Waals forces (page 36) between the stacked, planar 
bases provide stability for the entire DNA molecule. Together, 
the helical turns and planar base pairs cause the molecule to 
resemble a spiral staircase. This manner of construction is evi- 
dent in the photograph in Figure 10.9, which shows the original 
Watson-Crick model. 


6. The two strands are held together by hydrogen bonds between 
each base of one strand and an associated base on the other 
strand. Because individual hydrogen bonds are weak and easily 
broken, the DNA strands can become separated during various 
activities. But the strengths of hydrogen bonds are additive, and 
the large numbers of hydrogen bonds holding the strands 
together make the double helix a stable structure. 


7. The distance from the phosphorus atom of the backbone to the 
center of the axis is 1 nm (thus the width of the double helix is 
2nm), 


8. A pyrimidine in one chain is always paired with a purine in the 
other chain. This arrangement produces a molecule that is 
2 nm wide along its entire length. 


9. The nitrogen atoms linked to carbon 4 of cytosine and carbon 6 of 
adenine are predominantly in the amino (NH;) configuration 
(Figure 10.10c) rather than the imino (NH) form. Similarly, the 
oxygen atoms linked to carbon 6 of guanine and carbon 4 of thy- 
mine are predominantly in a keto (C=O) configuration rather 
than the enol (C—OH) form. These structural restrictions on the 
configurations of the bases suggested that adenine was the only 
purine structurally capable of bonding to thymine and that gua- 
nine was the only purine capable of bonding to cytosine. Therefore, 
the only possible pairs were A-T and G-C (Figure 10.11c). This fit 


perfectly with the base composition analysis carried out by 375 


Chargaff whose results were communicated to Watson and Crick 
during a meeting of the three scientists in Cambridge in 1952. 
Because an A-T and G-C base pair had the same geometry (Figure 
10.11c), there were no restrictions on the sequence of bases; a 
DNA molecule could have any one of an unlimited variety of 
nucleotide sequences. 


10. The spaces between adjacent turns of the helix form two grooves 
of different width—a wider major groove and a narrower minor 
groove—that spiral around the outer surface of the double helix 
(Figure 10.11a, b). Proteins that bind to DNA often contain 
domains that fit into these grooves. In many cases, a protein 
bound in a groove is able to read the sequence of nucleotides 
along the DNA without having to separate the strands. 


11. The double helix makes one complete turn every 10 residues 
(3.4 nm), or 150 turns per million daltons of molecular mass. 


12. Because an A on one strand is always bonded to a T on the 
other strand, and a G is always bonded to a C, the nucleotide 
sequences of the two strands are always fixed relative to one 
another. Because of this relationship, the two chains of the dou- 
ble helix are said to be complementary to one another. For 
example, A is complementary to T, 5’-AGC-3’ is complemen- 
tary to 3’-TCG-5’, and one entire chain is complementary to 
the other. As we shall see, complementarity is of overriding 
importance in nearly all the activities and mechanisms in which 
nucleic acids are involved. 


The Importance of the 
Watson-Crick Proposal 


From the time biologists first considered DNA as the genetic mate- 
rial, it was expected to fulfill three primary functions (FIGURE 10.12): 


1. Storage of genetic information. As genetic material, DNA must 
contain a stored record of instructions that determine all the 
inheritable characteristics that an organism exhibits. In molecular 
terms, DNA must contain the information needed to assemble all 
of the proteins that are synthesized by an organism (Figure 10.12a) 


2. Replication and inheritance. DNA must contain the informa- 
tion for synthesis of new DNA strands (replication). DNA repli- 
cation allows genetic instructions to be transmitted from one cell 
to its daughter cells and from one individual to its offspring 
(Figure 10.125) 


3. Expression of the genetic message. DNA is more than a storage 
center; it is also a director of cellular activity. Consequently, the 
information encoded in DNA has to be expressed in some form 
that can take part in events that are taking place within the cell. 
More specifically, the information in DNA must be used to direct 
the order by which specific amino acids are incorporated into a 
polypeptide chain (Figure 10.12c) 


The Watson-Crick model of DNA structure suggested a way in 
which the first two of these three genetic functions might be accom- 
plished. The model strongly supported the suspicion that the infor- 
mation content of DNA resided in the linear sequence of its bases. 
A given segment of DNA would correspond to each gene. The spe- 
cific sequence of nucleotides in that segment would dictate the 
sequence of amino acids in a corresponding polypeptide. A change 
in the linear sequence of nucleotides within that segment would 
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FIGURE 10.11 The double helix. (a) Schematic representation of the DNA double helix. (b) Space-filling model of the B form of DNA. (c) The Watson- 

_ Crick base pairs. The original model showed both A-T and G-C pairs with two hydrogen bonds; the third hydrogen bond in the G-C pair was subsequently 

| identified by Linus Pauling. (d) Electron micrograph of DNA being released from the head of a T2 bacteriophage. This linear DNA molecule (note the two 
free ends) measures 68 ym in length, approximately 60 times longer than the phage head in which it is contained. 
Source: (b) Courtesy of Nelson Max, Lawrence Livermore National Laboratory and the Department of Energy; (c) Figure 28.6 on page 854 from Voet and 
Voet, Biochemistry 2e; copyright 1995, John Wiley & Sons, Inc. This material is reproduced with permission of John Wiley & Sons, inc.; (d) From A. K. 
Kleinschmidt, et al., Biochim. Biophys. Acta 61:861, 1962, with permission from Elsevier. 
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FIGURE 10.12 Three functions required of the genetic material. 
(a) DNA must contain the information that encodes inheritable traits. 
(b) DNA must contain the information that directs its own duplication. 
(c) DNA must contain the information that directs the assembly of 
specific proteins. 


correspond to an inheritable mutation in that gene. Differences in 
nucleotide sequence were presumed to form the basis of genetic 
variation, whether between individuals of a species or between 


species themselves. As discussed in the next chapter, another decade 377 


would pass before biologists would learn the mechanism by which 
a sequence of amino acids is encoded by a sequence of DNA 
nucleotides. 

As for the second function, Watson and Crick’s initial publica- 
tion of DNA structure included a proposal as to how such a molecule 
might replicate. This may have been the first time that a study of 
molecular structure led directly to a hypothesis of a basic molecular 
mechanism. Watson and Crick proposed that during replication, the 
hydrogen bonds holding the two strands of the DNA helix were 
sequentially broken, causing the gradual separation of the strands 
much like the separation of two halves of a zipper. Each of the sepa- 
rated strands, with its nitrogenous bases exposed, would then serve 
as a template directing the order in which complementary nucleo- 
tides become assembled to form the complementary strand. When 
complete, the process would generate two double-stranded DNA 
molecules that were (1) identical to one another and (2) identical to 
the original DNA molecule. According to the Watson-Crick pro- 
posal, each DNA double helix would contain one strand from the 
original DNA molecule and one newly synthesized strand (see 
Figure 13.1). As we will see in Chapter 13, the Watson-Crick pro- 
posal fared quite well in predicting the mechanism of DNA replica- 
tion. Of the three primary functions listed above, only the mechanism 
by which DNA governs the assembly of a specific protein remained 
a total mystery. 

Not only was the elucidation of DNA structure important in its 
own right, it provided the stimulus for investigating all the activities 
in which the genetic material must take part. Once the model for its 
structure was accepted, any theory of a genetic code, DNA synthesis, 
or information transfer had to be consistent with that structure. 


REVIEW | 


1. What is meant by saying that a DNA strand has 
polarity? That the two strands are antiparallel? That 
the molecule has a major groove and a minor groove? 
That the strands are complementary to one another? 

2. What is the relationship between Chargaft's analysis 


of DNA base composition and the structure of the 
double helix? 


o—-—___— 


10.6 EXPERIMENTAL PATHWAYS 


The Chemical Nature of the Gene 


Three years after Gregor Mendel presented the results of his work 
on inheritance in pea plants, Friedrich Miescher graduated from 
a Swiss medical school and traveled to Tlibingen, Germany, to 
spend a year studying under Ernst Hoppe-Seyler, one of the fore- 
most chemists (and possibly the first biochemist) of the period. 
Miescher was interested in the chemical contents of the cell 
nucleus. To isolate material from cell nuclei with a minimum of 
contamination from cytoplasmic components, Miescher needed 
cells that had large nuclei and were easy to obtain in quantity. 


He chose white blood cells, which he obtained from the pus in 
surgical bandages that were discarded by a local clinic. Miescher 
treated the cells with dilute hydrochloric acid to which he added 
an extract from pig's stomach that removed protein (the stomach 
extract contained the proteolytic enzyme pepsin). The residue 
from this treatment was composed primarily of isolated cell nuclei 
that settled to the bottom of the vessel. Miescher then extracted 
the nuclei with dilute alkali. The alkali-soluble material was further 
purified by precipitation with dilute acid and reextraction with 
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dilute alkali. Miescher found that the alkaline extract contained 
a substance that had properties unlike any previously discovered: 
The molecule was very large, acidic, and rich in phosphorus. He 
called the material “nuclein.” His year up, Miescher returned 
home to Switzerland, while Hoppe-Seyler, who was cautious 
about the findings, repeated the work. With the results con- 
firmed, Miescher published the findings in 1871." 

Back in Switzerland, Miescher continued his studies of 
the chemistry of the cell nucleus. Living near the Rhine River, 
Miescher had ready access to salmon that had swum upstream 
and were ripe with eggs or sperm. Sperm were ideal cells to study 
nuclei. Like white blood cells, they could be obtained in large 
quantity, and 90 percent of their volume was occupied by nuclei. 
Miescher's nuclein preparations from sperm cells contained a 
higher percentage of phosphorus (almost 10 percent by weight) 
than those from white blood cells, indicating they had less con- 
taminating protein. In fact, they were the first preparations of 
relatively pure DNA. The term nucleic acid was coined in 1889 
by Richard Altmann, one of Miescher's students, who worked out 
the methods of purifying protein-free DNA from various animal 
tissues and yeast.” 

During the last two decades of the nineteenth century, 
numerous biologists focused on the chromosomes, describing 
their behavior during and between cell division (Section 10.2). One 
way to observe chromosomes was to stain these cellular structures 
with dyes. A botanist named E. Zacharias discovered that the same 
stains that made the chromosomes visible also stained a prepara- 
tion of nuclein that had been extracted using Miescher’s procedure 
of digestion with pepsin in an HCI medium. Furthermore, when 
the pepsin-HCl-extracted cells were subsequently extracted 
with dilute alkali, a procedure that was known to remove nuclein, 
the cell residue (which included the chromosome remnants) no 
longer contained stainable material. These and other results 
pointed strongly to nuclein as a component of the chromosomes. 
In one remarkably farsighted proposal, Otto Hertwig, who had 
been studying the behavior of the chromosomes during fertiliza- 
tion, stated in 1884: "| believe that | have at least made it highly 
probable that nuclein is the substance that is responsible not only 
for fertilization but also for the transmission of hereditary charac- 
teristics.”° Ironically, as more was learned about the properties of 
nuclein, the less it was considered as a candidate for the genetic 
material. 

During the 50 years that followed Miescher's discovery of 
DNA, the chemistry of the molecule and the nature of its compo- 
nents were described. Some of the most important contributions 
in this pursuit were made by Phoebus Aaron Levene, who emi- 
grated from Russia to the United States in 1891 and eventually 
settled in a position at the Rockefeller Institute in New York. It 
was Levene who finally solved one of the most resistant problems 
of DNA chemistry when he determined in 1929 that the sugar of 
the nucleotides was 2-deoxyribose.* To isolate the sugar, Levene 
and E. S. London placed the DNA into the stomach of a dog 
through a surgical opening and then collected the sample from 
the animal's intestine. As it passed through the stomach and 
intestine, various enzymes of the animal's digestive tract acted 
on the DNA, carving the nucleotides into their component parts, 
which could then be isolated and analyzed. Levene summarized 
the state of knowledge on nucleic acids in a monograph pub- 
lished in 1931.° 

While Levene is credited for his work in determining the 
structure of the building blocks of DNA, he is also credited as hav- 
ing been the major stumbling block in the search for the chemical 
nature of the gene. Through this period, it became increasingly 
evident that proteins were very complex and exhibited great 
specificity in catalyzing a remarkable variety of chemical reactions. 


FIGURE 1. The large glistening colonies on the right are virulent 
S-type pneumococci, whereas the smaller colonies on the left are 
nonvirulent R-type pneumococci. As discussed below, the cells in these 
particular S colonies are the result of transformation of R bacteria by 
DNA from heat-killed S pneumococci. 


Source: From O. T. Avery, C. M. Macleod, and M. Mccarty, J. Exp. Med. 
79:153, 1944. Reproduced with permission of The Rockefeller 
University Press. 


DNA, on the other hand, was thought to be composed of a 
monotonous repeat of its four nucleotide building blocks. The 
major proponent of this view of DNA, which was called the tetra- 
nucleotide theory, was Phoebus Levene. Because chromosomes 
consisted of only two components—DNA and protein—there 
seemed little doubt at that time that protein was the genetic 
material. 

Meanwhile, as the structure of DNA was being worked out, 
a seemingly independent line of research was being carried out in 
the field of bacteriology. During the early 1920s, it was found that 
a number of species of pathogenic bacteria could be grown in the 
laboratory in two different forms. Virulent bacteria, that is, bacte- 
rial cells capable of causing disease, formed colonies that were 
smooth, dome shaped, and regular. In contrast, nonvirulent bac- 
terial cells grew into colonies that were rough, flat, and irregular 
(FIGURE 1).° The British microbiologist J. A. Arkwright introduced 
the terms smooth (S) and rough (R) to describe these two types. 
Under the microscope, cells that formed S colonies were seen to 
be surrounded by a gelatinous capsule, whereas the capsule was 
absent from cells in the R colonies. The bacterial capsule helps 
protect a bacterium from its host's defenses, which explains why 
the R cells, which lacked these structures, were unable to cause 
infections in laboratory animals. 

Because of its widespread impact on human health, the 
bacterium responsible for causing pneumonia (Streptococcus 
pneumoniae, or simply pneumococcus) has long been a focus 
of attention among microbiologists. In 1923, Frederick Griffith, a 
medical officer at the British Ministry of Health, demonstrated that 
pneumococcus also grew as either S or R colonies and, further- 
more, that the two forms were interconvertible; that is, on occasion 
an R bacterium could revert to an S form, or vice versa.’ Griffith 
found, for example, if he injected exceptionally large numbers of 
R bacteria into a mouse, the animal frequently developed pneu- 
monia and produced bacteria that formed colonies of the S form. 

It had been shown earlier that pneumococcus occurred 
as several distinct types (types I, Il, and Ill) that could be distin- 
guished from one another immunologically. In other words, anti- 
bodies could be obtained from infected animals that would react 
with only one of the three types. Moreover, a bacterium of one 
type never gave rise to cells of another type. Each of the three 
types of pneumococcus could occur as either the R or S form. 


Live encapsulated (S) Live capsuleless (R) 


cells, type | cells, type Il 
Inject Inject 
into mouse into mouse 


Mouse survives 


Mouse dies 


Heat-killed S 
cells, type | 


Heat-killed S cells, type | 
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FIGURE 2 Outline of the experiment by Griffith of the discovery of bacterial transformation. 


In 1928, Griffith made a surprising discovery while inject- 
ing various bacterial preparations into mice.® Injections of large 
numbers of heat-killed S bacteria or small numbers of living 
R bacteria, by themselves, were harmless to the mouse. However, 
if he injected both of these preparations together into the same 
mouse, it contracted pneumonia and died. Virulent bacteria 
could be isolated from the mouse and cultured. To extend the 
findings, he injected combinations of bacteria of different types 
(FIGURE 2). Initially, eight mice were injected with heat-killed, 
type |, S bacteria together with a small inoculum of a live, type Il, 
R strain. Two of the eight animals contracted pneumonia, and Griffith 
was able to isolate and culture virulent, type I, S bacterial cells 
from the infected mice. Because there was no possibility that the 
heat-killed bacteria had been brought back to life, Griffith con- 
cluded that the dead type | cells had provided something to the 
live, nonencapsulated, type II cells that transformed them into an 
encapsulated type | form. The transformed bacteria continued to 
produce type | cells when grown in culture; thus the change was 
stable and permanent. 

Griffith's finding of transformation was rapidly confirmed by 
several laboratories around the world, including that of Oswald 
Avery, an immunologist at the Rockefeller Institute, the same 
institution where Levene was working. Avery had initially been 
skeptical of the idea that a substance released by a dead cell 
could alter the appearance of a living cell, but he was convinced 
when Martin Dawson, a young associate in his lab, confirmed the 
results.” Dawson went on to show that transformation need not 
occur within a living animal host. A crude extract of the dead S 
bacteria, when mixed in bacterial culture with a small number 
of the nonvirulent (R) cells in the presence of anti-R serum, was 
capable of converting the R cells into the virulent S form. The 
transformed cells were always of the type (|, Il, or Ill) characteristic 
of the dead S cells."° 

The next major step was taken by J. Lionel Alloway, another 
member of Avery's lab, who was able to solubilize the transforming 
agent. This was accomplished by rapidly freezing and thawing the 
killed donor cells, then heating the disrupted cells, centrifuging 
the suspension, and forcing the supernatant through a porcelain 
filter whose pores blocked the passage of bacteria. The soluble, 
filtered extract possessed the same transforming capacity as the 
original heat-killed cells." 

For the next decade, Avery and his colleagues focused their 
attention on purifying the substance responsible for transforma- 
tion and determining its identity. As remarkable as it may seem 
today, no other laboratory in the world was pursuing the identity 
of the “transforming principle,” as Avery called it. Progress on the 
problem was slow.'* Eventually, Avery and his co-workers, Colin 
MacLeod and Maclyn McCarty, succeeded in isolating a substance 
from the soluble extract that was active in causing transformation 


when present at only 1 part per 600 million. All the evidence 
suggested that the active substance was DNA: (1) It exhibited a 
host of chemical properties that were characteristic of DNA, (2) no 
other type of material could be detected in the preparation, and 
(3) tests of various enzymes indicated that only those enzymes 
that digested DNA inactivated the transforming principle. 

The paper published in 1944 was written with scrupulous 
caution and made no dramatic statements that genes were made 
of DNA rather than protein.'? The paper drew remarkably little 
attention. Maclyn McCarty, one of the three authors, describes an 
incident in 1949 when he was asked to speak at Johns Hopkins 
University along with Leslie Gay, who had been testing the effects 
of the new drug Dramamine for the treatment of sea sickness. 
The large hall was packed with people, and “after a short period 
of questions and discussion following [Gay's] paper, the president 
of the Society got up to introduce me as the second speaker. Very 
little that he said could be heard because of the noise created by 
people streaming out of the hall. When the exodus was complete, 
after | had given the first few minutes of my talk, | counted approx- 
imately thirty-five hardy souls who remained in the audience 
because they wanted to hear about pneumococcal transforma- 
tion or because they felt they had to remain out of courtesy.” But 
Avery's awareness of the potential of his discovery was revealed in 
a letter he wrote in 1943 to his brother Roy, also a bacteriologist: 


If we are right, & of course that's not yet proven, then it means 
that nucleic acids are not merely structurally important but 
functionally active substances in determining the biochemical 
activities and specific characteristics of cells—& that by 
means of a known chemical substance it is possible to induce 
predictable and hereditary changes in cells. This is something 
that has long been the dream of geneticists. . . . Sounds like 
a virus—may be a gene. But with mechanisms | am not now 
concerned—one step at a time. . . . Of course the problem 
bristles with implications. . . . It touches genetics, enzyme 
chemistry, cell metabolism & carbohydrate synthesis—etc. 
But today it takes a lot of well documented evidence to con- 
vince anyone that the sodium salt of deoxyribose nucleic 
acid, protein free, could possibly be endowed with such bio- 
logically active & specific properties & that evidence we are 
now trying to get. It's lots of fun to blow bubbles,—but it’s 
wiser to prick them yourself before someone else tries to. 


Many articles and passages in books have dealt with the 
reasons why Avery's findings were not met with greater acclaim. 
Part of the reason may be due to the subdued manner in which 
the paper was written and the fact that Avery was a bacteriologist, 
not a geneticist. Some biologists were persuaded that Avery’s 
preparation must have been contaminated with minuscule 
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amounts of protein and that the contaminant, not the DNA, 
was the active transforming agent. Others questioned whether 
studies on transformation in bacteria had any relevance to the 
field of genetics. 

During the years following the publication of Avery's paper, 
the climate in genetics changed in an important way. The existence 
of the bacterial chromosome was recognized, and a number of 
prominent geneticists turned their attention to these prokary- 
otes. These scientists believed that knowledge gained from the 
study of the simplest cellular organisms would shed light on the 
mechanisms that operate in the most complex plants and ani- 
mals. In addition, the work of Erwin Chargaff on the base com- 
position of DNA shattered the notion that DNA was a molecule 
consisting of a simple repetitive series of nucleotides (discussed 
in Section 10.6).'* This finding awakened researchers to the possi- 
bility that DNA might have the properties necessary to fulfill a role 
in information storage. 

Seven years after the publication of Avery's paper on bacte- 
rial transformation, Alfred Hershey and Martha Chase of the Cold 
Spring Harbor Laboratories in New York turned their attention to 
an even simpler system—bacteriophages, or viruses that infect 
bacterial cells. By 1950, researchers recognized that viruses also 
had a genetic program. Viruses injected their genetic material 
into the host bacterium, where it directed the formation of new 
virus particles inside the infected cell. Within a matter of minutes, 
the infected cell broke open, releasing new bacteriophage parti- 
cles, which then infected neighboring host cells. 

It was clear that the genetic material directing the formation of 
viral progeny had to be either DNA or protein because these were 
the only two molecules the virus contained. Electron microscopic 
observations showed that, during the infection, the bulk of the 
bacteriophage remains outside the cell, attached to the cell sur- 
face by tail fibers (FIGURE 3). Hershey and Chase reasoned that the 
virus’s genetic material must possess two properties. First, if the 
material were to direct the development of new bacteriophages 
during infection, it must pass into the infected cell. Second, if the 
material carries inherited information, it must be passed on to the 
next generation of bacteriophages. Hershey and Chase prepared 
two batches of bacteriophages to use for infection (FIGURE 4). 
One batch contained radioactively labeled DNA (82P]DNA); the 
other batch contained radioactively labeled protein ([°°S]protein). 
Because DNA lacks sulfur (S) atoms, and protein usually lacks 
phosphorus (P) atoms, these two radioisotopes provided distin- 
guishing labels for the two types of macromolecules. Their exper- 
imental plan was to allow one or the other type of bacteriophage 
to infect a population of bacterial cells, wait a few minutes, and 
then strip the empty viruses from the surfaces of the cells. After 
trying several methods to separate the bacteria from the attached 
phage coats, they found this was best accomplished by subject- 
ing the infected suspension to the spinning blades of a Waring 
blender. Once the virus particles had been detached from the 
cells, the bacteria could be centrifuged to the bottom of the tube, 
leaving the empty viral coats in suspension. 

By following this procedure, Hershey and Chase deter- 
mined the amount of radioactivity that entered the cells versus 
the amount that remained behind in the empty coats. They found 
that when cells were infected with protein-labeled bacterio- 
phages, the bulk of the radioactivity remained in the empty coats. 
In contrast, when cells were infected with DNA-labeled bacte- 
riophages, the bulk of the radioactivity passed inside the host 
cell. When they monitored the radioactivity passed onto the next 
generation, they found that less than 1 percent of the labeled 
protein could be detected in the progeny, whereas approximately 
30 percent of the labeled DNA could be accounted for in the next 
generation. 


FIGURE 3 Electron micrograph of a bacterial cell infected by T4 
bacteriophage. Each phage is seen attached by its tail fibers to the 
outer surface of the bacterium, while new phage heads are being 
assembled in the host cell’s cytoplasm. 


Source: Lee D. Simon/Photo Researchers, Inc. 


The publication of the Hershey-Chase experiments in 1952,'° 
together with the abandonment of the tetranucleotide theory, 
removed any remaining obstacles to the acceptance of DNA as 
the genetic material. Suddenly, tremendous new interest was 
generated in a molecule that had largely been ignored. The stage 
was set for the discovery of the double helix. 
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FIGURE 4 The Hershey-Chase experiment showing that bacterial cells infected with phage containing °*P-labeled DNA (red DNA molecules) 
became radioactively labeled and produced labeled phage progeny. In contrast, bacterial cells infected with phage containing *°S-labeled protein 
(red phage coats) did not become radioactively labeled and produced only nonlabeled progeny. 


10.7 DNA Supercoiling 


In 1963, Jerome Vinograd and his colleagues at the California 
Institute of Technology discovered that two closed, circular DNA 
molecules of identical molecular mass could exhibit very different 
rates of sedimentation during centrifugation (Section 18.10). Further 
analysis indicated that the DNA molecule sedimenting more rapidly 
had a more compact shape because the molecule was twisted upon 
itself (FIGURE 10.13a, b), much like a rubber band in which the two 
ends are twisted in opposite directions or a tangled telephone cord 
after extended use. DNA in this state is said to be supercoiled. 
Because supercoiled DNA is more compact than its relaxed counter- 
part, it occupies a smaller volume and moves more rapidly in 
response to a centrifugal force or an electric field (Figure 10.13c). 
Supercoiling is best understood if one pictures a length of double- 
helical DNA lying free on a flat surface. A molecule in this condition 
has the standard number of 10 base pairs per turn of the helix and is 
said to be relaxed. The DNA would still be relaxed if both ends of the 
two strands were simply fused to form a circle. Consider, however, 
what would happen if the molecule were twisted before the ends 
were sealed. If the length of DNA were twisted in a direction opposite 


to that in which the duplex was wound, the molecule would tend to 
unwind. An underwound DNA molecule has a greater number of 
base pairs per turn of the helix (FIGURE 10.14). Because the molecule 
is most stable with 10 residues per turn, it tends to resist the strain of 
becoming underwound by becoming twisted upon itself into a 
supercoiled conformation (Figure 10.14). 

DNA is referred to as negatively supercoiled when it is under- 
wound and positively supercoiled when it is overwound. Circular 
DNAs found in nature (e.g., mitochondrial, viral, bacterial) are 
invariably negatively supercoiled. Supercoiling is not restricted to 
small, circular DNAs but also occurs in linear, eukaryotic DNA. For 
example, negative supercoiling plays a key role in allowing the DNA 
of the chromosomes to be compacted so as to fit inside the confines 
of a cell nucleus (Section 12.2). Because negatively supercoiled DNA 
is underwound, it exerts a force that helps separate the two strands of 
the helix, which is required during both replication (DNA synthesis) 
and transcription (RNA synthesis). 

Cells rely on enzymes to change the supercoiled state of a DNA 
duplex. These enzymes are called topoisomerases because they 
change the topology of the DNA. Topoisomerases were discovered in 
1971 by James Wang of the University of California, Berkeley. Cells 
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FIGURE 10.13 Supercoiled DNA. (a,b) Electron micrographs showing the 
differences in conformation between a relaxed, circular molecule of phage 
DNA (a) and the same type of molecule in a supercoiled state (b). (c) When 
a mixture of relaxed and supercoiled SV40 DNA molecules is subjected to 
gel electrophoresis, the highly compact, supercoiled form (seen at the 
bottom of the gel) moves much more rapidly than the relaxed form. The 
DNA molecules are visualized by staining the gel with ethidium bromide, 

a fluorescent molecule that inserts itself into the double helix. 


Source: (a and b) Courtesy James C. Wang; (c) From Walter Keller, Proc. 
Nat'l Acad. Sci. U.S. 72:2553, 1975. 


FIGURE 10.14 Underwound DNA. The DNA molecule at the left is 
underwound; that is, it has more than an average of 10 base pairs per turn 
of a helix. An underwound molecule spontaneously assumes a negatively 
supercoiled conformation, as shown on the right. 


contain a variety of topoisomerases, which can be divided into two 
classes. Type I topoisomerases change the supercoiled state of a DNA 
molecule by creating a transient break in one strand of the duplex. 


A model for the mechanism of action of human topoisomerase I is 
shown in FIGURE 10.15a. The enzyme cleaves one strand of the DNA 
and then allows the intact, complementary strand to undergo a 
controlled rotation, which relaxes the supercoiled molecule. 
Topoisomerase I is essential for processes such as DNA replication 
and transcription. It functions in these activities by preventing 
excessive supercoiling from building up as the complementary 
strands of a DNA duplex separate and unwind (as in Figure 13.6). 
Type II topoisomerases make a transient break in both strands of a 
DNA duplex. Another segment of the DNA molecule (or a separate 
molecule entirely) is then transported through the break, and the 
severed strands are resealed. As expected, this complex reaction 
mechanism is accompanied by a series of dramatic conformational 
changes that are depicted in the model in Figure 10.15b. These 
enzymes are capable of some remarkable “tricks.” In addition to 
being able to supercoil and relax DNA (Figure 10.15c, panel 1), type II 
topoisomerases can tie a DNA molecule into knots or untie a DNA 
knot (Figure 10.15c, panel 2). They can also cause a population of 
independent DNA circles to become interlinked (catenated), or 
separate interlinked circles into individual components (Figure 
10.15c(3),d). Topoisomerase II is required to disentangle DNA mol- 
ecules before duplicated chromosomes can be separated during 
mitosis. Human topoisomerase IJ is a target for a number of drugs 
(e.g., etoposide and doxorubicin) that bind to the enzyme and pre- 
vent the cleaved DNA strands from being resealed. In doing so, these 
drugs preferentially kill rapidly dividing cells and are therefore used 
in the treatment of cancer. 


REVIEW | 


1. How do underwound and overwound DNAs differ 
from one another? How do the two classes of 
topoisomerases differ? 


eo 


10.8 The Complexity 
of the Genome 


DNA is a macromolecule—an assemblage of a large number of 
atoms bonded together in a defined arrangement whose three- 
dimensional structure can be described by techniques such as X-ray 
crystallography. But DNA is also an information repository, which is 
a property more difficult to describe in simple molecular terms. If, as 
noted above, a gene corresponds to a particular segment of DNA, 
then the sum of the genetic information that an individual organism 
inherits from its parents is equivalent to the sum of all of the DNA 
segments present in the fertilized egg at the start of life. All the indi- 
viduals that make up a species population share the same set of 
genes, even though different individuals invariably possess slightly 
different versions (alleles) of many of these genes. Thus each species 
of organism has a unique content of genetic information, which is 
known as its genome. For humans, the genome is essentially equivalent 
to all of the genetic information that is present in a single (haploid) 
set of human chromosomes, which includes 22 different autosomes 
and both the X and Y sex chromosomes. 

To understand how the complexity of the genome is deter- 
mined, it is first necessary to consider one of the most important 
properties of the DNA double helix: its ability to separate into its two 
component strands, a property termed denaturation. 
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DNA Denaturation 


As first suggested by Watson and Crick, the two strands of a DNA 
molecule are held together by weak, noncovalent bonds. When DNA 
is dissolved in saline solution and the solution is slowly warmed, a 
temperature is reached when strand separation begins. Within a few 


FIGURE 10.15 DNA topoisomerases. (a) A model depicting the action of 
human topoisomerase |. The enzyme (yellow) cuts one of the strands of the 
DNA (step 1), which rotates around a phosphodiester bond in the intact 
strand. The cut strand is then resealed (step 2). (Note: The drawing depicts 
a type IB topoisomerase; type IA enzymes found in bacteria act by a 
different mechanism.) (b) A molecular model based on X-ray crystallography 
depicting the action of topoisomerase Il, a dimeric enzyme consisting of 
two identical halves. In step 1, the enzyme has bound the G-DNA segment, 
so named because it will form the gate through which the T-DNA (or 
transported DNA) segment will pass. In step 2, the dimeric enzyme has 
hydrolyzed two molecules of ATP and undergone a conformational change 
as the two ATPase domains close. In step 3, the G-segment is cleaved, and 
the T-segment is passed through the open “gate.” At this stage, both cut 
ends of the G-segment are covalently bound to the enzyme. In step 4, the 
two ends of the G-segment are rejoined, and the T-segment exits via the C 
gate. (c) Types of reactions that are catalyzed by topoisomerases. Part 1 
shows supercoiling-relaxation reactions; part 2 shows knotting—unknotting 
reactions; part 3 shows catenation reactions. (d) Electron micrograph of a 
pair of interconnected (catenated) circular DNA molecules. Molecules of this 
type collect in bacteria lacking a specific topoisomerase. 

Source: (a) Reprinted by permission from Macmillan Publishers Ltd: D. A. 
Koster et al., Nature 434:671, 2005; Copyright 2005. (b) Reprinted by 
permission from Macmillan Publishers Ltd: J. M. Berger et al., Nature 
388:231, 1996; copyright 1997.; (d) From N.Cozarrelli, Cell vol. 71, cover 
#2, 1992, with permission from Elsevier. 


degrees, the process is generally complete and the solution contains 
single-stranded molecules that are completely separated from their 
original partners. The progress of thermal denaturation (or DNA 
melting) is usually monitored by following the increase in absorb- 
ance of ultraviolet light by the dissolved DNA. Once the two DNA 
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from base stacking are greatly decreased, which changes the elec- 
tronic nature of the bases and increases their absorbance of ultravio- 
let radiation. The rise in absorbance that accompanies DNA 
denaturation is shown in FIGURE 10.16. The temperature at which 
the shift in absorbance is half completed is termed the melting tem- 
perature (T,,). The higher the GC content (%G + %C) of the DNA, 
the higher the T,,,. This increased stability of GC-containing DNA 
reflects the presence of the extra hydrogen bond between the bases 
as compared with AT pairs (Figure 10.1 1c). 


DNA Renaturation 


The separation of the two strands of the DNA duplex by heat is not 
an unexpected finding, but the reassociation of single strands into 
stable double-stranded molecules with correct base pairs seems 
almost inconceivable. However, in 1960, Julius Marmur and Paul 
Doty at Harvard University found that if they slowly cooled a 
solution of bacterial DNA that had been thermally denatured, the 
DNA regained the properties of the double helix; it absorbed less 
ultraviolet light and once again behaved like genetic material in 
being able to transform bacterial cells (discussed in Section 10.6). It 
became apparent from these studies that complementary single- 
stranded DNA molecules were capable of reassociating, an event 
termed renaturation, or reannealing. This finding has proved to 
be one of the most valuable observations ever made in molecular 
biology. On one hand, reannealing has served as the basis for an 
investigation into the complexity of the genome, a subject discussed 
in the following sections. On the other hand, reannealing has led to the 
development of a methodology called nucleic acid hybridization, in 
which complementary strands of nucleic acids from different sources can 
be mixed to form double-stranded (hybrid) molecules. Examples of 
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FIGURE 10.16 Thermal denaturation of DNA. A thermal denaturation 
curve for native bacteriophage T6é DNA in 0.3 M sodium citrate. The 
“melting” of the DNA (strand separation) occurs over a narrow range of 
temperature, particularly for the simpler DNAs of small viruses. The 
temperature corresponding to half the increase in absorbance is 

termed the T,,. 


Source: J. Marmur and P. Doty, Journal Molecular Biology 3:593, 1961; 
copyright 1961. Journal of Molecular Biology by Academic Press. 
Reproduced with permission of Academic Press in the format reuse in a 
book/textbook via Copyright Clearance Center. 


the types of questions that have been answered by allowing single- 
stranded nucleic acids to hybridize are discussed later in the chapter 
and illustrated in Figure 10.20. Nucleic acid hybridization plays a 
key role in our most important modern biotechnologies, including 
DNA sequencing, DNA cloning, and DNA amplification. 

Studying the rate at which genomes from different organisms 
reanneal has provided insight into the types of sequences that exist 
within these genomes. The renaturation of viral and bacterial DNA 
fragments occurs along single, symmetrical curves (FIGURE 10.17), 
suggesting that these simple genomes primarily consist of genes 
arranged along a linear array. In contrast, when DNA fragments 
from plants and animals are allowed to reanneal, the curve typically 
shows three more-or-less distinct steps (FIGURE 10.18), which cor- 
respond to the reannealing of three broad classes of DNA sequences. 
The three classes reanneal at different rates because they differ as to 
the number of times their nucleotide sequence is repeated within the 
population of fragments. The three classes are termed the highly 
repeated fraction, the moderately repeated fraction, and 
the nonrepeated fraction. 


HIGHLY REPEATED DNA SEQUENCES The highly 
repeated fraction (also called tandem repeats) constitutes any- 
where from about 1 to 10 percent of the total DNA. These sequences 
are typically short (a few hundred nucleotides at their longest) and 
present in clusters in which the given sequence repeats itself over 
and over again without interruption. A sequence arranged in this 
end-to-end manner is said to be present in tandem. Highly repeated 
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FIGURE 10.17 The kinetics of renaturation of viral and bacterial 
DNAs. The curves show the renaturation of sheared strands of DNA from 
two viruses (MS-2 and T4) and a bacterium (E. coli). (The formation of 
double-stranded DNA is plotted against Cot, which is a term that combines 
two variables: initial DNA concentration (Cg) and time of incubation (t). A 
solution containing a high concentration of DNA incubated for a short time 
will have the same Cot as one of low concentration incubated for a 
correspondingly longer time; both will have the same percentage of 
reannealed DNA.) The genome size, that is, the number of nucleotide base 
pairs in one copy of total genetic information of the organism, is indicated 
by the arrows near the upper numerical scale. The shape of each of the 
renaturation curves is very simple and occurs with a single slope. However, 
the time over which renaturation occurs is very different and depends 

on the concentration of complementary fragments, which in turn depends 
on the size of the genome. The larger the genome, the lower the concen- 
tration of complementary fragments in solution, and the greater the time 
required for renaturation to be complete. 
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FIGURE 10.18 An idealized plot showing the kinetics of renaturation 
of eukaryotic DNA. When single-stranded DNA is allowed to reanneal, 
three classes of fragments can usually be distinguished by their frequency 
of repetition within the genome: a highly repeated DNA fraction, a 
moderately repeated DNA fraction, and a nonrepeated (single-copy) DNA 
fraction. (Note: This is an idealized plot: The three classes of sequences are 
not as clearly separated in an actual renaturation curve.) 


sequences fall into several overlapping categories, including satellite 
DNAs, minisatellite DNAs, and microsatellite DNAs. 


e Satellite DNAs. Satellite DNAs consist of short sequences 
(about five to a few hundred base pairs in length) that form very 
large linear arrays, each containing up to several million base 
pairs of DNA. In many species, the base composition of these 
DNA segments is sufficiently different from the bulk of the DNA 
that fragments containing the sequence can be separated into a 
distinct “satellite” band during density gradient centrifugation 
(hence the name satellite DNA). Satellite DNAs tend to evolve 
very rapidly, causing the sequences of these genomic elements to 
vary even between closely related species. The localization of 
satellite DNA within the centromeres of chromosomes is 
described in Figure 10.20 and in further detail in Section 12.5. 


e Minisatellite DNAs. Minisatellite sequences range from about 
10 to 100 base pairs in length and are found in sizeable clusters 
containing as many as 3000 repeats. Thus, minisatellite 
sequences occupy considerably shorter stretches of the genome 
than do satellite sequences. Minisatellites tend to be unstable, 
and the number of copies of a particular sequence often 
increases or decreases from one generation to the next, most 
likely as the result of unequal crossing over (see Figure 10.23). 
Consequently, the length of a particular minisatellite locus is 
highly variable in the population, even among members of the 
same family. Because they are so variable (or polymorphic) in 
length, minisatellite sequences form the basis for the technique 
of DNA fingerprinting, which is used to identify individuals in 
criminal or paternity cases (FIGURE 10.19). 


e Microsatellite DNAs. Microsatellites are the shortest 
sequences (1 to 9 base pairs long) and are typically present in 
small clusters of about 10 to 40 base pairs in length, which are 
scattered quite evenly through the genome. DNA replicating 
enzymes have trouble copying regions of the genome that 
contain these small, repetitive sequences, which causes these 
stretches of DNA to change in length through the generations. 
Because of their variable lengths within the population, 
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FIGURE 10.19 DNA fingerprinting. In this technique, which is used 
widely to identify an individual from a sample of DNA, the DNA is digested 
by treatment with specific nucleases (called restriction endonucleases, 
described in Section 18.20), and the DNA fragments are separated on the 
basis of length by gel electrophoresis. The location in the gel of DNA 
fragments containing specific DNA sequences is determined using labeled 
probes with sequences complementary to those being sought. The DNA 
fragments that bind these probes have variable lengths from one person to 
the next because of the presence of variable numbers of short tandem 
repeats (STRs) in the genome. Forensic labs typically analyze about 13 STR 
markers that are known to be highly polymorphic. The chance that two 
individuals will have identical STR profiles is astronomically small. The 
fingerprint shown in this figure was used in a criminal case in which the 
defendant was charged with the fatal stabbing of a young woman. The 
bloodstains on the pants and shirt of the defendant were compared to the 
known blood standards from the victim and defendant. DNA from the 
bloodstains on the defendant's clothing did not match his own blood 
standard, but they did match that of the victim. The lanes contain DNA 
from the following sources: 1, 2, 3, 9, and 10 are control DNA samples that 
serve as quality control checks; 4, the defendant's blood; 5, bloodstains 
from the defendant's pants; 6 and 7, bloodstains from the defendant's shirt; 
and 8, the victim's blood. With the advent of DNA amplification techniques 
(i.e., PCR), minuscule samples of DNA can be used for these analyses. 
Source: Courtesy of Orchid Cellmark, Princeton, New Jersey. 


microsatellite DNAs have been used to analyze the relationships 
between different human populations, as illustrated in the 
following example. It is generally agreed that modern humans 
arose in Africa. If this is true, then members of different African 
populations should exhibit greater DNA sequence variation than 
human populations living on other continents because genomes 
of African populations have had longer to diverge. The argument 
for African genesis has received support from numerous studies 
on human DNA sequences. In one study that analyzed 60 
different microsatellite loci, it was found that members of 
African populations had a significantly greater genetic 
divergence than Asian or European populations. The majority of 
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microsatellite loci occur outside of genes, and changes in their 
length generally go unnoticed. This is not the case for the 
microsatellite sequences discussed in the accompanying Human 
Perspective (Section 10.16). 


Once it became apparent that eukaryotic genomes contain large 
numbers of copies of short DNA sequences, researchers sought to 
learn where these DNA sequences are located in the chromosomes. 
Are satellite DNA sequences, for example, clustered in particular 
parts of a chromosome, or scattered uniformly from one end to the 
other? It was noted above that the discovery of DNA reannealing led 
to the development of a vast nucleic acid hybridization methodology 
(discussed at length in Chapter 18). The ability to locate satellite DNA 
sequences illustrates the analytic power of nucleic acid hybridization. 

In the reannealing experiments described in Figure 10.18, com- 
plementary DNA strands were allowed to bind to one another while 
colliding randomly in solution. An alternate experimental protocol 
called in situ hybridization was developed in 1969 by Mary Lou 
Pardue and Joseph Gall of Yale University and was used to determine 
the location of satellite DNA. The term in situ means “in place,” 
which refers to the fact that the DNA of the chromosomes is kept in 
place while it is allowed to react with a particular preparation of 
labeled DNA. In early in situ hybridization studies, the DNA to be 
detected (the probe DNA) was radioactively labeled and was local- 
ized by autoradiography. The resolution of the technique has been 


increased using probes that are labeled with fluorescent dyes, then 
localized with a fluorescence microscope (as in FIGURE 10.20b). This 
latter technique is called fluorescence in situ hybridization 
(FISH) and has become so refined that it can be used to map the 
locations of different sequences along single DNA fibers. 

To carry out nucleic acid hybridization, both interactants must be 
single-stranded. In the experiment depicted in Figure 10.20, the chro- 
mosomes from a mitotic cell are spread on a slide, and the DNA is 
made single-stranded by treating the chromosomes with a hot salt 
solution that causes the DNA strands to separate and remain apart. 
During the ensuing hybridization step, the denatured chromosomes 
are incubated with a solution of biotin-labeled, single-stranded satel- 
lite DNA, which binds selectively to complementary strands of immo- 
bilized satellite DNA located in the chromosomes. Following the 
incubation period, the soluble, unhybridized satellite DNA is washed 
away or digested, and the sites of the bound, labeled DNA fragments 
are revealed. As shown in Figure 10.20, satellite DNA is localized in 
the centromeric regions of the chromosome (see Figure 12.26). 
Another example using FISH technology is shown in FIGURE 10.21. 


MODERATELY REPEATED DNA SEQUENCES The 
moderately repeated fraction of the genomes of plants and animals 
can vary from about 20 to more than 80 percent of the total DNA, 
depending on the organism. This fraction includes sequences that 
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FIGURE 10.20 Fluorescence in situ hybridization and the localization of 
satellite DNA. (a) The steps that are taken in carrying out fluorescence in situ 
hybridization. In this technique, certain of the nucleotides in the probe DNA are 
linked covalently to a small organic molecule, usually biotin. Following hybridiza- 
tion, the location of the bound, biotin-labeled DNA can be visualized by treating 
the preparation with fluorescently labeled avidin, a protein that binds to biotin 
with very high affinity. Chromosomes in these preparations usually appear red 
because they have been counterstained with propidium iodide. (b) Localization 
of a-satellite DNA at the centromere of human chromosomes. The location of 
the bound, biotin-labeled, satellite DNA is revealed by the yellow fluorescence, 
which stands out against a background of red, counterstained chromosomes. 
Fluorescence appears only at the site where each chromosome is constricted, 
which marks the location of the centromere. 


Source: From Huntington F. Willard, Trends Genet. 6:414, 1990, with permission 
from Elsevier. 


FIGURE 10.21 Chromosomal localization of a nonrepeated DNA 
sequence. These mitotic chromosomes were prepared from a dividing 
mouse cell and incubated with a purified preparation of biotin-labeled DNA 
encoding one of the nuclear lamin proteins (lamin Bz), which is encoded by 
a nonrepeated gene. The locations of the bound, labeled DNA appear as 
bright dots. The lamin gene is present on the homologues of chromosome 
10. Each chromosome contains two copies of the gene because the DNA 
had been replicated prior to the cells entering mitosis. 

Source: From Monika Zewe et al., courtesy of Werner Franke, Eur. J.Cell 
Biol. 56:349, 1991, with permission from Elsevier. 


are repeated within the genome anywhere from a few times to tens of 
thousands of times. Included in the moderately repeated DNA frac- 
tion are some sequences that code for known gene products, either 
RNAs (such as rRNAs) or proteins (including histones), but the bulk 
of this DNA fraction lacks a coding function. Rather than occurring 
as clusters of tandem sequences, these noncoding elements are scat- 
tered (ie., interspersed) throughout the genome. Most of these 
repeated sequences can be grouped into two classes that are referred 
to as SINEs (short interspersed elements) or LINEs (Jong interspersed 
elements). SINEs and LINEs sequences are discussed on page 393. 


NONREPEATED DNA SEQUENCES $s initially pre- 
dicted by Mendel, classical studies on the inheritance patterns of vis- 
ible traits led geneticists to conclude that each gene was present in 
one copy per single (haploid) set of chromosomes. When denatured 
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fragments are very slow to find partners, so slow in fact that they are 
presumed to be present in a single copy per genome. This fraction 
comprises the nonrepeated (or single-copy) DNA sequences, which 
includes the genes that exhibit Mendelian patterns of inheritance. 
Because they are present in a single copy in the genome, nonrepeated 
sequences localize to a particular site on a particular chromosome 
(Figure 10.21). 

Included within the nonrepeated fraction are the DNA 
sequences that code for virtually all proteins other than histones. 
Even though these sequences are not present in multiple copies, 
genes that code for polypeptides are usually members of a family of 
related genes. This is true for the globins, actins, myosins, collagens, 
tubulins, integrins, and most other proteins in a eukaryotic cell. 
Each member of a multigene family is encoded by a different but 
related sequence. We will look into the origin of these multigene 
families in the following section. 

Now that the human genome has been sequenced and analyzed, 
we finally have a relatively accurate measure of the DNA sequences 
that are responsible for encoding the amino acid sequences of our 
proteins, and it is remarkably small. If you would have suggested to 
a geneticist in 1960 that less than 1.5 percent of the human genome 
encodes the amino acids of our proteins, he or she would have con- 
sidered the suggestion to be ridiculous. Yet, that is the reality that has 
emerged from the study of genome sequences. In the following sec- 
tion, we will try to better understand how the remaining 98+ percent 
of DNA sequences might have evolved. 


REVIEW | 


1. What is a genome? How does the complexity of 
bacterial genomes differ from that of eukaryotic 
genomes? 

2. What is meant by the term DNA denaturation? How 


does denaturation depend on the GC content of the 
DNA? How does this variable affect the T,,? 


3. What is a microsatellite DNA sequence? What role 
do these sequences play in human disease? 


4. Which fraction of the genome contains the most 
information? Why is this true? 


10.9 THE HUMAN PERSPECTIVE 


Diseases That Result from Expansion of Trinucleotide Repeats 


For decades, biologists believed that genes were always transmitted 
from generation to generation as stable entities. On rare occasion, 
a change occurs in the nucleotide sequence of a gene in the germ 
line, creating a mutation that is subsequently inherited. This is one 
of the basic tenets of Mendelian genetics. Then in 1991, several 
laboratories reported a new type of “dynamic mutation” in which 
the nucleotide sequence of particular genes changed dramati- 
cally between parents and offspring. In each case, these mutations 
affected genes that contained a repeating trinucleotide unit (e.g., 
CCG or CAG) as part of their sequence. In most members of the 


population, these particular genes contain a relatively small (but var- 
iable) number of repeated trinucleotides, and they are transmitted 
from one generation to the next without a change in the number of 
repeats. In contrast, a small fraction of the population possesses a 
mutant version of the gene that contains a larger number of these 
repeating units. Unlike the normal version, the mutant alleles are 
highly unstable, and the number of repeating units tends to increase 
as the gene passes from parents to offspring. When the number of 
repeats increases beyond a critical number, the individual inheriting 
the mutant allele develops a serious disease. 


continued 
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At the present time, more than 20 inherited disorders have 
been attributed to expansion of trinucleotide repeats. These dis- 
eases fall into two basic categories, which we will consider in turn. 
Type | diseases are all neurodegenerative disorders that result 
from the expansion of the number of repeats of a CAG trinucleo- 
tide within the coding portion of the mutant gene (FIGURE 1). We 
can illustrate the nature of these disorders by considering the most 
prevalent and widely studied of the group, Huntington's disease 
(HD). HD is a fatal illness characterized by involuntary, uncoordi- 
nated movements; changes in personality, including depression 
and irritability; and gradual intellectual decline. Symptoms usually 
begin in the third to fifth decade of life and increase in severity 
until death. 

The normal HD gene, which is stably transmitted, contains 
between 6 and 35 copies of the CAG trinucleotide. The pro- 
tein encoded by the HD gene is called huntingtin, and its pre- 
cise function remains unclear. CAG is a triplet that encodes the 
amino acid glutamine. Thus the normal huntingtin polypeptide 
contains a stretch of 6 to 35 glutamine residues—a polyglutamine 
tract—as part of its primary structure. We think of polypeptides 
as having a highly defined primary structure, and most of them 
do, but huntingtin is normally polymorphic with respect to the 
length of its polyglutamine tract. The protein appears to function 
normally as long as the tract length remains below approxi- 
mately 35 glutamine residues. But if this number is exceeded, the 
protein takes on new properties, and the person is predisposed 
to developing HD. 

HD exhibits a number of unusual characteristics. Unlike most 
inherited diseases, HD is a dominant genetic disorder, which 
means that a person with the mutant allele will develop the dis- 
ease regardless of whether he or she has a normal HD allele. In 
fact, persons who are homozygous for the HD allele are no more 
seriously affected than heterozygotes. This observation indicates 
that the mutant huntingtin polypeptide causes the disease, not 
because it fails to carry out a particular function, but because it 
acquires some type of toxic property, which is referred to as a 
gain-of-function mutation. This interpretation is supported by 
studies with mice. Mice that are engineered to carry the mutant 
human HD allele (in addition to their own normal alleles) develop 
a neurodegenerative disease resembling that found in humans. 
The presence of the one abnormal allele is sufficient to cause dis- 
ease. Another unusual characteristic of HD and the other CAG 
disorders is a phenomenon called genetic anticipation, which 
means that, as the disease is passed from generation to genera- 
tion, its severity increases and/or it strikes at an increasingly ear- 
lier age. This was once a puzzling feature of HD but is now readily 


explained by the fact that the number of CAG repeats in a mutant 
allele (and the resulting consequences) often increases dramati- 
cally from one generation to the next. 

The molecular basis of HD remains unclear, but there is no 
shortage of theories as to why an expanded glutamine tract may 
be toxic to brain cells. One feature appears indisputable: When 
the polyglutamine tract of huntingtin exceeds about 35 residues, 
the protein (or a fragment cleaved from it) undergoes abnormal 
folding to produce a misfolded molecule that (1) binds to other 
mutant huntingtin molecules to form insoluble aggregates, 
not unlike those seen in the brains of Alzheimer's victims, and 
(2) binds to a number of unrelated proteins that do not interact 
with normal, wild-type huntingtin molecules. Among the proteins 
bound by mutant huntingtin are several transcription factors, 
which are proteins involved in the regulation of gene expression. 
Several of the most important transcription factors present in 
cells, including TBP (see Figure 11.17) and CBP (see Figure 12.50), 
contain polyglutamine stretches themselves, which makes them 
particularly susceptible to aggregation by proteins with mutant, 
expanded polyglutamine tracts. In fact, the protein aggregates 
present in degenerating neurons of HD patients contain both 
of these transcription factors. These results suggest that mutant 
huntingtin sequesters transcription factors, disrupting the tran- 
scription of genes that are required for the health and survival of 
the affected neurons. This hypothesis has received support from 
a study in which mice were genetically engineered so that their 
brain cells lacked the ability to produce certain key transcription 
factors. These mice exhibited the same type of neurodegenera- 
tion as is seen in animals carrying a mutant HD gene. Other basic 
neuronal processes, including axonal transport, mitochondrial 
permeability and fission, cholesterol synthesis, and protein deg- 
radation, are also disrupted by the HD mutation and represent 
possible causes of nerve cell death. It should be noted that a 
number of HD researchers hold an alternate opinion about the 
cause of cell death. They argue that it is not the protein aggre- 
gates that are toxic, but the soluble mutant protein (or frag- 
ments of the mutant protein) itself. In fact, proponents of this 
view argue that the mutant protein aggregates protect the cell 
by sequestering the harmful molecules. It is important for prac- 
tical reasons to distinguish between these possibilities because a 
number of proposed therapies are aimed at blocking formation 
of the aggregates, which could actually prove more harmful to a 
patient. 

Type Il trinucleotide repeat diseases differ from the Type | 
diseases in a number of ways. Type II diseases (1) arise from the 
expansion of a variety of trinucleotides, not only CAG, (2) the trinu- 
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FIGURE 1. Trinucleotide repeat sequences and human disease. The top line shows a generalized gene that is transcribed into a messenger RNA 
with several distinct portions, including a 5’ noncoding portion called the 5’ UTR (5’ untranslated region), a coding exon that carries the information for 
the amino acid sequence of the polypeptide, and a 3’ noncoding portion (the 3’ UTR). The introns in the DNA (see Figure 11.27) are not represented in 
the mature messenger RNA. The general location of the trinucleotide responsible for each of four different diseases (fragile X syndrome, Friedreich's 
ataxia, Huntington's disease, and myotonic dystrophy) is indicated by the location of each pyramid. The number of repeats responsible for the normal 
(red), carrier (orange), and disease (yellow) conditions for each disease-causing gene is indicated. Genes responsible for Type | diseases, such as 
Huntington's, do not exhibit the intermediate “carrier” state in which an individual possesses an unstable allele but is not affected. 


Source: Reprinted by permission from Macmillan Publishers Ltd: J-L Mandel, Nature 386:768, 1997; Copyright 1997. 


cleotides involved are present in a part of the gene that does not 
encode amino acids (Figure 1), (3) the trinucleotides are subject to 
massive expansion into thousands of repeats, and (4) the diseases 
affect numerous parts of the body, not only the brain. The best 
studied Type II disease is fragile X syndrome, so named because 
the mutant X chromosome is especially susceptible to damage. 
Fragile X syndrome is characterized by mental retardation as well 
as a number of physical abnormalities. The disease is caused by 
a dynamic mutation in a gene called FMR7 that encodes an RNA- 
binding protein that regulates the translation of certain mRNAs 
involved in neuronal development and/or synaptic function. A 
normal allele of this gene contains anywhere from about 5 to 55 
copies of a specific trinucleotide (CGG) that is repeated in a part 
of the gene that corresponds to the 59 noncoding portion of 
the messenger RNA (Figure 1). However, once the number of 


copies rises above about 60, the locus becomes very unstable 
and the copy number tends to increase rapidly into the thou- 
sands. Females with an FMR17 gene containing 60 to 200 copies of 
the triplet generally exhibit a normal phenotype but are carriers 
for the transmission of a highly unstable chromosome to their off- 
spring. If the repeat number in the offspring rises above about 
200, the individual is almost always mentally retarded. Unlike an 
abnormal HD allele, which causes disease as the result of a gain 
of function, an abnormal FMR7 allele causes disease as the result 
of a loss of function; FMR7 alleles containing an expanded CGG 
number are selectively inactivated so that the gene is not tran- 
scribed or translated. Although there is currently no effective 
treatment for any of the diseases caused by trinucleotide expan- 
sion, the risk of transmitting or possessing a mutant allele can be 
assessed through genetic screening. 


10.10 The Stability of the 
Genome: Duplication 


Because DNA is the genetic material, we tend to think of it as a con- 
servative molecule whose information content changes slowly over 
long periods of evolutionary time. In actual fact, the sequence organ- 
ization of the genome is capable of rapid change, not only from one 
generation to the next, but within the lifetime of a given individual. 


Whole-Genome Duplication 
(Polyploidization) 


As discussed in the first sections of this chapter, peas and fruit flies 
have pairs of homologous chromosomes in each of their cells. These 
cells are said to have a diploid number of chromosomes. If a person 
were to compare the number of chromosomes present in the cells of 
closely related organisms, especially higher plants, they would find 
that some species have a much greater number of chromosomes than 
a close relative. Among animals, the widely studied amphibian 
Xenopus laevis, for example, has twice the number of chromosomes as 
its cousin X. tropicalis. These types of discrepancies can be explained 
by a process known as polyploidization, or whole-genome 
duplication. Polyploidization is an event in which offspring are pro- 
duced that have twice the number of chromosomes in each cell as 
their diploid parents; the offspring have four homologues of each 
chromosome rather than two. Polyploidization is thought to occur in 
either of two ways: Two related species mate to form a hybrid organ- 
ism that contains the combined chromosomes from both parents, or, 
alternatively, a single-celled embryo undergoes chromosome duplica- 
tion but the duplicates, rather than being separated into separate cells, 
are retained in a single cell that develops into a viable embryo. The 
first mechanism occurs most often in plants, and the second most 
often in animals. Polyploidization is particularly common in flower- 
ing plants, including numerous crop species (e.g., wheat, bananas, 
and coffee) depicted in FIGURE 10.22. When polyploidization occurs 
in a plant lineage, the chromosome number suddenly doubles and, in 
most cases, tends to return back toward the original diploid number 
over an ensuing period of evolution. As a result, different modern 
plant species are caught at various stages in the evolutionary process 
of winnowing their gene number. This is the reason that the genomes 


FIGURE 10.22 A sampling of agricultural crops that are polyploid. 
Pictured are oil from oilseed rape, bread from bread wheat, rope from sisal, 
coffee beans, banana, cotton, potatoes, and maize. 

Source: From A. R. Leitch and |. J. Leitch, Science 320:481, 2008; © 2008, 
reprinted with permission from AAAS. 


of different plants tend to have a much greater variation in numbers 
of genes than that exhibited by different animals (see Figure 10.28). 
A “sudden” doubling of the number of chromosomes is a dra- 
matic event, one that gives an organism remarkable evolutionary 
potential—assuming it can survive the increased number of chro- 
mosomes and reproduce. Depending on the circumstances, poly- 
ploidization may result in the production of a new species that has a 
great deal of “extra” genetic information. Several different fates can 
befall extra copies of a gene; they can be lost by deletion, rendered 
inactive by deleterious mutations, or, most importantly, they can 
evolve into new genes that possess new functions. Viewed in this 
way, extra genetic information is the raw material for evolutionary 
diversification. In 1971, Susumu Ohno of the City of Hope Cancer 
Center in Los Angeles put forward the “2R” hypothesis, in which he 
proposed that the evolution of vertebrates from a much simpler 
invertebrate ancestor was made possible by two separate rounds of 
whole-genome duplication during an early evolutionary period. 
Ohno suggested that the thousands of extra genes that would be 
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selection into new genes that were required to encode the more 
complex vertebrate body. Over the past three and a half decades, 
Ohno’ proposal has been hotly debated as geneticists have tried to 
find evidence that either supports or refutes the notion. 

The problem facing genome analysts is the vast stretch of time 
that has passed—hundreds of millions of years—since the origin of 
our earliest vertebrate ancestors. Just as a river or sea slowly wears 
away the face of the Earth, chromosomal rearrangement and muta- 
tion slowly wear away the face of an ancestral genome. Even with the 
complete sequence of a number of invertebrate and vertebrate 
genomes in hand, it has still proven a formidable challenge to identify 
the origin of many of our genes. The strongest evidence for the 2R 
hypothesis comes from analysis of the amphioxus genome. Amphioxus 
lacks a backbone, which makes it an invertebrate, but it has a number 
of features (e.g., a notochord, dorsal tubular nerve cord, and segmen- 
tal body musculature) that clearly identify it as a member of the phy- 
lum Chordata to which vertebrates belong. The lineages leading to 
modern vertebrates and amphioxus are thought to have separated 
about 550 million years ago, yet the two groups share a remarkably 
similar collection of genes. However, when researchers looked more 
closely at certain groups of genes, the genomes of vertebrates typically 
contained four times the number of such genes when compared to the 
homologous sequences in the amphioxus genome. This finding pro- 
vides strong support for Ohnos hypothesis of two rounds of whole- 
genome duplication in the ancestral vertebrate lineage. 


Duplication and Modification 
of DNA Sequences 


Polyploidization is an extreme case of genome duplication and 
occurs only rarely during evolution. In contrast, gene duplica- 
tion, which refers to the duplication of a small portion of a single 
chromosome, happens with surprisingly high frequency, and its 
occurrence is readily documented by genome analysis.’ According 


3Actually, three categories of duplication can be distinguished: whole-genome, gene, 
and segmental duplication. The last category, which refers to the duplication of a large 
block of chromosomal material (from a few kilobases to hundreds of kilobases in 
length) is not discussed here, but has a significant impact in genome evolution. 
Approximately 5 percent of the present human genome consists of segmental duplica- 
tions that have arisen during the past 35 million years. 


- 


(a) 


(b) 


to one estimate, each gene in the genome has about a 1 percent 
chance of being duplicated every million years. Duplication of a gene 
can probably occur by several different mechanisms but is most 
often thought to be produced by a process of unequal crossing over, 
as depicted in FIGURE 10.23b. Unequal crossing over occurs when a 
pair of homologous chromosomes comes together during meiosis in 
such a way that they are not perfectly aligned. As a result of mis- 
alignment, genetic exchange between the homologues causes one 
chromosome to acquire an extra segment of DNA (a duplication) 
and the other chromosome to lose a DNA segment (a deletion). If 
the duplication of a particular sequence is repeated in subsequent 
generations, a cluster of tandemly repeated segments is generated at 
a localized site within that chromosome (see Figure 10.29). 

The vast majority of gene duplicates are either lost during evolu- 
tion through deletion or rendered nonfunctional by unfavorable 
mutation. However, in a small percentage of cases, the “extra” copy 
accumulates favorable mutations and acquires a new function. More 
often, both copies of the gene undergo mutation so that each evolves 
a more specialized function than that performed by the original 
gene. In either case, the two genes will have closely related sequences 
and encode similar polypeptides, which is to say that they encode 
different isoforms of a particular protein, such as o- and B-tubulin 
(Section 9.2). Subsequent duplications of one of the genes can lead to 
the formation of additional isoforms (e.g., y-tubulin), and so forth. It 
becomes apparent from this example that successive gene duplica- 
tion can generate families of genes that encode polypeptides with 
related amino acid sequences. The production of a multigene family 
is illustrated by the evolution of the globin genes. 


Evolution of Globin Genes 


Hemoglobin is a tetramer composed of four globin polypeptides (see 
Figure 2.40b). Examination of globin genes, whether from a mam- 
mal or a fish, reveals a characteristic organization. Each of these 
genes is constructed of three exons and two introns. Exons are parts 
of genes that code for amino acids in the encoded polypeptide, 
whereas introns do not; they are noncoding intervening sequences. 
The subject of exons and introns is discussed in detail in Section 11.6 
(see Figure 11.21). For the present purposes, we will simply use 
these gene parts as landmarks of evolution. Examination of genes 
encoding certain globin-like polypeptides, such as the plant protein 
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FIGURE 10.23 Unequal crossing over between duplicated genes provides a mechanism for generating changes in gene number. (a) The initial state 
shown has two related genes (1 and 2). In a diploid individual, gene 1 on one homologue can align with gene 2 on the other homologue during meiosis. If a 
crossover occurs during this misalignment, half the gametes will be missing gene 2 and half will have an extra gene 2. (b) As unequal crossing over continues 
to occur during meiotic divisions in subsequent generations, a tandemly repeated array of DNA sequences will gradually evolve. 


leghemoglobin and the muscle protein myoglobin, reveals the pres- 
ence of four exons and three introns. This is proposed to represent 
the ancestral form of the globin gene. It is thought that the modern 
globin polypeptide arose from the ancestral form as the result of the 
fusion of two of the globin exons (step 1, FIGURE 10.24) some 800 
million years ago. 

A number of primitive fish are known that have only one globin 
gene (step 2), suggesting that these fish diverged from other verte- 
brates prior to the first duplication of the globin gene (step 3). 
Following this duplication approximately 500 million years ago, the 
two copies diverged by mutation (step 4) to form two distinct globin 
types, an o type and a f type, located on a single chromosome. This 
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FIGURE 10.24 A pathway for the evolution of globin genes. Exons are 
shown in green, introns in brown. The evolutionary steps depicted in the 
diagram are discussed in the text. The arrangement of a- and B-globin 
genes on human chromosomes 16 and 11 (shown in purple without their 
introns in step 7) are the products of several hundred million years of 
evolution. As discussed in Chapter 2, hemoglobin molecules consist of two 
pairs of polypeptide chains—one pair is always a member of the a-globin 
subfamily, and the other pair is always a member of the B-globin subfamily. 
Specific combinations of a- and B-globins are found at different stages of 
development. The a and B-globin chains that are observed in embryonic, 
fetal, and adult hemoglobins are indicated. 


is the present arrangement in the amphibian Xenopus and in 
zebrafish. In subsequent steps, the o and B forms are thought to have 
become separated from one another by a process of rearrangement 
that moved them to separate chromosomes (step 5). Each gene then 
underwent subsequent duplications and divergence (step 6), gener- 
ating the arrangement of globin genes that exists today in humans 
(step 7). The evolution of vertebrate globin genes illustrates how 
gene duplication typically leads to the generation of a family of genes 
whose individual members have specialized functions (in this case, 
embryonic, fetal, and adult forms) when compared to the single 
founding gene. 

When the DNA sequences of globin gene clusters were ana- 
lyzed, researchers found “genes” whose sequences are homologous 
to those of functional globin genes, but which have accumulated 
severe mutations that render them nonfunctional. Genes of this 
type, which are evolutionary relics, are known as pseudogenes. 
Examples of pseudogenes are found in both the human o- and 
B-globin gene clusters of Figure 10.24. The human genome contains 
an estimated 11,000 pseudogenes. Although pseudogenes do not 
encode functional proteins, they can be transcribed into RNAs, 
which may have regulatory functions. Another point that is evident 
from examination of the two globin clusters of human chromosomes 
is how much of the DNA consists of noncoding sequences, either as 
introns within genes or as spacers between genes. In fact, the globin 
regions contain a much higher fraction of coding sequences than 
most other regions of the genome. 


REVIEW | 


1. Describe the course of evolutionary events that are 
thought to give rise to multiple-gene families, such as 
those that encode the globins. How could these 
events give rise to pseudogenes? How could they 
give rise to proteins having entirely different 
functions? 


e—_____ 


10.11 The Dynamic Nature of 
the Genome: “Jumping Genes” 


If one looks at repeated sequences that have arisen during the normal 
course of evolution, one finds that repeats are sometimes present in 
tandem arrays, sometimes present on two or a few chromosomes (as in 
the case of the globin genes of Figure 10.24), and sometimes dispersed 
throughout the genome. If we assume that all members of a family of 
repeated sequences arose from a single copy, then how can individual 
members become dispersed among different chromosomes? 

The first person to suggest that genetic elements were capable of 
moving around the genome was Barbara McClintock, a geneticist 
working with maize (corn) at the Cold Spring Harbor Laboratories 
in New York. Genetic traits in maize are often revealed as changes in 
the patterns and markings in leafand kernel coloration (FIGURE 10.25). 
In the late 1940s, McClintock found that certain mutations were very 
unstable, appearing and disappearing from one generation to the 
next or even during the lifetime of an individual plant. After several 
years of careful study, she concluded that certain genetic—elements 
were moving from one place in a chromosome to an entirely different 
site. She called this genetic rearrangement transposition, and the 
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FIGURE 10.25 Visible manifestations of transposition in maize. 
Kernels of corn are typically uniform in color. The spots on this kernel result 
from a mutation in a gene that codes for an enzyme involved in pigment 
production. Mutations of this type can be very unstable, arising or 
disappearing during the period in which a single kernel develops. These 
unstable mutations appear and disappear as the result of the movement of 
transposable elements into and out of these genes during the period of 
development. 


Source: Courtesy of Venkatesan Sundaresan, Cold Spring Harbor Laboratory. 


mobile genetic elements transposable elements. Meanwhile, 
molecular biologists working with bacteria were finding no evidence 
of “jumping genes.” In their studies, genes appeared as stable ele- 
ments situated in a linear array on the chromosome that remained 
constant from one individual to another and from one generation to 
the next. McClintock’s findings were largely ignored. 


Transposons 


In the late 1960s, several laboratories discovered that certain DNA 
sequences in bacteria moved on rare occasion from one place in the 
genome to another. These bacterial transposable elements were 
called transposons. Most transposons encode a protein, or trans- 
posase, that single-handedly catalyzes the excision of a transposon 
from a donor DNA site and its subsequent insertion at a target DNA 
site. This “cut-and-paste” mechanism is mediated by two separate 
transposase subunits that bind to inverted, repeated sequences at the 
two ends of the transposon (FIGURE 10.26, step 1). The two subunits 
then come together to form an active dimer (step 2) that catalyzes a 
series of reactions leading to the excision of the transposon (step 3). 
The transposase-transposon complex then binds to a target DNA 
(step 4) where the transposase catalyzes the reactions required to 
integrate the transposon into its new residence (step 5). Integration 
of the element typically creates a small duplication in the target DNA 
that flanks the transposed element at the site of insertion. Target site 
duplications serve as “footprints” to identify sites in the genome that 
are occupied by transposable elements. 

As originally demonstrated by McClintock, eukaryotic genomes 
contain large numbers of transposable elements. In fact, two-thirds 
of the human genome has been derived from transposable elements! 
The vast majority (> 99percent) of transposable elements are inca- 
pable of moving from one place to another; they have either been 
crippled by mutation or their movement is suppressed by the cell. 
(Suppression occurs by means of small cellular RNAs [Section 11.12] 
and DNA methylation [Section 12.4]). However, when transposable 
elements do change position, they insert widely throughout the 
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FIGURE 10.26 Transposition of a bacterial transposon by a “cut-and- 
paste” mechanism. As discussed in the text, the two ends of this bacterial 
Tn5 transposon are flanked by repeated sequences (orange segments). The 
two ends are brought together by the dimerization of a pair of subunits of 
the transposase (orange spheres). Both strands of the double helix are 
cleaved at each end, which excises the transposon as part of a complex 
with the transposase. The transposon-transposase complex is “captured” 
by a target DNA, and the transposon is inserted in such a way as to 
produce a small duplication that flanks the transposed element. (Note: Not 
all DNA transposons move by this mechanism.) 

Source: From D. R. Davies et al., Science 289:77, 2000; Copyright 2000, 
reprinted with permission from AAAS. 


target DNA. In fact, many transposable elements can insert them- 
selves within the center of a protein-coding gene. Several examples 
of such an occurrence have been documented in humans, including 
a number of cases of hemophilia caused by a mobile genetic element 
that had “jumped” into the middle of one of the key blood clotting 
genes. It is estimated that approximately 1 out of 500 disease-causing 
mutations in humans is the result of the insertion of a transposable 
element. In addition, the reactivation of transposable elements may 
contribute to the development of certain cancers. 

FIGURE 10.27 illustrates two major types of eukaryotic trans- 
posable elements, DNA transposons and retrotransposons, and 
their differing mechanism of transposition. As described above for 
prokaryotes, most eukaryotic DNA transposons are excised from the 
DNA at the donor site and inserted at a distant target site (Figure 
10.27a). This “cut-and-paste” mechanism is utilized, for example, by 
members of the mariner family of transposons, which are found 
throughout the plant and animal kingdoms. Retrotransposons, in 
contrast, operate by means of a “copy-and-paste” mechanism that 
involves an RNA intermediate (Figure 10.27b). The DNA of the 
transposable element is transcribed, producing an RNA, which is 
then “reverse transcribed” by an enzyme called reverse tran- 
scriptase, producing a complementary DNA. The DNA copy is 
made double-stranded and then integrated into a target DNA site. In 
most cases, the retrotransposon itself contains the sequence that 


Donor DNA with transposon 


Recipient DNA 


Recipient DNA with transposon 


1) \0 Wii DX NIN IN" VOC OC Cat WG 


| Excision of transposon 


WOE + BOG ; 


eor7 


(a) 


Donor DNA with retrotransposon 


A 
1 
1 

1 


- 


Recipient DNA 


+ 


DROVOCEBOOOCL 


Donor DNA after loss of 
transposon and rejoining of ends 


Recipient DNA with retrotransposon 


0 Wai TOIVOIRGDOOCON — VSG 


| Transcription by 
RNA polymerase 


WAV AV AV AV AW fh 


Reverse transcription to 
form single-stranded cDNA 


| Conversion to Z 
double-stranded DNA ue ° 


+ 


A 
1 
' 
t 
t 
I 

iT 

: Donor DNA with retrotransposon 


FIGURE 10.27 Schematic pathways in the movement of transposable elements. (a) DNA transposons move by a cut-and-paste pathway whose 
mechanism is depicted in Figure 10.26. Approximately 3 percent of the human genome consists of DNA transposons, none of which are capable of 
transposition (i.e., all are relics left in the genome as a result of ancestral activity). (b) Retrotransposons move by a copy-and-paste pathway. The steps 
involved in retrotransposition take place both in the nucleus and cytoplasm and require numerous proteins, including those of the host. More than 

40 percent of the human genome consists of retrotransposons, only a few of which (e.g., 40-100) are thought to be capable of transposition. More than 


one mechanism of retrotransposition is known. 


codes for a reverse transcriptase. Retroviruses, such as the virus 
responsible for AIDS, use a very similar mechanism to replicate their 
RNA genome and integrate a DNA copy into a host chromosome. 


The Role of Mobile Genetic Elements 
in Genome Evolution 


It was noted on page 386 that moderately repeated DNA sequences 
constitute a significant portion of eukaryotic genomes. Unlike the 
highly repeated fraction of the genome (satellite, minisatellite, and 
microsatellite DNA), whose sequences reside in tandem and arise by 
DNA duplication, most of the moderately repeated sequences of the 
genome are interspersed and arise by transposition of mobile genetic 
elements. In fact, the two most common families of moderately 
repeated sequences in human DNA—the Alu and L1 families—are 
retrotransposons. Recall from page 387 that there are two classes of 
interspersed elements, SINEs and LINEs. Alu is an example of the 
former and L1 is an example of the latter. A full-length, transposable 
L1 sequence (at least 6000 base pairs in length) encodes a unique 
protein with two catalytic activities: a reverse transcriptase that 
makes a DNA copy of the RNA that encoded it and an endonuclease 
that nicks the target DNA prior to insertion. The human genome is 
estimated to contain about 500,000 copies of L1, but the vast major- 
ity of these are incomplete, immobile elements. Even still, L1 mobil- 
ity continues to affect human evolution. In one study, for example, 
that compared the DNA sequences of 25 different people, any two 


individuals in the group differed in the presence or absence of an L1 
element, on average, at 285 sites in their respective genomes. 

Even more abundant than L1 are the Alu sequences, which are 
interspersed at more than one million different sites throughout the 
human genome. Alu is a family of short, related sequences about 300 
base pairs in length. The Alu sequence closely resembles that of the 
small RNA present in the signal recognition particles found in con- 
junction with membrane-bound ribosomes (Figure 8.12). It is pre- 
sumed that during the course of evolution, this cytoplasmic RNA 
was copied into a DNA sequence by reverse transcriptase and inte- 
grated into the genome. The tremendous amplification of the Alu 
sequence is thought to have occurred by retrotransposition using the 
reverse transcriptase and endonuclease encoded by L1 sequences. 

Given its prevalence in the human genome, one might expect the 
Alu sequence to be repeated in genomes throughout the rest of the 
animal kingdom, but this is not the case. Comparative genomic stud- 
ies indicate that the Alu sequence first appeared as a transposable ele- 
ment in the genome of primates about 60 million years ago and has 
been increasing in copy number ever since. The rate of Alu transposi- 
tion has slowed dramatically over the course of primate evolution to 
its current estimated rate in humans of approximately once in every 
200 births. These transposition events generate differences in the 
locations of Alu sequences from one person to another and thus con- 
tribute to the genetic diversity in the human population (Section 10.15). 

When something new like a transposable element is discovered 
in an organism, a biologist’s first question is usually: What is its 
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transposable elements are primarily “junk?” According to this view, a 
transposable element is a type of genetic parasite that can invade a 
host genome from the outside world, spread within that genome, 
and be transmitted to offspring—so long as it doesn't have serious 
adverse effects on the ability of the host to survive and reproduce. 
Even if this is the case, it doesn’t mean that transposable elements 
cannot make positive contributions to eukaryotic genomes. Keep in 
mind that evolution is an opportunistic process—there is no preset 
path to be followed. Regardless of its origin, once a DNA sequence is 
present in a genome, it has the potential to be “put to use” in some 
beneficial manner during the course of evolution. For this reason, 
the portion of the genome formed by transposable elements has 
been referred to as a “genetic scrap yard.” There are several ways that 
transposable elements appear to have been involved in adaptive 
evolution: 


1. Transposable elements can, on occasion, carry adjacent parts of 
the host genome with them as they move from one site to another. 
Theoretically, two unlinked segments of the host genome could 
be brought together to form a new, composite segment. This may 
be a primary mechanism in the evolution of proteins that are 
composed of domains derived from different ancestral genes (as 
in Figure 2.38). 


2. DNA sequences that were originally derived from transposable 
elements are found as parts of eukaryotic genes as well as parts of 
the DNA segments that regulate gene expression. For example, 
several transcription factors, which are the proteins that regulate 
gene expression, bind to sites in the DNA that arose originally 
from transposable elements. Even when there is no direct evi- 
dence of a function, many transposable elements are very similar 
in position and sequence to elements in the genomes of distant 
vertebrate relatives. This type of evolutionary conservation sug- 
gests that these sequences carry out some beneficial role in the 
lives of their hosts (Section 10.13). 


3. In some cases, transposable elements themselves appear to have 
given rise to genes. The enzyme telomerase, which plays a key 
role in replicating the DNA at the ends of chromosomes (see 
Figure 12.24c), may be derived from a reverse transcriptase 
encoded by an ancient retrotransposon. The enzymes involved in 
the rearrangement of antibody genes (see Figure 17.18) are 
thought to be derived from a transposase encoded by an ancient 
DNA transposon. If this is in fact the case, our ability to ward off 
infectious diseases is a direct consequence of transposition. 


4. A number of recent studies have found evidence that mamma- 
lian brain cells have a greatly elevated level of L1 retrotransposi- 
tion compared to that of other tissues. It is hypothesized that 
these mobile elements, which insert at different sites in the 
genomes of neurons, contribute to functional differences in the 
activities of brain cells. 


One point is clear: Transposition has had a profound impact on 
the genetic composition of organisms. 

It is interesting to note that, only a few decades ago, molecular 
biologists considered the genome to be a stable repository of genetic 
information. Now it seems remarkable that organisms can maintain 
themselves from one day to the next in the face of this large-scale 
disruption by genetic rearrangement. In hindsight, it is not surpris- 
ing that transposition was first discovered in plants, because trans- 
posable elements tend to be much more active in plants than they are 


in other eukaryotes. For her discovery of transposition, Barbara 
McClintock was the sole recipient of a Nobel Prize in 1983, at age 81, 
approximately 35 years after her initial report. 
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1. Describe two mechanisms by which genetic elements 
are able to move from one site in the genome to 
another. 


2. Describe the impact that transposable elements have 
had on the structure of the human genome over the 
past 50 million years. 
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10.12 Sequencing Genomes: 
The Footprints of Biological 
Evolution 


Determining the nucleotide sequence of all the DNA in a genome is 
a formidable task. During the 1980s and 1990s, the technology to 
accomplish this effort gradually improved as researchers developed 
new vectors to clone large segments of DNA and increasingly auto- 
mated procedures to determine the nucleotide sequences of these 
large fragments (discussed in Section 18.22). The first complete 
sequence of a prokaryotic organism was reported in 1995, and the 
first complete sequence of a eukaryote, the budding yeast S. cerevisiae, 
was published the following year. Over the next few years—as the 
scientific community awaited the results of work on the human 
genome—the genomic sequences of numerous prokaryotic and 
eukaryotic organisms (including a fruit fly, nematode, and flowering 
plant) were reported. Researchers were able to sequence these 
genomes with relative speed because they are considerably smaller 
than the human genome, which contains approximately 3.2 billion 
base pairs. To put this number into perspective, if each base pair in 
the DNA were equivalent to a single letter on this page, the informa- 
tion contained in the human genome would produce a book approx- 
imately 1 million pages long. 

By 2001, the rough draft of the nucleotide sequence of the human 
genome had been published. The sequence was described as “rough” 
because each segment was sequenced an average of about four times, 
which is not enough to ensure complete accuracy, and many regions 
that proved difficult to sequence were excluded. The first attempts 
to annotate the human genomic sequence, that is, to interpret the 
sequence in terms of the numbers and types of genes it encoded, led 
to a striking observation concerning gene number. Researchers con- 
cluded that the human genome probably contained in the neighbor- 
hood of 30,000 protein-coding genes. Until it had been sequenced, it 
had been widely assumed that the human genome contained at least 
50,000 and maybe as many as 150,000 different genes. 

The “finished” version of the human genome sequence was 
reported in 2004, which meant that (1) each site had been sequenced 
7 to 10 times to ensure a very high degree of accuracy (at least 99.99 
percent), and (2) the sequence contained a minimal number of gaps. 
The gaps that persisted contain regions of the chromosomes—often 
referred to as “dark matter”—that consist largely of long stretches of 
highly repeated DNA, especially those in and around the centro- 
meres of each chromosome. Despite exhaustive efforts, these regions 


have proven impossible to clone, or their sequences have proven 
impossible to order correctly using current technology. 

Despite this remarkable achievement in nucleotide sequencing, 
the actual number of protein-coding genes in the human genome 
remains uncertain. Identification of genes using various computer 
software programs (algorithms) has been plagued with difficulties, 
and, in fact, the earlier estimate of 30,000 protein-coding human 
genes has been revised steadily downward. Although it has come as 
a shock to most biologists, current estimates place the number in the 
neighborhood of 21,000! This means that humans have roughly the 
same number of protein-coding genes as a microscopic worm, 
whose entire body—nervous system and all—consists of approxi- 
mately 1000 cells (FIGURE 10.28).* 

It is evident from this data that it is not possible, as was once 
thought, to understand the nature of an organism by simply know- 
ing a list of the genes that make up that organism's genome. If the 
differences in complexity between organisms cannot be explained by 
the number of protein-coding genes in their genomes, how can they 
be explained? We really don't have a very good answer to that ques- 
tion, but we can list a few possibilities to consider. 


1. As will be described in Chapter 12, a single gene can encode a 
number of related proteins as a result of a process termed alterna- 
tive splicing (see Section 12.18). Several recent studies suggest 
that over 90 percent of human genes might engage in alternative 
splicing, so that the actual number of proteins encoded by the 
human genome is at least several times greater than the number 
of genes it contains. It is likely that greater differences between 
organisms will emerge when this and other “gene-enhancing” 
mechanisms are explored further. This expectation is consistent 
with the observations that vertebrate genes tend to be more com- 
plex (i.e., contain more exons) than those of flies and worms and 
exhibit a higher incidence of alternative splicing. 


2. Molecular biologists have devoted an enormous effort to study- 
ing the mechanisms that regulate gene expression. Despite this 
effort, our understanding of these mechanisms is very limited. 
We have learned, for example, that over 70 percent of the genome 
is transcribed into a bewildering array of RNAs, but we have very 
little information as to what most of these RNAs are doing within 
the cell. There is a growing belief that many of these RNAs have a 
gene regulatory function. There is also growing evidence that the 
number and complexity of these noncoding RNAs can be corre- 
lated with the levels of complexity of diverse organisms. In one 
study, for example, researchers sought to identify the number of 
microRNAs produced by various organisms. As discussed in the 
next chapter, microRNAs are one of the best studied regulatory 
RNAs. Researchers found that sponges expressed about 10 differ- 
ent microRNAs, and sea anemones approximately 40. This com- 
pares to approximately 150 identified in worms and fruit flies 
and at least 1000 in humans. This is not to imply that the number 


“It can also be seen from Figure 10.28 that there is very little correlation between the 
number of protein-coding genes and the total amount of DNA in the genome. The 
puffer fish, for example, which has about the same number of genes as other verte- 
brates, has a genome that is about one-eighth the size of its human counterpart. The 
ancestors of puffer fish, like most bony fish, are thought to have had typical verte- 
brate-sized genomes, which demonstrates that “excess” DNA can be lost in a lineage 
over evolutionary time. At the other end of the spectrum from the puffer fish, the 
genome of certain salamanders is roughly 30 times the size of the human genome. 
The contrasts in genome size among vertebrates reflects a striking difference in the 
content of noncoding, largely repetitive DNA. The evolutionary significance of these 
differences is unclear. 
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FIGURE 10.28 Genome comparisons. Among eukaryotes whose 
genomes have been sequenced, the number of protein-coding genes (blue 
bars) varies from about 6,200 in yeast to 57,000 in apples, with vertebrates 
thought to possess about 20,000. (The high gene number in an apple 
reflects a relatively recent whole-genome duplication.) It is particularly 
interesting to note that apparent increases in the complexity of organisms 
are not reflected in dramatic increases in gene numbers. For example, 
neither (1) the transition from single-celled eukaryotes, such as 
Clamydomonas, to the simplest multicellular animals, sponges, nor (2) the 
transition from invertebrates to vertebrates, is accompanied by major 
changes in the number of protein-coding genes. Whereas the number of 
estimated protein-coding genes varies over a modest range among 
eukaryotes, the amount of DNA in a genome (red bars) varies widely, 
reaching values of 90 billion base pairs in some salamanders (the actual 
gene number for these amphibians is unknown). 


of microRNAs is the primary determinant of morphological 
complexity. Rather, it suggests that we will have to learn a great 
deal more about gene regulation if we are going to understand 
the basis of biological diversity. 


3. Over the past decade or so, a new area of biological study (called 
systems biology) has emerged that focuses on the ways that pro- 
teins work together as complex networks rather than as individ- 
ual actors. A very simple example of a protein network was 
presented in Figure 2.48. Given that cells produce thousands of 
different proteins, with varying degrees of interaction, these net- 
works can become very complex and dynamic. A relatively small 
increase in the number of elements that make up a network, or an 
increase in the size and complexity of those elements, could dra- 
matically increase the complexity of the network and thus the 
complexity of the entire organism. 


Numerous other factors could be added to this discussion, but the 
general point is clear: The apparent difference in complexity between 
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amount of genetic information in an organism's genome than that of 
the manner in which that information is put to use. 
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1. How many genes are contained In the human 
genome? Why did researchers find this number so 
surprising? 

2. What additional factors outside of gene number 


might contribute to the complexity of organisms? 
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10.13 Comparative Genomics: 
“If It’s Conserved, It Must 
Be Important” 


Consider the following facts: (1) The majority of the genome con- 
sists of DNA that resides between the genes, and thus represents 
intergenic DNA, and (2) each of the roughly 21,000 or so protein- 
coding human genes consists largely of noncoding portions (intronic 
DNA). Taken together, these facts indicate that the protein-coding 
portion of the genome represents a remarkably small percentage of 
the total DNA (estimated at about 1.5 percent). Most of the inter- 
genic and intronic DNA of the genome does not contribute to the 
survival and reproductive capability of an individual, so there is no 
selective pressure to maintain its sequence unchanged. As a result, 
most intergenic and intronic sequences tend to change rapidly as 
organisms evolve. In other words, these sequences tend not to be 
conserved. In contrast, those segments of the genome that encode 
protein sequences or contain regulatory sequences that control gene 
expression (see Figure 12.44) are subject to natural selection. Natural 
selection tends to eliminate individuals whose genome contains 
mutations in these functional elements.” As a result, these sequences 
tend to be conserved. It follows from these comments that the best 
way to identify functional sequences is to compare the genomes of 
different types of organisms. 

Despite the fact that humans and mice have not shared a com- 
mon ancestor for about 75 million years, the two species share simi- 
lar genes, which tend to be clustered in a remarkably similar pattern. 
For example, the number and order of human globin genes depicted 
in Figure 10.24 is basically similar to that found on a comparable 
segment of DNA in the mouse genome. As a result, it becomes pos- 
sible to align corresponding regions of the mouse and human 
genomes. Human chromosome number 12, for example, is com- 
posed of a series of segments, in which each segment corresponds 


5One can recognize two opposing sides of natural selection. Negative or purifying 
selection acts to maintain (conserve) sequences with important functions as they are 
because changes reduce the likelihood that the individual will survive and reproduce. 
These regions of the genome evolve more slowly than nonfunctional sequences whose 
change has no effect on the fitness of the individual and are not subject to natural 
selection (such sequences are said to evolve neutrally). In contrast, positive or 
Darwinian selection promotes speciation and evolution by selecting for changes in 
sequence that cause individuals to be better adapted to their environment and thus 
more likely to survive and reproduce. These regions evolve more rapidly than non- 
functional segments. 


roughly to a block of DNA from a mouse chromosome. The number 
of the mouse chromosome containing each block is indicated. 


Human chromosome 12 


Even a quick examination of this drawing of a human chromo- 
some reveals the dramatic changes in chromosome structure that occur 
as evolution generates new species over time. Blocks of genes that are 
present on the same chromosome in one species can become separated 
into different chromosomes in a subsequent species. Over time, chro- 
mosome numbers can increase or decrease as chromosomes split apart 
or fuse together. One estimate suggests that approximately 180 break- 
age and fusion events have taken place in the mouse and human line- 
ages since the time these two present-day mammals last shared a 
common ancestor. Taken by themselves, changes in the positions of 
genes have very little effect on the phenotypes of the organisms, but 
they provide a clear visual footprint of the evolutionary process. 

When homologous segments of the mouse and human genome 
are aligned on the basis of their nucleotide sequences, it is found that 
approximately 5 percent of DNA sequences are highly conserved 
between the two species. This is a considerably higher percentage 
than would have been expected by combining the known protein- 
coding regions and gene regulatory regions (together accounting for 
roughly 2 percent of the genome). If we adhere to one of the fore- 
most principles of molecular evolution: “if it’s conserved, it must be 
important,’ then these studies are telling us that parts of the genome 
that were presumed to consist of “useless” sequences actually have 
important, unidentified functions. Some of these regions undoubt- 
edly encode RNAs that have various regulatory functions (discussed 
in Sections 11.12 and 11.13). Others likely have “chromosomal func- 
tions” rather than “genetic functions.’ For example, these conserved 
sequences could be important for chromosome pairing prior to cell 
division. Whatever the function, these genomic elements are often 
located at great distances from the closest gene and include some of 
the most conserved sequences ever discovered, exhibiting virtual 
identity between human, rat, and mouse genomes. 

By comparing stretches of the genome between two distantly 
related species, such as the human and mouse, one can identify those 
regions that have been conserved for tens of millions of years. 
However, this approach won't identify functional sequences in the 
human genome that have a more recent evolutionary origin. This 
could include, for example, genes that are present in humans and 
absent from mice, or regulatory regions that have changed their 
sequence over the course of primate evolution to allow them to bind 
new regulatory proteins. A concerted effort (called the ENCODE 
Project) is underway to identify all of the functional elements present 
in the human genome. Unfortunately, we lack the knowledge neces- 
sary to recognize many of these functional elements, which obviously 
confounds the task. One point has become clear from recent studies: 
A significant proportion of functional DNA sequences are constantly 
evolving and thus are not highly conserved. In other words, if we 
restrict our search to conserved sequences, we run the risk of missing 
many of the most important functional elements in the genome. 
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1. What proportion of the human genome is known to 
encode protein? What functions might conserved 
non-coding regions serve? 


eo 


10.14 The Genetic Basis 
of “Being Human” 


By focusing on conserved sequences, we tend to learn about charac- 
teristics that we share with other organisms. If we want to better 
understand our own unique biological evolution, we have to look 
more closely at those parts of the genome that distinguish us from 
other organisms. Chimpanzees are our closest living relative, having 
shared a common ancestor as recently as 5-7 million years ago. It 
was thought that a detailed analysis of the differences in DNA 
sequence and gene organization between chimpanzees and humans 
might tell us a great deal about the genetic basis for recently evolved 
features that make us uniquely human, such as our upright walk and 
advanced use of tools and language. These latter characteristics can 
be traced to our brain, which has a volume of about 1300 cm*— 
nearly four times that of a chimpanzee. 

The rough draft of the chimpanzee genome was reported in 
2005. Overall, the chimpanzee and human genomes differ by roughly 
4 percent, which amounts to tens of millions of differences—a level 
of divergence that is considerably greater than expected from pre- 
liminary studies. While some of this divergence is due to single 
nucleotide changes between the two genomes, most is attributed to 
larger differences, such as deletions and segmental duplications (see 
footnote, page 390). 

Researchers have been able to identify hundreds of genes in the 
human lineage that are evolving more quickly than the background 
(or neutral) rate, presumably in response to natural selection. 
However, it remains unclear which, if any, of these genes contribute 
to “making us human.” Some of the fastest evolving genes encode 
proteins involved in regulation of gene expression (i.e., transcription 
factors). These are precisely the types of genes that would be expected 
to generate major phenotypic differences because they can affect the 
expression of large numbers of other genes. In fact, differences 
between chimp and human transcription factors are presumably 
responsible for many of the differences in expression of diverse brain 
proteins. This can be illustrated by closer examination of a brain- 
specific transcription factor called FOXP2. 

A comparison of the FOXP2 protein between humans and 
chimps shows two amino acid differences that have appeared in the 
human lineage since the time of separation from our last common 
ancestor. To assess the effects of these amino acid substitutions on 
FOXP2 function, human neuronal cells that lacked their own FOXP2 
gene were engineered to express either the chimp or human version 
of the gene. These two cell populations were then studied in culture 
to assess the effects of the alternate versions of the transcription fac- 
tor. It was found that more than 100 target genes were significantly 
upregulated or downregulated in cells expressing the human FOXP2 
gene when compared to cells expressing the chimp version of the 
transcription factor. What makes the FOXP2 gene so interesting is 
that persons with mutations in the gene suffer from a severe speech 
and language disorder. Among other deficits, persons with this dis- 
order are unable to perform the fine muscular movements of the lips 
and tongue that are required to engage in vocal communication. 
Calculations suggest that the changes in this “speech gene” that dis- 
tinguish it from the chimp version became “fixed” in the human 
genome within the past 120,000 to 200,000 years, a time when mod- 
ern humans are thought to have emerged. (An alteration in DNA 
sequence is said to be fixed if it is now present in virtually all mem- 
bers of the species.) These findings suggest that changes in the 


FOXP2 gene may have played an important role in the evolution and 397 


development of human speech. 

Many other genomic differences between chimps and humans 
have also been described, including alterations in certain genes that 
appear to affect brain development. One of these genes, called 
SRGAP2, is known to have been duplicated in humans but not in 
chimps or other primates. In addition to a full-length parental ver- 
sion of SRGAP2, humans have three additional truncated copies, 
resulting from partial gene duplication events that occurred between 
1 and 3.5 million years ago. Experiments using mouse and cell cul- 
ture models have shown that the expression of one of these trun- 
cated duplicates, called SRGAP2C, is able to dimerize with the 
parental SRGAP2 protein and turn off its activity. As a result, neu- 
rons showed a dramatically increased number of dendritic spines (a 
part of a neuron that receives messages from surrounding neurons), 
which may have contributed to the size of the human brain relative 
to that of our primate relatives. 

Another gene of interest encodes the starch-digesting enzyme 
amylase that is present in saliva. Chimps have a relatively low-starch 
diet and have one copy of the AMYI gene in their genome 
(Figure 10.29a). During the evolution of humans, there has been an 
apparent selection for an increased number of copies of the AMY1 
gene, which has resulted in an increased concentration of the amylase 
enzyme in human saliva. One might have predicted that an increased 
amylase level would have been accomplished during evolution by an 
increase in the level of expression of the single existing AMY1 gene in 
the genome, but in this particular case it has resulted from gene dupli- 
cation. As seen in FIGURE 10.29b and discussed in the next section, 
the number of copies of the AMY1 gene among humans is variable. In 
fact, the number of copies of this gene tends to be higher in human 
populations that ingest greater quantities of starch in their diet. 

One of the most interesting questions in the area of human evo- 
lution concerns possible relationships between modern humans (c.e., 
Homo sapiens) and those of other “human.” species (i-e., archaic mem- 
bers of the Homo genus that are now extinct). Humans that would be 
indistinguishable from our present species, that is, anatomically mod- 
ern humans, are thought to have first evolved in Africa approximately 
200,000 years ago. But Homo sapiens are not the only members of the 
Homo genus to have inhabited Earth in the past 200,000 years. In fact, 
two other members of the genus, the Neanderthals and the more 
recently discovered Denisovans were residents of Europe as recently 
as 35,000 years ago, thousands of years after the arrival of modern 
humans on the continent. The three species are thought to have 
shared some of the same regions in the Middle East at an even earlier 
time period. Given that these species inhabited similar areas and were 
very similar anatomically, paleontologists have wondered whether 
(1) the different species had interbred or (2) modern humans had 
simply replaced other species without interbreeding. An answer to 
that question requires detailed information about the differences in 
DNA sequences between modern and archaic humans. 

Beginning in the late 1990s, researchers have developed increas- 
ingly sophisticated techniques to isolate and sequence snippets of 
mitochondrial DNA (mtDNA) from fossil remains of Neanderthals. 
Mitochondrial DNA is easier to analyze from fossils than is nuclear 
DNA because each cell contains many copies of the mitochondrial 
genome, and it is much smaller in size. Results of these studies sug- 
gested that the Neanderthal mtDNA sequence was sufficiently dif- 
ferent from that of modern human mtDNA to conclude that 
Neanderthals went extinct without contributing genetic material to 
the modern human mitochondrial genome. In other words, the two 
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FIGURE 10.29 Duplication of the amylase gene during human 
evolution. In the results depicted here, a pair of homologous chromo- 
somes from a chimp (a) or a human (b) have been hybridized to red and 
green colored probes that bind to different portions of the AMY1 gene. 
This gene encodes the starch-digesting enzyme salivary amylase. The 
number of copies of the AMY1 gene on each chromosome is revealed by 
the number of times the fluorescent probes are repeated. The chimp has 
one copy of the AMY1 gene on each chromosome (i.e., one copy per 
genome), whereas humans have a number of copies. The number of copies 
of the gene varies within the human genome as illustrated here by the fact 
that one of the two homologous chromosomes from this individual has 4 
copies of the gene and the other homologous chromosome has 10 copies. 
This is an example of a copy number variation (page 399). Moreover, the 
number of copies of the AMY1 gene in the genomes of a given human 
population tends to correlate with the amount of starch in the diet of that 
population. This correlation strongly suggests that the copy number of the 
AMY1 gene has been influenced by natural selection. 

Source: From George H. Perry, et al., courtesy of Nathaniel J. Dominy, 
Nature Gen. 39:1257, 2007. Reprinted by permission from Macmillan 
Publishers Ltd. 


species had not interbred and, moreover, had not shared a common 
ancestor for at least 300,000 years. 

By 2010, Svante Paabo and colleagues at the Max Planck Institute 
in Germany had assembled a relatively complete sequence of the 
Neanderthal nuclear genome and concluded that modern humans 
and Neanderthal genomes are 99.84 percent identical. Neanderthals, 
for example, had the same FOXP2 allele (discussed above) as do 
modern humans, which hinted that they too had verbal language 
skills. Moreover, a detailed comparison of the nuclear genomes from 
a number of different Neanderthal fossils and those of several 
present-day humans suggests that approximately 1 to 4 percent of the 
DNA of modern Europeans and Asians is derived from a Neanderthal 
source. Many of the genes in the Neanderthal-derived DNA contrib- 
ute to the immune system’s recognition of pathogens. Acquisition of 
these genes may have served to protect the newly arrived humans 
from diseases to which the Neanderthals had previously been 
exposed. In contrast, the genomes of Africans show no evidence of 


a contribution by Neanderthals. According to this data, modern 
humans and Neanderthals interbred at some point in time after 
modern humans had left Africa and before they had spread into 
Europe and Asia. Recent data from the full genome of additional 
Neanderthals and a Denisovan have confirmed the occurrence of a 
low level of interbreeding among all three Homo species during the 
Late Pleistocene era. These findings have set the stage for some very 
interesting speculation concerning the human phylogenetic tree. 
According to some researchers who envision a number of human 
species sharing the planet over the past 100,000 years or so, our pre- 
sent population can be described as “the last humans standing” 

We have focused in this section on genomic differences between 
humans and other primates because this is the subject of the present 
chapter. It is probably evident from the limited nature of this discussion 
that researchers have not yet made much progress in identifying the 
genetic basis of “being human.” Many researchers believe that too much 
attention in this field has been focused on changes in protein-coding 
sequences. Instead, they would argue that changes in the regulation of 
gene expression have played a primary role in human evolution, but it 
is difficult to ascertain which of these changes have real importance. 
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1. How similar is the human genome to the chimpanzee 
genome? Provide examples of genetic changes that 
may have led to changes in phenotype between 


humans and chimps. 
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10.15 Genetic Variation within 
the Human Species Population 


No one else in the world looks exactly like you because no one else, 
other than an identical twin, has the same DNA sequence throughout 
his or her genome. The human genome that was sequenced by the orig- 
inal Human Genome Project was derived largely from a single human 
male. Since the completion of that sequence, a great deal of attention 
has been focused on how DNA sequence varies within the human 
population. Genetic polymorphisms are sites in the genome that 
vary among different individuals. The term usually refers to a genetic 
variant that occurs in at least 1 percent of a species population. The 
concept of genetic polymorphisms began with the discovery in 1900 by 
an Austrian physician, Karl Landsteiner, that people could have at least 
three alternate types of blood, A, B, or O. As discussed in Figure 4.12, 
the ABO blood group is a result of members of the population having 
different alleles of a gene encoding a sugar-transferring enzyme. 


DNA Sequence Variation 


The most common type of genetic variability in humans occurs at 
sites in the genome where single nucleotide differences are found 
among different members of the population. When present in at least 
1 percent of the population, these sites are called single nucleotide 
polymorphisms (SNPs). The vast majority of SNPs (pronounced 
“snips”) occur as two alternate alleles, such as A or G. Most SNPs are 
thought to have arisen by mutation only once during the course 
of human evolution and are shared by individuals through com- 
mon descent. Commonly occurring SNPs have been identified by 


comparing DNA sequences from the genomes of hundreds of differ- 
ent individuals from diverse ethnic populations. On average, two 
randomly selected human genomes have about 3 million single 
nucleotide differences between them, or one every thousand base 
pairs. Although this might appear to be a large number, it means that 
humans are, on average, 99.9 percent similar to one another with 
respect to nucleotide sequence. 

Recent large-scale projects utilizing next-generation sequencing 
methods, such as the 1000 Genomes project (which has publicly 
released the full genomes of 1092 individuals from different geo- 
graphic populations), have reported a surprisingly high number of 
rare single-nucleotide variants (SNVs) that occur in less than 
0.5 percent of the population. Current estimates suggest that each 
person harbors over 100 rare single nucleotide variants in his or her 
exome (the portion of the genome that codes for proteins). Researchers 
have hypothesized that the large number of SNVs observed in mod- 
ern human populations is due to explosive population growth— 
around three orders of magnitude over 400 generations—that is so 
recent that natural selection has not had time to “weed out” deleteri- 
ous mutations. It is this large body of genetic variation that is thought 
to be largely responsible for the phenotypic variability and vulnera- 
bility to disease that is observed among humans. The impact of 
human genome sequence variation on the practice of medicine is 
discussed in the accompanying Human Perspective. 


Structural Variation 


As illustrated in FIGURE 10.30a, segments of the genome can change 
as the result of duplications, deletions, insertions, inversions (when 
a piece of DNA exists in a reversed orientation), and other events. 


Normal gene order 


These types of changes typically involve segments of DNA ranging 399 


from hundreds to millions of base pairs in length. Because of their 
relatively large size, these types of polymorphisms are called struc- 
tural variants, and we are just beginning to grasp the extent and 
importance of their presence. 

It has been known since the early days of microscopic chromo- 
some analysis that not all healthy people have identical chromo- 
some structures. Figure 10.30b shows an example of a common 
chromosome inversion whose existence has been known for over 
30 years. Recent studies have revealed that intermediate-sized 
structural variants—those that are too small to be seen under the 
microscope and too large to be readily detected by conventional 
sequence analysis—are much more common than previously 
thought. According to one estimate, a typical human genome carries 
approximately 1000 structural variants, ranging in length from 
about 500 bases to 1.3 million bases (Mb). Although the estimates 
vary considerably, it is evident that the fraction of the genome (i.e., 
total number of base pairs) affected by structural variation is greater 
than that affected by SNPs. As a result, structural variation has a 
significant effect on phenotypic variation among humans. 


Copy Number Variation 


As discussed in Figure 10.19, the technique of DNA fingerprinting 
depends on differences in the lengths of minisatellite sequences, 
which in turn depend on the number of copies of the sequence that 
are present at particular sites in the chromosomes. This is an example 
of a copy number variation (or CNV). Recent findings have revealed 
that much larger-sized CNVs (> 1 kb) are also common in the human 
population, affecting approximately 10-15 percent of the genome, 


fff o-oo 


Gene duplication (leads to copy number polymorphism) 


Deletion 


—f-E-f-e—-o—e—c-e— i 


Inversion 


i-to-i 


Insertion 
(a) 


(b) 


FIGURE 10.30 Structural variants. (a) Schematic representation of the major types of genomic polymorphisms that involve a significant segment of a 
chromosome (e.g., thousands of base pairs). Most of these polymorphisms are too small to be detected by microscopic examination of chromosomes but 
are detected when the number and/or chromosomal location of individual genes are determined. (b) On the left is a normal human chromosome #9, and on 
the right is a chromosome #9 containing a large inversion that includes the chromosome’s centromere (arrow). This inversion, which is clearly visible 


microscopically, is present in 1-3 percent of the population. 


Source: From Charles Lee, Nature Gen. 37:661,2005. Reprinted by permission from Macmillan Publishers Limited. 
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CNVs are a type of structural variation that results from a duplication 
or deletion of a DNA segment. Because of such CNVs, many of us 
carry extra copies of one or more genes that encode important physi- 
ological proteins. Extra copies of genes are generally associated with 
overproduction of a protein, which can disrupt the delicate biochem- 
ical balance that exists within a cell. It is found, for example, that a 
significant number of persons who develop early-onset Alzheimer’s 
disease possess extra copies of the APP gene, which encodes the pro- 
tein thought to be responsible for the disease (Section 2.13). Although 
the subject has been controversial, there is a growing consensus 
among researchers that individuals with autism or schizophrenia 
have a greater-than-normal number of rare CNVs in their genomes. 
Some of the genes affected by these CNVs are involved in synapse 
formation and other neurological processes. In some cases, extra 


copies of a gene can be beneficial, as illustrated by the repeated dupli- 
cation of the amylase gene (AMY1), which has been found in certain 
human populations that have a high starch content in their diet 
(Figure 10.29b). Figure 10.29b also provides an excellent example of 
a CNV in that it shows the number of AMY] genes on both copies of 
one person's homologous chromosomes. It can be seen that one chro- 
mosome contains only four copies of the AMY1 gene, whereas the 
homologous chromosome contains 10 copies of the gene. 


REVIEW | 


1. Describe three different types of genetic 
polymorphisms in humans. Which is the 
most common? 


10.16 THE HUMAN PERSPECTIVE 


Application of Genomic Analyses to Medicine 


Over the past several decades, more than a thousand genes have 
been identified as the cause of rare, heritable diseases. In the 
vast majority of cases, the gene responsible for the disease being 
investigated was identified through traditional genetic linkage 
studies. Such studies begin by locating a number of families 
whose members exhibit a disproportionately high frequency of 
a particular disease. The initial challenge is to determine which 
altered region of the genome is shared by all affected members 
of the family. Once a region linked to the disease gene is iden- 
tified, the DNA of that segment can be isolated and the mutant 
gene pinpointed. This type of genetic approach is well suited 
for the identification of genes that have very high penetrance, 
that is, genes whose mutant form virtually guarantees that the 
individual will be afflicted with the disorder. The mutated gene 
responsible for Huntington's disease (Section 10.9), for example, 
exhibits very high penetrance. Furthermore, no other gene defect 
in the genome causes this disease. Genes that exhibit very high 
penetrance are said to follow Mendelian inheritance, and more 
than 3000 such genes have been identified. 

Most of the common diseases that afflict our species, such 
as cancer, heart failure, Alzheimer’s disease, diabetes, asthma, 
autism, depression, and various age-related diseases, have a 
genetic component, which is to say that they show at least some 
tendency to run in families. But unlike inherited disorders such 
as Huntington's disease, there is no single gene that is clearly 
linked with the condition. Instead, numerous genes are likely to 
affect the disease risk, with each of them making only a small con- 
tribution to the overall likelihood of developing the disorder. In 
addition, the development of the disorder is often influenced by 
nongenetic (i.e., environmental) factors. The likelihood of devel- 
oping diabetes, for example, is greatly increased in individuals 
who are seriously overweight, and the likelihood of developing 
lung cancer is greatly increased by smoking. One of the goals 
of the medical research community is to identify genes that con- 
tribute to the development of these common, but genetically 
complex, diseases. 


Because of their low penetrance, most genes that increase 
the risk of developing common diseases cannot be identified 
through family linkage studies.* Instead, such genes are best 
identified by analyzing disease occurrence in large populations. 
To carry out this type of investigation, researchers compare the 
genotypes of individuals who have the disease in question with 
individuals of a similar ethnic background who are not afflicted. 
The goal is to discover an association (or correlation) between a 
particular condition, such as diabetes, and common genetic poly- 
morphisms. The potential value of this approach is illustrated by 
the discovery in the early 1990s of a strong association between 
one common allele of the gene encoding the lipoprotein ApoE 
and the likelihood of developing Alzheimer's disease. It was 
found in these studies that individuals who have at least one copy 
of the APOE4 allele of the gene are much more likely to develop 
this disabling neurodegenerative disease than persons lacking 
the allele. These findings have opened a major branch of investi- 
gation into the relationship between cholesterol metabolism, car- 
diovascular health, and Alzheimer’s disease. 

As the name suggests, genome-wide association studies 
(GWASs) look for associations between a disease state and poly- 
morphisms that may be located anywhere in the genome. This 
requires determining the nucleotide sequence of large parts of 
the genome of all the subjects participating in the study. GWASs 
are predicated on the premise that common diseases are caused 
by common variants—that is, variants present in 1 percent or 
more of the population. If a disease-causing variant is present at 
a lower frequency, it will not be detected in these types of studies. 
As noted earlier, the most common types of genetic variability 


*Not all genes involved in common diseases have a low penetrance. 
Alzheimer's disease, for example, occurs with high frequency in persons 
having certain APP alleles (Section 2.13) and breast cancer occurs with high 
frequency in persons having certain BRCA alleles (Section 16.8). But these 
types of rare, high-penetrance polymorphisms are not major factors in the 
development of the large majority of cases for either of these diseases. 


in humans are single nucleotide differences (SNPs), which are 
spread almost uniformly throughout the genome. It is likely that 
many of the SNPs that change the coding specificity of a gene, 
or the regulation of a gene’s expression, play an important role in 
our susceptibility to complex diseases. As the cost of genotyp- 
ing human DNA samples has decreased dramatically, researchers 
have begun scanning the genomes of large numbers of people 
to identify SNPs that occur more often in people afflicted with a 
particular disease than in healthy individuals. Even if the identi- 
fied SNPs are not directly responsible for the disease, they serve 
as genetic markers for a nearby locus that might be responsible. 
These principles are well illustrated by a landmark genome-wide 
association study published in 2005 on age-related macular 
degeneration (AMD), which is the leading cause of blindness in 
the elderly. Initially, the researchers compared 96 AMD cases with 
50 controls, looking for an association of the disease with over 
100,000 SNPs that had been genotyped in the subjects. They 
found a strong association between individuals with the disease 
and a common SNP present within the intron (noncoding section) 
of a gene called CFH that is involved in immunity and inflamma- 
tion. Persons homozygous for this SNP had a 7.4-fold greater risk 
of having the disease. Once they had pinned down this region 
of the genome as being associated with AMD, they sequenced 
the entire CFH gene in 96 individuals. They found that the SNP 
they had identified was tightly linked to a polymorphism within 
the coding region of the CFH gene that placed a histidine at a 
particular site in the protein. Other alleles of the CFH protein that 
were not associated with a risk for AMD had a tyrosine at this 
position. These findings provide additional evidence that inflam- 
mation is an underlying factor in the development of AMD and 
point out a clear target in the search for treatments of the disease. 
A large number of GWASs have been conducted over the 
past few years. In many of these studies, the genomes of thou- 
sands of individuals have been scrutinized, and the results have 
been confirmed in independent analyses of separate case and 
control populations. These studies have become practical with 
the availability of commercially produced chips that house a mil- 
ion or more DNA snippets containing the most common SNPs 
in the human population. Researchers can incubate the chip with 
a DNA sample to be tested and quickly determine which of the 
possible SNPs is present at each genomic location in that person's 
DNA. Once a particular SNP is identified as being associated with 
a particular disease, investigators can search for a likely gene in 
that region of the genome that is responsible for the association 
see discussion below on haplotypes). Moreover, the data gener- 
ated by a number of different studies on a particular disease can 
be pooled, which increases the sample size and gives investiga- 
tors greater statistical power for identifying disease-causing var- 
iants. As a result of these GWASs, we now have lists of genes, as 
well as noncoding sites that regulate gene expression, for which 
certain alleles or variants increase the likelihood that an individual 
will develop various diseases, including heart disease, Crohn's 
disease, types 1 and 2 diabetes, and several types of cancer. 
When researchers began these association studies, it was 
hoped that the studies would reveal new insights into the under- 
lying basis of common diseases by turning up genes not pre- 
viously suspected of being involved in particular diseases. The 
products of such genes may ultimately become targets of new 
avenues of drug therapy, just as identification of the importance 
of the HMG CoA reductase gene and LDL receptor gene in the 
disease familial hypercholesterolemia (Section 8.18) led to the 
development of the cholesterol-lowering statins. Although these 
genetic association studies have not yet led to the emergence 
of new drugs, they have uncovered important mechanisms and 


pathways involved in the development of numerous common 
diseases. To cite just one example, several of the susceptibility 
alleles that contribute to type 2 diabetes encode proteins that 
function in the insulin-secreting cells of the pancreas, which has 
focused renewed attention on the dysfunction of these cells in 
the development of this disease. (Readers interested in exam- 
ining the results of these studies can find them at the user-friendly 
site, www.snpedia.com) 

Despite the fact that GWASs have turned up many hun- 
dreds of common genetic variants that contribute to dozens of 
common diseases, they have also left many clinical geneticists 
disappointed. For many years, we have known the degree to 
which heredity contributes to the overall susceptibility of most 
common diseases. This measure has been obtained primarily by 
comparing the incidence of a given disease in members of the 
same family, as compared to members of the general population. 
When the results of the recent GWASs are examined, the vast 
majority of the common “susceptibility alleles” that have been 
identified lead to only a small increase in the risk of developing a 
particular disease, typically less than 1.5 times the odds without 
the allele. If one adds up the increased risk a person would bear 
if he or she carried all of the identified susceptibility alleles for 
a particular disease, such as Crohn's disease or type 2 diabetes, 
it does not come close to accounting for the contribution that 
genetics is known to play in the development of that disease. In 
other words, there is a great deal of “missing heritability” that 
is yet to be discovered. It was initially thought that much of the 
missing heritability might be accounted for by structural varia- 
tions (page 399), which are not subject to detection in association 
studies that rely on SNPs. However, as techniques have been 
developed to map the structural variants in the human genome, 
this notion now also seems unlikely. 

Many factors could be responsible for the fact that most of 
the genetic risk factors for common diseases have yet to be iden- 
tified. For example, a very large number of common alleles may 
contribute such a small risk (i-e., have a very low penetrance) that 
they have not been detected in these GWASs. If this turns out 
to be the case, then identification of such low-penetrant variants 
may be of little practical value. On the opposite end of the scale, 
there may be a large number of rare variants that significantly raise 
the risk of disease (i.e., they are moderately penetrant). However, 
because they are rare (e.g., 0.1 to 1 percent frequency), this 
group of variants would also have been overlooked in the current 
GWASs. Another explanation of much of the missing heritability 
is a phenomenon called epistasis, which describes the observa- 
tion that the presence of a particular allele at one genetic locus 
can affect the expression of an allele at a different locus. In other 
words, genes can interact with one another, and thus researchers 
may have to consider the role of specific combinations of alleles 
as determinants of particular phenotypes. It is difficult enough to 
determine the effects of individual variants on a particular disease 
outcome without attempting to measure the effects of combina- 
tions of variants. 

Despite the present difficulties, it may be possible, one day, 
to determine whether a person is genetically predisposed to 
develop a particular disease simply by identifying which nucle- 
otides are present at key positions within that person's genome. 
Information about genetic variation may also provide an indica- 
tion as to how persons will react to a particular drug, whether they 
are likely to be helped by it and/or whether they may develop 
serious side effects. To cite just one example, individuals who carry 
an allele with two specific SNPs in a gene encoding the enzyme 
TPMT are unable to metabolize a class of thiopurine drugs that 
are commonly used to treat a type of childhood leukemia. Persons 
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who are homozygous for this allele are at very high risk of devel- 
oping a life-threatening suppression of their bone marrow when 
treated with normal doses of thiopurine drugs. Most diseases can 
be treated by a number of alternate medications, so that proper 
drug selection is an important aspect of the practice of medicine. 
The pharmaceutical industry is hoping that SNP data will eventu- 
ally lead to an era of “customized drug therapy,” allowing physi- 
cians to prescribe specific drugs at specific doses that are tailored 
to each individual patient based on his or her genetic profile. This 
era may have begun with the FDA approval of a DNA-containing 
chip that allows physicians to screen a patient’s DNA to determine 
which variants of two different cytochrome P-450 genes he or she 
possesses. These genes help determine how efficiently a person 
is able to metabolize various drugs, ranging from painkillers and 
antidepressants to over-the-counter heartburn medications. 

The use of SNP data in association studies presents a for- 
midable challenge because of the sheer number of these poly- 
morphisms within the genome. As researchers began to study the 
distribution of SNPs in different human populations, they made a 
striking discovery that has made association studies much simpler: 
Large blocks of SNPs have been inherited together as a unit over 
the course of recent human evolution. These stretches of SNPs 
are thought to have remained together in the genome because 
genetic recombination (i.e., crossing over) does not occur ran- 
domly along the DNA, as was suggested in the discussion of 
gene mapping on page 371. Instead, there are short segments 
of DNA (1-2 kb) where recombination is likely to occur and blocks 
of DNA between these “hot spots” that have a low frequency 
of recombination. As a result, certain blocks of DNA (typically 
about 20 kilobases long) tend to remain intact as they are trans- 
mitted from generation to generation. These blocks are called 
haplotypes. Haplotypes are like “giant multigenic alleles”; if you 
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FIGURE 1. The genome is divided into blocks (haplotypes). The top 
line shows a hypothetical segment of DNA containing a number of SNPs 
(each SNP is indicated by a filled circle). This particular segment consists 
of five haplotypes separated by short stretches of DNA that are highly 
variable. Each haplotype occurs as a small number of variants. Each 
haplotype shown here exists as three to six different variants. Each 
haplotype variant is characterized by a specific set of SNPs, indicated by 
the colored circles. All of the SNPs of a particular haplotype variant are 
drawn in the same color to indicate that they are inherited as a group 
and are found together in different members of the population. Each 
person represented at the bottom of the illustration has a particular 
combination of haplotypes on each of his or her two chromosomes. 
Some haplotype variants are found in many different ethnic groups; 
others have a more restricted distribution. 


select a particular site on a particular chromosome, only a limited 
number of alternate haplotypes can be found in that region 
(FIGURE 1). Each of the alternate haplotypes is defined by a small 
number of SNPs (called “tag SNPs") in that region of the genome. 
Once the identity of a handful of tag SNPs within a haplotype has 
been determined, the identity of the entire haplotype is known. 
In 2002 about 25 groups of investigators began a collabo- 
ration, called the International HapMap Project, aimed at iden- 
tifying and mapping the various haplotypes that exist within the 
human population. The HapMap would contain all of the common 
haplotypes (i.e., haplotypes present in at least 5 percent of the 
population) that were present along the length of each of the 
24 different human chromosomes in 270 members of four ethnically 
different populations. The project was completed in 2006 and 
resulted in publication of a HapMap built on more than four mil- 
lion tag SNPs spaced evenly throughout the genome. A number 
of studies have since used the HapMap to identify associations 
between a particular disease and a particular haplotype. The anal- 
ysis of HapMaps has also provided data on the origins and migra- 
tions of human populations and has yielded valuable insights into 
the factors that have shaped the human genome. This is illustrated 
by the following example. Whether we are able to drink milk as an 
adult without upsetting our stomach (i.e., whether we are lactose 
tolerant) depends on which alleles of the lactase gene we carry 
in our genome. Lactose tolerance is associated with an unusu- 
ally long haplotype that contains a variation within a regulatory 
site that causes the lactase gene to be persistently expressed 
into adulthood. This particular haplotype is present at high fre- 
quency in European populations, which have had a long history of 
raising dairy animals, and is rare in most sub-Saharan African and 
Southeast Asian populations, which historically did not raise such 
animals. Its high frequency among Europeans suggests that this 
particular haplotype was under strong positive selective pressure 
in populations that depended on dairy products for nutrition. This 
single haplotype is over 1 million bases long, which indicates that 
it has not been together as a block long enough for crossing over 
to have had a chance to break it into smaller segments. In fact, it 
is estimated that this haplotype was subjected to strong selective 
pressure between 5,000 and 10,000 years ago, about the time that 
dairy farming is thought to have arisen. It is interesting to note 
that several East African populations that engage in dairy farming 
also exhibit persistent lactase expression. The mutations respon- 
sible for the lactose-tolerant phenotype in these Africans are dif- 
ferent from the mutation found in Europeans, indicating that the 
trait evolved independently in the two groups and represents a 
good example of convergent evolution within the human lineage. 
As celebrated as the success of the HapMap was in 2006, 
some geneticists acknowledged that the HapMap was a stopgap 
until a time when high-throughput whole genome sequencing 
was more feasible. The Human Genome Project, which involved 
the efforts of more than a thousand researchers working for sev- 
eral years and led to the first sequence of the human genome, is 
estimated to have cost in the neighborhood of one billion dollars. 
By 2014, the price of sequencing an entire human genome had 
dropped to about $1,000. This dramatic reduction in cost and 
effort has been made possible by the development of a new 
generation of methods to determine DNA sequences (See the 
Experimental Walkthrough Video: Genome sequencing for this chapter). 
Using these new technologies, large-scale sequencing efforts 
are now underway. An international collaboration, called the 
1000 Genomes Project, began in 2008 to compare the complete 
genome sequences of at least one thousand individuals. By 
2014, the team had sequenced and publicly released the whole 
genome sequences of 1092 individuals from 14 populations. 
Researchers have started to mine this genomic data to identify 


variants in the genome that may contribute to human disease. A 
new large-scale sequencing effort, called the Genomic Sequencing 
and Newborn Screening Disorders Project, was launched in 2013. 
This project seeks to complete the full genomes of 1,500 newborn 
babies to investigate how newborn screening for diseases can be 
improved. Importantly, funds have also been set aside for studies 
exploring the ethical, legal, and social issues surrounding genome 
sequencing—issues that will likely be hotly debated for years to 
come as genome sequencing becomes more commonplace. 


template DNA. These techniques have allowed some researchers 
to accomplish an impressive feat: sequencing a full genome 
from a single cell. From numerous studies, it is clear that there 
is considerable cell-to-cell genomic variability in both healthy 
and diseased tissues. The progression of certain cancers, for 
example, has been attributed to the accumulation of a number 
of specific mutations in tumor cells over time. Single cell genomic 
studies hold much promise in providing a detailed look at ge- 
netic changes that occur in tumor cells that may eventually lead 
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Another major development in recent years is the ability to 
sequence a genome with vanishingly small amounts of starting 


to innovations in treatment and prevention. 


Analytic Questions| 


1. How would Mendel'’s results have differed if two of the seven 
traits he studied were encoded by genes that resided close to 
one another on one of the chromosomes of a pea plant? 


8. Suppose you had two solutions of DNA, one single stranded and 
the other double stranded, with equivalent absorbance of ultra- 
violet light. How would the concentrations of DNA compare in 


2. Sutton was able to provide visual evidence for Mendel's law of these two solutions? 


segregation. Why would it have been impossible for him visually 9. What type of labeling pattern would you expect after in situ 

to confirm or refute Mendel’s law of independent assortment? hybridization between mitotic chromosomes and labeled myo- 
3. Genes X, Y, and Z are all located on one chromosome. Draw a globin mRNA? between the same chromosomes and labeled 

simple map showing the gene order and relative distances histone DNA? 

among the genes X, Y, and Z using the data below: 10. According to Chargaff's determination of base composition, 
which of the following would characterize any sample of DNA? 
(1) [A] +[T] =[G] +1C]; (2)[A] /[T] =1; (3) [G] =[C1;(4) [A] + [G] =[T] +[C]. 
If the C content of a preparation of double-stranded DNA is 


Crossover Frequency Between These Genes 


36% X-Z : 
a YZ 15 percent, what is the A content? (See Quantitative Tutorial Video). 
26% X-Y 11. If 30 percent of the bases on a single strand of DNA are T, then 


30 percent of the bases on that strand are A. True or false? Why? 


12. Would you agree with the statement that the T,, represents the 
temperature at which 50 percent of the DNA molecules in a solu- 
tion are single stranded? 


4. Alleles on opposite ends of a chromosome are so likely to be 
separated by crossing over between them that they segregate 
independently. How would one be able to determine, using 
genetic crosses, that these two genes belong to the same link- 13. Suppose you were to find a primate that had a B-globin gene 
age group? How might this be established using nucleic acid with only one intervening sequence. Do you think this animal 
hybridization? may have evolved from a primitive ancestor that split away from 


5. Where in the curve of Figure 10.18 would you expect to see the other animals prior to the appearance of the second intron? 


reannealing of the DNA that codes for ribosomal RNA? Suppose 14. In 1996, a report was published in the journal Lancet on the level 
you had a pure preparation of DNA fragments whose sequences of trinucleotide expansion in the DNA of blood donors. These 
code for ribosomal RNA. Draw the reannealing of this DNA super- researchers found that the level of trinucleotide expansion 
imposed over the curve for total DNA shown in Figure 10.18. decreased significantly with age. Can you provide any explana- 


6. Approximately 5 percent of the present human genome consists tion for these findings? 


of segmental duplications that have arisen during the past 15. Suppose you were looking at a particular SNP in the genome 
35 million years. How do you suppose researchers are able to where half the population had an adenine (A) and the other half 
estimate how long it has been since a particular region of a chro- had a guanine (G). Is there any way that you might be able to 
mosome was duplicated? determine which of these variants was present in ancestral 


7. It was noted on page 392 that at least two thirds of the human humans and which became frequent during human evolution? 


genome is derived from transposable elements. The actual num- 16. Look over Figures 10.24 and 10.29. The arrangement of genes 
ber could be much higher, but it is impossible to make a deter- depicted in the former figure has evolved over hundreds of mil- 
mination about the origin of many other sequences. Why do you lions of years, whereas that in the latter figure has evolved over 
suppose it is difficult to make assignments about the origin of thousands of years. Discuss the likely possibilities for the future 
many of the sequences in the human genome? evolution of the human amylase genes. 
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The Central Dogma: 
DNA to RNA to Protein 


Ths DNA in our genome contains all the necessary infor- 
mation to make all proteins of our body, including the 
proteins required to duplicate DNA, to transcribe DNA into 
RNA, and then to translate RNA into protein. Protein pro- 
duction requires DNA and RNA, and DNA replication and 
RNA production require proteins. This mutual dependence 
creates a chicken-and-egg problem for evolution. Protein- 
based life would be impossible without DNA or RNA to 
encode the protein sequence. And DNA- or RNA-based life 
would be impossible without protein enzymes to copy the 
genetic material. So how could life have started in the first 
place? As we will see in this chapter, RNA is able to work like 
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A model of the large subunit of a prokaryotic ribosome at 2.4 A resolution as 
determined by X-ray crystallography. This view looks into the subunit's active 
site cleft, which consists entirely of RNA. RNA is shown in gray, protein in 
gold, and the active site is revealed by a bound inhibitor (green). The role of 
RNA as a catalyst for making protein is thought to be a vestige of the early 
origins of life on our planet, in which RNA, rather than DNA or protein, 


played the leading role as both genome and catalyst. 
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an enzyme in some cases, and in fact RNA-based enzymatic 
catalysis is an essential part of the machinery for synthesizing 
proteins. While DNA duplication is driven by protein 
enzymes in modern organisms, the ability of DNA and RNA 
to base pair with free nucleotides suggests a scenario in 
which an RNA sequence spontaneously formed that was 
then able to self-replicate, ultimately giving rise to many 
self-propagating copies of itself. This is known as the “RNA 
world” hypothesis, which proposes that life began as RNA. 
Exactly what the primordial self-propagating RNA molecules 


11.1 The Relationship between 
Genes, Proteins, and RNAs 


In many ways, progress in biology over the past century is reflected 
in our changing concept of the gene. As the result of Mendel’s work, 
biologists learned that genes are discrete elements that govern the 
appearance of specific traits. Given the opportunity, Mendel might 
have argued for the concept that one gene determines one trait. 
Boveri, Weismann, Sutton, and their contemporaries discovered that 
genes have a physical embodiment as parts of the chromosome. 
Morgan, Sturtevant, and colleagues demonstrated that genes have 
specific addresses—they reside at particular locations on particular 
chromosomes, and these addresses remain constant from one indi- 
vidual of a species to the next. Griffith, Avery, Hershey, and Chase 
demonstrated that genes were composed of DNA, and Watson and 
Crick solved the puzzle of DNA structure, which explained how this 
remarkable macromolecule could encode heritable information. 

Although the formulations of these concepts were milestones 
along the path to genetic understanding, none of them addressed the 
mechanism by which the information stored in a gene is put to work 
in governing cellular activities. This is the major topic to be dis- 
cussed in the present chapter. We will begin with additional insights 
into the nature of a gene, which brings us closer to its role in the 
expression of inherited traits. 


Evidence That DNA Is the 
Genetic Material 


The first meaningful insight into gene function was gained by 
Archibald Garrod, a Scottish physician who reported in 1908 that 
the symptoms exhibited by persons with certain rare inherited dis- 
eases were caused by the absence of specific enzymes. One of the 
diseases investigated by Garrod was alcaptonuria, a condition read- 
ily diagnosed because the urine becomes dark on exposure to air. 
Garrod found that persons with alcaptonuria lacked an enzyme in 
their blood that oxidized homogentisic acid, a compound formed 
during the breakdown of the amino acids phenylalanine and tyros- 
ine. As homogentisic acid accumulates, it is excreted in the urine and 
darkens in color when oxidized by air. Garrod had discovered the 
relationship between a genetic defect, a specific enzyme, and a 
specific metabolic condition. He called such diseases “inborn errors 
of metabolism” As seems to have happened with other early 


would have looked like remains unknown, but at least this 
particular origin-of-life scenario is experimentally testable, 
because we can synthesize RNA molecules in the lab and 
test their ability to self-replicate and perform other basic 
biochemical functions. Probably the earliest life was some 
combination of self-replicating RNA and a simple membrane 
of some kind that could selectively exchange chemicals with 
the surrounding primordial soup. Throughout this chapter, 
we will see RNA playing a central role in molecular biology, 
perhaps serving as a reminder of its ancient origins. 


observations of basic importance in genetics, Garrod’s findings went 
unappreciated for decades. 

The idea that genes direct the production of enzymes was resur- 
rected in the 1940s by George Beadle and Edward Tatum of the 
California Institute of Technology. They studied Neurospora, a tropi- 
cal bread mold that grows in a very simple medium containing a 
single organic carbon source (e.g., a sugar), inorganic salts, and bio- 
tin (a B vitamin). Because it needs so little to live on, Neurospora was 
presumed to synthesize all of its required metabolites. Beadle and 
Tatum reasoned that an organism with such broad synthetic capacity 
should be very sensitive to enzymatic deficiencies, which should be 
easily detected using the proper experimental protocol. A simplified 
outline of their protocol is given in FIGURE 11.1. 

Beadle and Tatum’s plan was to irradiate mold spores and screen 
them for mutations that caused cells to lack a particular enzyme. 
Radiation was used to damage DNA, thus creating changes in DNA 
sequence (mutations). To screen for such mutations affecting 
enzymes, irradiated spores were tested for their ability to grow in a 
minimal medium that lacked the essential compounds known to be 
synthesized by this organism (Figure 11.1). If a spore is unable to 
grow in minimal medium but a genetically identical spore can grow 
in a medium supplemented with a particular metabolic compound 
(e.g., pantothenic acid), then the researchers could conclude that the 
cells have an enzymatic deficiency that prevents them from synthe- 
sizing this essential compound. 

Beadle and Tatum began by irradiating over a thousand cells. 
Two of the spores proved unable to grow on the minimal medium: 
one needed pyridoxine (vitamin B,), and the other needed thiamine 
(vitamin B,). Eventually, the progeny of about 100,000 irradiated 
spores were tested, and dozens of mutants were isolated. Each 
mutant had a gene defect that resulted in an enzyme deficiency that 
prevented the cells from catalyzing a particular metabolic reaction. 
The results were clear-cut: A gene carries the information for the 
construction of a particular enzyme. This conclusion became known 
as the “one gene-one enzyme” hypothesis. Once it was learned that 
enzymes are often composed of more than one polypeptide, each of 
which is encoded by its own gene, the concept became modified to 
“one gene-one polypeptide.” 

Although this relationship remains a close approximation of the 
basic function of a gene, it too has had to be modified owing to the 
discovery that a single gene often generates a variety of polypeptides, 
primarily as the result of alternative splicing (discussed in 
Section 12.18). It would also become apparent that describing a gene 
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FIGURE 11.1 The Beadle-Tatum experiment for the isolation of genetic mutants in Neurospora. Spores were irradiated to induce mutations (step 1) 
and allowed to grow into colonies in tubes containing supplemented medium (step 2). Genetically identical spores produced by the colonies were then 
tested for their ability to grow on either supplemented or minimal medium (step 3). Those that failed to grow on minimal medium were mutants, and the 
task was to identify the mutant gene. In the example shown in step 4, a sample of mutant cells was found to grow in the minimal medium supplemented 
with vitamins, but would not grow in medium supplemented with amino acids. This observation indicates a deficiency in an enzyme leading to the formation 
of a vitamin. In step 5, growth of these same cells in minimal medium supplemented with one or another of the vitamins indicates the deficiency resides in a 


gene involved in the formation of pantothenic acid (part of coenzyme A). 


strictly as an information store for polypeptides is far too narrow of 
a definition. Many genes encode RNA molecules that, rather than 
containing information for polypeptide synthesis, function as RNAs 
in their own right. With this in mind, it might be best to define a 
gene as a segment of DNA that contains the information for either 
a single polypeptide chain or for one or more functional RNAs. 


An Overview of the Flow of 
Information through the Cell 


Genes contain the information needed to make polypeptides, but 
this information is stored in an inert form, as a stretch of DNA bases, 
that is not able to perform any metabolic function in the cell. Gene 
expression refers to the production of a functional product (for 
example, an enzyme) using the information encoded in a gene. The 
information present in a segment of DNA is made available to the 
cell through formation of a molecule of RNA. The synthesis of an 
RNA from a DNA template is called transcription. The term tran- 
scription denotes a process in which the information encoded in the 
four deoxyribonucleotide letters of DNA is rewritten, or transcribed, 
into a similar language composed of four ribonucleotide letters of 


RNA. We will examine the mechanism of transcription shortly, 
but first we will continue with the role of a gene in the formation of 
polypeptides. 

Studies carried out in the 1950s uncovered the relationship 
between genetic information and amino acid sequence, but this 
knowledge by itself provided no clue about the mechanism by which 
the specific polypeptide chain is generated. As we now know, there is 
an intermediate between a gene and its polypeptide; the intermedi- 
ate is messenger RNA (mRNA). The momentous discovery of 
mRNA was made in 1961 by Frangois Jacob and Jacques Monod of 
the Pasteur Institute in Paris, Sydney Brenner of the University 
of Cambridge, and Matthew Meselson of the California Institute of 
Technology. A messenger RNA is assembled as a complementary 
copy of one of the two DNA strands that make up a gene. Because its 
nucleotide sequence is complementary to that of the gene from 
which it is transcribed, the mRNA retains the same information for 
polypeptide assembly as the gene itself. For this reason, an mRNA 
can also be described as a “sense” strand or a coding RNA. Eukaryotic 
mRNAs are not synthesized in their final (or mature) form but 
instead must be carved out (or processed) from much larger pre- 
mRNAs. An overview of the role of mRNA in the flow of informa- 
tion through a eukaryotic cell is illustrated in FIGURE 11.2. 


FIGURE 11.2 An overview of the flow of information in a 
eukaryotic cell. The DNA of the chromosomes located within the 
nucleus contains the entire store of genetic information. Selected 
sites on the DNA are transcribed into pre-mRNAs (step 1), which are 
processed into messenger RNAs (step 2). The messenger RNAs are 
transported out of the nucleus (step 3) into the cytoplasm, where they 
are translated into polypeptides by ribosomes that move along the 
mRNA (step 4). Following translation, the polypeptide folds to assume 
its native conformation (step 5). 


The use of messenger RNA allows the cell to separate 
information storage from information utilization. While the 
gene remains stored away in the nucleus as part of a huge, 
stationary DNA molecule, its information can be imparted to 
a much smaller, mobile nucleic acid that passes into the 
cytoplasm. Once in the cytoplasm, the mRNA can serve as a 
template to direct the incorporation of amino acids in a particular 
order encoded by the nucleotide sequence of the DNA and mRNA. 
The use of a messenger RNA also allows a cell to greatly amplify its 
synthetic output. One DNA molecule can serve as the template in 
the formation of many mRNA molecules, each of which can be used 
in the formation of a large number of polypeptide chains. The con- 
cept of a DNA-based gene encoding an RNA-based message that is 
then translated into a protein is known as the Central Dogma. 

Proteins are synthesized in the cytoplasm by a complex process 
called translation. Translation requires the participation of dozens 
of different components, including ribosomes. Ribosomes are com- 
plex, cytoplasmic “machines” that can be programmed, like a com- 
puter, to translate the information encoded by any mRNA. 
Ribosomes contain both protein and RNA. The RNAs of a ribosome 
are called ribosomal RNAs (or rRNAs), and like mRNAs, each is 
transcribed from one of the DNA strands of a gene. Rather than 
functioning in an informational capacity, rRNAs provide structural 
support to build the ribosome and help to catalyze the chemical 
reaction in which amino acids are covalently linked to one another. 
Transfer RNAs (or tRNAs) constitute a third major class of RNA 
that is required during protein synthesis. Transfer RNAs are required 
to translate the information in the mRNA nucleotide code into the 
amino acid “alphabet” of a polypeptide. 

Both rRNAs and tRNAs owe their activity to their complex sec- 
ondary and tertiary structures. Unlike DNA, which has a similar 
double-stranded, helical structure regardless of the source, many 
RNAs fold into a complex three-dimensional shape, which is mark- 
edly different from one type of RNA to another. Thus, like proteins, 
RNAs carry out a diverse array of functions because of their different 
shapes. As with proteins, the folding of RNA molecules follows cer- 
tain rules. Whereas protein folding is driven by the withdrawal of 
hydrophobic residues into the interior, RNA folding is driven by the 
formation of regions having complementary base pairs (FIGURE 11.3). 
As seen in Figure 11.3, base-paired regions typically form double- 
stranded (and double-helical) “stems; which are connected to sin- 
gle-stranded “loops.” Unlike DNA, which consists exclusively of 
standard Watson-Crick (G-C, A-T) base pairs, RNAs often contain 
nonstandard base pairs (inset, Figure 11.3) and modified nitroge- 
nous bases. These unorthodox regions of the molecule serve as rec- 
ognition sites for proteins and other RNAs, promote RNA folding, 
and help stabilize the structure of the molecule. The importance of 
complementary base-pairing extends far beyond tRNA and rRNA 
structure. As we will see throughout this chapter, base-pairing 


DNA of PremRNAs Segment of DNA 
chromosome being transcribed Cytoplasm 
mRNA being translated Protein 


Nucleus 


between RNA molecules plays a central role in most of the activities 
in which RNAs are engaged. 

The roles of mRNAs, rRNAs, and tRNAs are explored in detail 
in the following sections of this chapter. Eukaryotic cells make a host 
of other RNAs, which also play vital roles in cellular metabolism; 
these include small nuclear RNAs (snRNAs), small nucleolar RNAs 
(snoRNAs), small interfering RNAs (siRNAs), piRNAs, microRNAs 
(miRNAs), and a variety of other noncoding RNAs, that is, RNAs that 
do not contain information for amino acid sequences. Most of these 
RNAs serve a regulatory function, controlling various aspects of 
gene expression. Several of these RNAs have burst onto the scene in 
the past few years, reminding us once again that there are many hid- 
den cellular activities waiting to be discovered. 


FIGURE 11.3 Two-dimensional structure of a bacterial ribosomal RNA 
showing the extensive base-pairing between different regions of the 
single strand. The expanded section shows the base sequence of a stem 
and loop, including a nonstandard base pair (G-U) and a modified 
nucleotide, methyladenosine. One of the helices is shaded differently 
because it plays an important role in ribosome function as discussed on page 
445. An example of the three-dimensional structure of an RNA is shown in 
Figure 11.41b. 
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For background material on the structure of RNA, you might 
review section 2.18. 


REVIEW | 
1. How were Beadle and Tatum able to conclude that a 
gene encoded a specific enzyme? 


2. Distinguish between the two-dimensional and three- 
dimensional structure of RNAs. 


eo | 


11.2 The Role of RNA 
Polymerases in Transcription 


Transcription is a process in which a DNA strand provides the infor- 
mation for the synthesis of an RNA strand. The enzymes responsible 
for transcription in both prokaryotic and eukaryotic cells are called 
DNA-dependent RNA polymerases, or simply RNA poly- 
merases. These enzymes are able to incorporate nucleotides, one at 
a time, into a strand of RNA whose sequence is complementary to 
one of the DNA strands, which serves as the template. 

The first step in the synthesis of an RNA is the association of the 
polymerase with the DNA template. This brings up a matter of more 
general interest, namely, the specific interactions of two very differ- 
ent macromolecules, proteins and nucleic acids. Just as different pro- 
teins have evolved to bind different types of substrates and catalyze 
different types of reactions, so too have some of them evolved to 
recognize and bind to specific sequences of nucleotides in a strand of 
nucleic acid. The site on the DNA to which an RNA polymerase 
molecule binds prior to initiating transcription is called the 
promoter. Cellular RNA polymerases are not capable of recogniz- 
ing promoters on their own but require the help of additional pro- 
teins called transcription factors. In addition to providing a 
binding site for the polymerase, the promoter contains the informa- 
tion that determines which of the two DNA strands is transcribed 
and the site at which transcription begins. 

RNA polymerase moves along the template DNA strand toward 
its 5’ end (ie., in a 3’-5’ direction). As the polymerase progresses, 
the DNA is temporarily unwound, and the polymerase assembles a 
complementary strand of RNA that grows starting from its 5’ terminus 
in a 3’ direction (FIGURE 11.4a,b). RNA polymerase catalyzes the 
highly favorable reaction 


RNA, +NTP > RNA,,,, + PP, 


in which ribonucleoside triphosphate substrates (NTPs) are cleaved 
into nucleoside monophosphates as they are polymerized into a 
covalent chain (Figure 11.4c). Reactions leading to the synthesis of 
nucleic acids (and proteins) are inherently different from those 
of intermediary metabolism discussed in Chapter 3. Whereas some of 
the reactions leading to the formation of small molecules, such as 
amino acids, may be close enough to equilibrium that a considerable 
reverse reaction can be measured, those reactions leading to the syn- 
thesis of nucleic acids and proteins must occur under conditions in 
which there is virtually no reverse reaction. This condition is met 
during transcription with the aid of a second favorable reaction 


PP, > 2P, 


catalyzed by a different enzyme, a pyrophosphatase. In this case, the 
pyrophosphate (PP;) produced in the first reaction is hydrolyzed to 


inorganic phosphate (P,). The hydrolysis of pyrophosphate releases a 
large amount of free energy and makes the nucleotide incorporation 
reaction essentially irreversible. 

As the polymerase moves along the DNA template, it incorpo- 
rates complementary nucleotides into the growing RNA chain. A 
nucleotide is incorporated into the RNA strand if it is able to form a 
proper (Watson-Crick) base pair with the nucleotide in the DNA 
strand being transcribed. This can be seen in Figure 11.4c where the 
incoming adenosine 5’-triphosphate pairs with the thymine-containing 
nucleotide of the template. Once the polymerase has moved past a 
particular stretch of DNA, the DNA double helix re-forms (as in 
Figure 11.4a,b). Consequently, the RNA chain does not remain asso- 
ciated with its template as a DNA-RNA hybrid (except for about 
nine nucleotides just behind the site where the polymerase is operat- 
ing). RNA polymerases are capable of incorporating from about 20 
to 50 nucleotides into a growing RNA molecule per second, and 
many genes in a cell are transcribed simultaneously by a hundred or 
more polymerases. The frequency at which a gene is transcribed is 
tightly regulated and can vary dramatically depending on the given 
gene and the prevailing conditions. The electron micrograph of 
Figure 11.4d shows a molecule of phage DNA with a number of 
bound RNA polymerase molecules. 

RNA polymerases are capable of forming prodigiously long 
RNAs. Consequently, the enzyme must remain attached to the DNA 
over long stretches of template (the enzyme is said to be processive). 
At the same time, the enzyme must be associated loosely enough 
that it can move from nucleotide to nucleotide of the template. It is 
difficult to study certain properties of RNA polymerases, such as 
processivity, using biochemical methodologies that tend to average 
out differences between individual protein molecules. Consequently, 
researchers have developed techniques to follow the activities of single 
RNA polymerase molecules similar to those used to study individual 
cytoskeletal motors. Two examples of such studies are depicted in 
FIGURE 11.5. In both of these examples, a single RNA polymerase is 
attached to the surface of a glass coverslip and allowed to transcribe 
a DNA molecule containing a fluorescent bead covalently linked to 
one of its ends. The movement of the fluorescent bead can be moni- 
tored under a fluorescence microscope. 

In Figure 11.5a, the bead is free to move in the medium, and its 
range of movement is proportional to the length of the DNA between 
the polymerase and the bead. As the polymerase transcribes the 
template, the connecting DNA strand is elongated, and the move- 
ment of the bead is increased. This type of system allows investiga- 
tors to study the rate of transcription of an individual polymerase 
and to determine if the polymerase transcribes the DNA in a steady 
or discontinuous movement. In Figure 11.5), the bead at the end of 
the DNA molecule being transcribed is trapped by a focused laser 
beam (page 319). The minute force exerted by the laser trap can be 
varied, until it is just sufficient to stop the polymerase from continu- 
ing to transcribe the DNA. Measurements carried out on single RNA 
polymerase molecules in the act of transcription indicate that these 
enzymes can move over the template, one base (3.4 A) at atime, with 
a force several times that of a myosin molecule (see Chapter 9 for 
discussion of myosin and other motor proteins). 

Even though polymerases are relatively powerful motors, these 
enzymes do not necessarily move in a steady, continuous fashion but 
may pause at certain locations along the template or even backtrack 
before resuming their forward progress. A number of elongation 
factors have been identified that enhance the enzyme’ ability to trav- 
erse these various roadblocks. In some cases, as occurs following the 
rare incorporation of an incorrect nucleotide, a stalled polymerase 
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FIGURE 11.4 Chain elongation during transcription. (a) A schematic model of the elongation of a newly synthesized RNA molecule during transcription. 
The polymerase covers approximately 35 base pairs of DNA, the transcription bubble composed of single-stranded (melted) DNA contains about 15 base 
pairs, and the segment present in a DNA-RNA hybrid includes about nine base pairs. The enzyme generates overwound (positively supercoiled) DNA ahead 
of itself and underwound (negatively supercoiled) DNA behind itself (page 381). These conditions are relieved by topoisomerases (page 381). (b) Schematic 
drawing of an RNA polymerase in the act of transcription elongation. The downstream DNA lies in a groove within the polymerase, clamped by a pair of 
jaws formed by the two largest subunits of the enzyme. The DNA makes a sharp turn in the region of the active site, so that the upstream DNA extends 
upward in this drawing. The nascent RNA exits from the enzyme’s active site through a separate channel. (c) Chain elongation results following an attack by 
the 3’ OH of the nucleotide at the end of the growing strand on the 5’ a-phosphate of the incoming nucleoside triphosphate. The pyrophosphate released 
is subsequently cleaved, which drives the reaction in the direction of polymerization. The geometry of base-pairing between the nucleotide of the template 
strand and the incoming nucleotide determines which of the four possible nucleoside triphosphates is incorporated into the growing RNA chain at each site. 
(d) Electron micrograph of several RNA polymerase molecules bound to a phage DNA template. 


Source: (d) Courtesy of Robley C. Williams. 
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FIGURE 11.5 Examples of experimental techniques to follow the 
activities of single RNA polymerase molecules. (a) In this protocol, the 
RNA polymerase molecule is attached to the underlying coverslip and 
allowed to transcribe a DNA molecule containing a fluorescent bead at the 
upstream end. The arrows indicate the movement of the DNA through the 
polymerase. The rate of movement and progression of the polymerase can 
be followed by observing the position of the bead over time using a 
fluorescence microscope. (b) In this protocol, the attached polymerase is 
transcribing a DNA molecule with a bead at its downstream end. As in a, 
the arrows indicate direction of DNA movement. The bead is caught in an 
optical (laser) trap, which delivers a known force that can be varied by 
adjusting the laser beam. This type of apparatus can measure the forces 
generated by a transcribing polymerase. 

Source: Reprinted from J. Gelles and R. Landick, Cell 93:15, 1998, 
copyright 1998, with permission from Elsevier Science.) 


must backtrack and then digest away the 3’ end of the newly synthe- 
sized transcript and resynthesize the missing portion before contin- 
uing its movement. The ability of RNA polymerase to recognize and 
remove a misincorporated nucleotide is referred to as proofread- 
ing. This corrective function is carried out by the same active site 
within the enzyme that is responsible for nucleotide incorporation. 


REVIEW | 


1. What is the role of a promoter in gene expression? 
Where are the promoters for bacterial polymerases 
located? 


oe 


11.3 An Overview of 
Transcription in Both Prokaryotic 
and Eukaryotic Cells 


At this point, it is useful to examine the differences in the process of 
transcription between bacterial and eukaryotic cells. 


Transcription in Bacteria 


Bacteria, such as E. coli, contain a single type of RNA polymerase 
composed of five subunits that are tightly associated to form a core 
enzyme. If the core enzyme is purified from bacterial cells and added 
to a solution of bacterial DNA molecules and ribonucleoside triphos- 
phates, the enzyme binds to the DNA and synthesizes RNA. The 
RNA molecules produced by a purified polymerase, however, are 
not the same as those found within cells because the core enzyme 
has attached to random sites in the DNA (FIGURE 11.6a), sites that it 
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FIGURE 11.6 Schematic representation of the initiation of transcription 
in bacteria. (a) In the absence of the o factor, the core enzyme does not 
interact with the DNA at specific initiation sites. (b-d) When the core 
enzyme is associated with the o factor, the complete enzyme (or holoenzyme) 
is able to recognize and bind to the promoter regions of the DNA, separate 
the strands of the DNA double helix, and initiate transcription at the proper 
start sites (see Figure 11.7). In the traditional model shown here, the 

o factor dissociates from the core enzyme, which is capable of transcription 
elongation. Several studies suggest that, in at least some cases, 6 may 
remain with the polymerase. 


would normally have ignored in vivo. If, however, a purified acces- 
sory polypeptide called sigma factor (o) is added to the RNA poly- 
merase before it attaches to DNA, transcription begins at selected 
locations (Figure 11.6b-d). Attachment of o factor to the core enzyme 
increases the enzyme’s affinity for promoter sites in DNA and 
decreases its affinity for DNA in general. As a result, the complete 
enzyme is thought to slide freely along the DNA until it recognizes 
and binds to a suitable promoter region. 

X-ray crystallographic analysis of the bacterial RNA polymerase 
(see Figure 11.8) reveals a molecule shaped like a crab claw with a 
pair of mobile pincers (or jaws) enclosing a positively charged inter- 
nal channel. As the o factor interacts with the promoter, the jaws 
of the enzyme grip the downstream DNA duplex, which resides 
within the channel (as in Figure 11.4b). The enzyme then separates 
(or melts) the two DNA strands in the region surrounding the start site 
(Figure 11.6c). The complex of the polymerase, o factor, and DNA 
with the strands separated is called the Open Complex. Strand sepa- 
ration makes the template strand accessible to the enzyme’s active 
site, which resides at the back wall of the channel. Initiation of tran- 
scription appears to be a difficult undertaking because an RNA poly- 
merase typically makes several unsuccessful attempts to assemble an 
RNA transcript. Once about 10 nucleotides have been successfully 
incorporated into a growing transcript, the enzyme undergoes a 
major change in conformation and is transformed into a transcrip- 
tional elongation complex that can move processively along the DNA. 
In the model shown in Figure 11.6d, the formation of an elongation 
complex is followed by release of the o factor. 

As noted above, promoters are the sites in DNA that bind RNA 
polymerase. Bacterial promoters are located in the region of a DNA 
strand just preceding the initiation site of RNA synthesis 
(FIGURE 11.7). The nucleotide at which transcription is initiated is 
denoted as +1 and the preceding nucleotide as —1. Those portions of 
the DNA preceding the initiation site (toward the 3’ end of the tem- 
plate strand) are said to be upstream from that site. Those portions of 
the DNA succeeding it (toward the 5’ end of the template strand) are 
said to be downstream from that site. Analysis of the DNA sequences 
just upstream from a large number of bacterial genes reveals that two 
short stretches of DNA are similar from one gene to another. One of 
these stretches is centered at approximately 35 bases upstream from 
the initiation site and typically occurs as the sequence TTGACA 
(Figure 11.7). This TTGACA sequence (known as the —35 element) 
is called a consensus sequence, which indicates that it is the most 
common version of a conserved sequence, but that some variation 
occurs from one gene to another. The second conserved sequence is 
found approximately 10 bases upstream from the initiation site and 
occurs at the consensus sequence TATAAT (Figure 11.7). This site in 
the promoter, named the -10 element after its position or the Pribnow 


FIGURE 11.7 The basic elements of a 
promoter region in the DNA of the 
bacterium E. coli. The key regulatory 
sequences required for initiation of 
transcription are found in regions located 
at -35 and —-10 base pairs from the site at 
which transcription is initiated. The 
initiation site marks the boundary 
between the + and — sides of the gene. 


box after its discoverer, is responsible for identifying the precise 411 


nucleotide at which transcription begins. As the sigma factor recog- 
nizes the Pribnow box, amino acid residues within the protein inter- 
act with each of the six nucleotides of the TATAAT sequence of the 
nontemplate strand. Two of these nucleotides are flipped out of the 
nucleotide stack and into the core of the protein. This action likely 
initiates melting of the adjoining region of promoter DNA and for- 
mation of the transcription bubble seen in Figures 11.6 and 11.7. 

Bacterial cells possess a variety of different o factors that recog- 
nize different versions of the promoter sequence. The o”” is known 
as the “housekeeping” o factor, because it initiates transcription of 
most genes. Alternative o factors initiate transcription of a small 
number of specific genes that participate in a common response. For 
example, when E. coli cells are subjected to a sudden rise in tempera- 
ture, a new o factor is synthesized that recognizes a different 
promoter sequence and leads to the coordinated transcription of a 
battery of heat-shock genes. The products of these genes protect the 
proteins of the cell from thermal damage (See Experimental 
Pathways for Chapter 2). 

Just as transcription is initiated at specific points in the chromo- 
some, it also terminates when a specific nucleotide sequence is 
reached. In roughly half of the cases, a ring-shaped protein called rho 
is required for termination of bacterial transcription. Rho encircles 
the newly synthesized RNA and moves along the strand in a 5’3' 
direction to the polymerase, where it separates the RNA transcript 
from the DNA to which it is bound. In other cases, the polymerase 
stops transcription when it reaches a terminator sequence. Terminator 
sequences typically fold into a hairpin loop that causes the RNA 
polymerase to release the completed RNA chain without requiring 
additional factors. 


Transcription and RNA Processing 
in Eukaryotic Cells 


Eukaryotic cells have three distinct transcribing enzymes in their cell 
nuclei. Each of these enzymes is responsible for synthesizing a differ- 
ent group of RNAs (Table 11.1). Plants have two additional RNA 
polymerases that are not essential for life. No prokaryote has been 
found with multiple RNA polymerases, whereas the simplest eukar- 
yotes (yeast) have the same three nuclear types that are present in 
mammalian cells. This difference in number of RNA polymerases is 
another sharp distinction between prokaryotic and eukaryotic cells. 

FIGURE 11.8a shows the surface structures of RNA polymerases 
from each of the three domains of life: archaea, bacteria, and eukary- 
otes. Several features are apparent from a close examination of this 
illustration. It is evident that the archaeal and eukaryotic enzymes 
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412 TABLE 11.1 Eukaryotic Nuclear RNA Polymerases 
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RNAs Synthesized 


larger rRNAs (288, 18S, 5.8S) 
mRNAs, most small nuclear 
RNAs (snRNAs and snoRNAs), 
most microRNAs, and telomerase 
RNA 
other small RNAs, including 
tRNAs, 5S rRNA, and U6 snRNA 
siRNAs 


Enzyme 


RNA polymerase I 
RNA polymerase II 


RNA polymerase III 


RNA Polymerase IV, V 
(Plants only) 


are more similar in structure to one another than are the bacterial 
and archaeal enzymes. This feature reflects the evolutionary rela- 
tionship between archaea and eukaryotes that was discussed back in 
section 1.9. The subunits that make up each of the proteins depicted 
in Figure 11.8a are indicated by a different color, and it is immedi- 
ately evident that RNA polymerases are multisubunit enzymes. 
Those subunits between the different enzymes that are homologous 
to one another (i.e., are derived from a common ancestral polypep- 
tide) are shown in the same color. Thus, it is also evident from Figure 
11.84 that the RNA polymerases from the three domains share a 
number of subunits. Although the yeast enzyme shown in Figure 
11.84 has a total of 12 subunits, seven more than its bacterial coun- 
terpart, the fundamental core structure of the two enzymes is virtu- 
ally identical. This evolutionary conservation in RNA polymerase 
structure is revealed in Figure 11.8b, which shows the homologous 
regions of the bacterial and eukaryotic enzymes at much higher 
resolution. 

Our understanding of transcription in eukaryotes was greatly 
advanced with the 2001 publication of the X-ray crystallographic 
structure of yeast RNA polymerase II by Roger Kornberg and col- 
leagues at Stanford University. As a result of these studies, and those 
of other laboratories in subsequent years, we now know a great deal 
about the mechanism of action of RNA polymerases as they move 
along the DNA, transcribing a complementary strand of RNA. A major 


Bacterial RNAP Archaeal RNAP 


(a) 


distinction between transcription in prokaryotes and eukaryotes is 
the requirement in eukaryotes for a large variety of accessory pro- 
teins, or transcription factors. These proteins play a role in virtually 
every aspect of the transcription process, from the binding of the 
polymerase to the DNA template, to the initiation of transcription, 
to its elongation and termination. Although transcription factors are 
crucial for the operation of all three types of eukaryotic RNA poly- 
merases, they will only be discussed in regard to the synthesis of 
mRNAs by RNA polymerase II (page 418). 

All three major types of eukaryotic RNAs—mRNAs, rRNAs, and 
tRNAs—are derived from precursor RNA molecules that are consid- 
erably longer than the final RNA product. The initial precursor RNA 
is equivalent in length to the full length of the DNA transcribed and is 
called the primary transcript, or pre-RNA. The corresponding seg- 
ment of DNA from which a primary transcript is transcribed is called 
a transcription unit. Primary transcripts do not exist within the cell 
as naked RNA but become associated with proteins even as they are 
synthesized. Primary transcripts typically have a fleeting existence, 
being processed into smaller, functional RNAs by a series of “cut-and- 
paste” reactions. RNA processing requires a variety of small RNAs (90 
to 300 nucleotides long) and their associated proteins. In the following 
sections, we will examine the activities associated with the transcription 
and processing of each of the major eukaryotic RNAs. 


REVIEW | 


1. Describe the steps during initiation of transcription in 
bacteria. What is the role of the o factor? What is the 
nature of the reaction in which nucleotides are 
incorporated into a growing RNA strand? How is the 
specificity of nucleotide incorporation determined? 
What is the role of pyrophosphate hydrolysis? 

2. How do the number of RNA polymerases distinguish 
prokaryotes and eukaryotes? What is the relationship 
between a pre-RNA and a mature RNA? 
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FIGURE 11.8 A comparison of prokaryotic and eukaryotic RNA polymerase structure. (a) RNA polymerases from the three domains of life. Each 
subunit of an enzyme is denoted by a different color and labeled according to conventional nomenclature for that enzyme. Homologous subunits are 
depicted by the same color. It can be seen that the archaeal and eukaryotic polymerases are more similar in structure to one another than are the bacterial 
and eukaryotic enzymes. RNA polymerase II (shown here) is only one of three major eukaryotic nuclear RNA polymerases. (b) Ribbon diagram of the core 
structure of yeast RNA polymerase II. Regions of the bacterial polymerase that are structurally homologous to the yeast enzyme are shown in green. The large 
channel that grips the downstream DNA is evident. A divalent Mg?* ion, situated at the end of the channel and within the active site, is seen as a red sphere. 


Source: (a) From Akira Hirata, Brianna J. Klein, and Katsuhiko S. Murakami, Nature 451:852, 2008, reprinted by permission from Macmillan Publishers Ltd.; 
(b) From Patrick Cramer, et al., Science 292:1874, 2001, Fig. 12b. © 2001, reprinted with permission from AAAS. Courtesy of Roger Kornberg, Stanford 


University School of Medicine. 


11.4 Synthesis and Processing 
of Eukaryotic Ribosomal and 
Transfer RNAs 


A eukaryotic cell may contain millions of ribosomes, each consisting 
of several molecules of rRNA together with dozens of ribosomal 
proteins. The composition of a mammalian ribosome is shown in 
FIGURE 11.9. Ribosomes are so numerous that more than 80 percent 
of the RNA in most cells consists of ribosomal RNA. To furnish the 
cell with such a large number of transcripts, the DNA sequences 
encoding rRNA are normally repeated hundreds of times. This 
DNA, called rDNA, is typically clustered in one or a few regions of 
the genome. The human genome has five rDNA clusters, each on a 
different chromosome. In a nondividing (interphase) cell, the clus- 
ters of rDNA are gathered together as part of one or more irregularly 
shaped nuclear structures, called nucleoli (singular, nucleolus), 
that function in producing ribosomes (FIGURE 11.10a). 

The bulk of a nucleolus is composed of nascent ribosomal subu- 
nits that give the nucleolus a granular appearance (Figure 11.10). 
Embedded within this granular mass are one or more rounded cores 
consisting primarily of fibrillar material. As discussed in the legend 
of Figure 11.10, the fibrillar material is thought to consist of rDNA 
templates and nascent rRNA transcripts. In the following sections, 
we will examine the process by which these rRNA transcripts are 
synthesized. 
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FIGURE 11.9 The macromolecular composition of a mammalian 
ribosome. This schematic drawing shows the components present in each 
of the subunits of a mammalian ribosome. The synthesis and processing of 
the rRNAs and the assembly of the ribosomal subunits are discussed in the 
following pages. 


Source: From D. P. Snustad et al., Principles of Genetics; Copyright © 1997, 
John Wiley & Sons, Inc., reprinted by permission of John Wiley & Sons, Inc. 
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FIGURE 11.10 The nucleolus. (a) Light micrograph of human lung 
fibroblast cells stained with a fluorescent dye that binds to RNA. The nuclei 
(highlighted in blue with a blue fluorescent DNA dye) contain multiple 
nucleoli, factories in which ribosomes are made and which are enriched in 
both ribosomal RNA and ribosomal proteins. (6) Electron micrograph of a 
section of part of a nucleus with a nucleolus. Three distinct nucleolar 
regions can be distinguished morphologically. The bulk of the nucleolus 
consists of a granular component (gc), which contains ribosomal subunits in 
various stages of assembly. Embedded within the granular regions are 
fibrillar centers (fc) that are surrounded by a more dense fibrillar compo- 
nent (dfc). The inset shows a schematic drawing of these parts of the 
nucleolus. According to one model, the fc contains the DNA that codes for 
ribosomal RNA, and the dfc contains the nascent pre-rRNA transcripts and 
associated proteins. According to this model, transcription of the pre-rRNA 
precursor takes place at the border between the fc and dfc. (Note: Nucleoli 
have other functions unrelated to ribosome biogenesis that are not 
discussed in this text.) Bar, 1 um. 

Source: (a) Courtesy of Thermo Fisher Scientific; (b) From Pavel Hozak, et al., 
J. Cell Science 107:646, 1994. Reproduced with permission of The Company 
of Biologists Ltd. http://jcs.biologists.org/content/ 107/2/639. full. 
pdf+html?sid=8e44240f- 0860-4bea-bef4-b79b866c0261 


Synthesis and Processing 
of the rRNA Precursor 
Eukaryotic ribosomes have four distinct ribosomal RNAs, three in 


the large subunit and one in the small subunit. In humans, the large 
subunit contains a 288, 5.88, and 5S RNA molecule, and the small 
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28S, 18S, and 5.88) are carved by various nucleases from a single 
primary transcript (called the pre-rRNA) which is synthesized by 
an enzyme called RNA polymerase I, which is specialized for 
transcribing the pre-rRNA. The 5S rRNA is synthesized from a sepa- 
rate RNA precursor outside the nucleolus by a different enzyme 
called RNA polymerase III. We will begin our discussion of rRNA 
processing with the pre-rRNA. 

Two of the peculiarities of the pre-rRNA, as compared with 
other RNA transcripts, are the large number of methylated nucleo- 
tides and pseudouridine residues. By the time a human pre-rRNA 
precursor is first cleaved, over 100 methyl groups have been added to 
ribose groups in the molecule, and approximately 95 of its uridine 
residues have been enzymatically converted to pseudouridine (see 
Figure 11.13a). All of these modifications occur after the nucleotides 
are incorporated into the nascent RNA, that is, posttranscriptionally. 
The altered nucleotides are located at specific positions and are clus- 
tered within portions of the molecule that have been conserved 
among organisms. All of the nucleotides in the pre-rRNA that are 


The S value (or Svedberg unit) refers to the sedimentation coefficient of the RNA; 
The larger the number, the more rapidly the molecule moves through a field of force 
during centrifugation, and (for a group of chemically similar molecules) the larger the 
size of the molecule. The 28S, 18S, 5.88, and 5S RNAs consist of nucleotide lengths of 
about 5000, 2000, 160, and 120 nucleotides, respectively. 
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altered remain as part of the final products, while unaltered sections 
are discarded during processing. The functions of the methyl groups 
and pseudouridines are unclear. These modified nucleotides may 
protect parts of the pre-rRNA from enzymatic cleavage, promote 
folding of rRNAs into their final three-dimensional structures, and/ 
or promote interactions of rRNAs with other molecules. 

Because rRNAs are so heavily methylated, their synthesis can be 
followed by incubating cells with radioactively labeled methionine, a 
compound that serves in most cells as a methyl group donor. The 
methyl group is transferred enzymatically from methionine to 
nucleotides in the pre-rRNA. When ['*C] methionine is provided to 
cultured mammalian cells for a short period of time, a considerable 
fraction of the incorporated radioactivity is present in a 45S RNA 
molecule, equal to a length of about 13,000 nucleotides. The 45S 
RNA is cleaved into smaller molecules that are then trimmed down 
to the 28S, 18S, and 5.88 rRNA molecules. The combined length of 
the three mature rRNAs is approximately 7000 nucleotides, or a little 
more than half of the primary transcript. 

Some of the steps in the processing pathway from the 45S pre- 
rRNA to the mature rRNAs can be followed by incubating mamma- 
lian cells very briefly with labeled methionine and then chasing the 
cells in nonlabeled medium for varying lengths of time (FIGURE 11.11). 
(This type of “pulse-chase” experiment was discussed on page 260). 
As noted above, the first species to become labeled in this type of 
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FIGURE 11.11 Kinetic analysis of rRNA synthesis and processing. A culture of mammalian cells was incubated for 10 minutes in ['*C]methionine and 
was then chased in unlabeled medium for various times as indicated in each panel. After the chase, cells were washed free of isotope and homogenized, 
and nucleolar and cytoplasmic fractions were prepared. The RNA was extracted from each fraction and analyzed by sucrose density-gradient sedimentation. 
As discussed in Section 18.10, this technique separates RNAs according to size (the larger the size, the closer the RNA is to the bottom of the tube, which 
corresponds to fraction 1). The continuous blue line represents the UV absorbance of each cellular fraction, which provides a measure of the amount of RNA 
of each size class. This absorbance profile does not change with time. The solid red line shows the radioactivity at various times during the chase. The 
graphs of nucleolar RNA (upper profiles) show the synthesis of the 45S rRNA precursor and its subsequent conversion to a 32S molecule, which is a 
precursor to the 28S and 5.8S rRNAs. The other major product of the 45S precursor leaves the nucleus very rapidly and, therefore, does not appear 
prominently in the nucleolar RNA. The lower profiles show the time course of the appearance of the mature rRNA molecules in the cytoplasm. The 18S rRNA 
appears in the cytoplasm well in advance of the larger 28S species, which correlates with the rapid exodus of the former from the nucleolus. 


Source: From H. Greenberg and S. Penman, J. Mol. Biol. 21:531, 1966; Copyright 1966, by permission of the publisher Academic Press. 


experiment is the 45S primary transcript, which is seen as a peak 
of radioactivity (red line) in the nucleolar RNA fraction after 
10 minutes. After about an hour, the 45S RNA has disappeared 
from the nucleolus and is largely replaced by a 32S RNA, which is 
one of the two major products produced from the 45S primary tran- 
script. The 32S RNA is seen as a distinct peak in the nucleolar frac- 
tion from 40 minutes to 150 minutes. The 32S RNA is a precursor to 
the mature 28S and 5.88 rRNAs. The other major product of the 45S 
pre-rRNA leaves the nucleolus quite rapidly and appears in the cyto- 
plasm as the mature 18S rRNA (seen in the 40-minute cytoplasmic 
fraction). After a period of two or more hours, nearly all of the radio- 
activity has left the nucleolus, and most has accumulated in the 28S 
and 18S rRNAs of the cytoplasm. The radioactivity in the 45 RNA 
peak includes the 5.8S rRNA as well as methyl groups that have been 
transferred to small tRNA molecules. FIGURE 11.12 shows a likely 
pathway in the processing of an rRNA primary transcript. 


The Role of snoRNAs in the 
Processing of Pre-rRNA 


The processing of the pre-rRNA is accomplished with the help of a 
large number of small, nucleolar RNAs (or snoRNAs) that are 
packaged with particular proteins to form particles called small, 
nucleolar ribonucleoproteins (or snoRNPs). Electron micro- 
graphs indicate that snoRNPs begin to associate with the rRNA pre- 
cursor before it is fully transcribed. The first RNP particle to attach 
to a pre-rRNA transcript contains the U3 snoRNA and more 
than two dozen different proteins. This huge component of the 
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FIGURE 11.12 A proposed scheme for the processing of mammalian 
ribosomal RNA. The primary transcript for the rRNAs is a 45S molecule of 
about 13,000 bases. The principal cleavage events during processing of 
this pre-rRNA are indicated by the boxed numbers. Cleavage of the 
primary transcript at sites 1 and 5 removes the 5’ and 3’ external tran- 
scribed sequences and produces a 41S intermediate. The second cleav- 
ages can occur at either site 2 or 3 depending on the type of cell. Cleavage 
at site 3 generates the 32S intermediate seen in curves of the previous 
figure. During the final processing steps, the 28S and 5.8S sections are 
separated from one another, and the ends of the various intermediates are 
trimmed down to their final mature size. 

Source: After R. P. Perry, J. Cell Biol. 91:29s, 1981; by Copyright permission 
of the Rockefeller University Press. 


rRNA-processing machinery catalyzes the removal of the 5’ end of 415 


the transcript (Figure 11.12). Some of the other enzymatic cleavages 
indicated in Figure 11.12 are thought to be catalyzed by the “exo- 
some,’ which is an RNA degrading machine that consists of nearly a 
dozen different exonucleases. 

U3 and several other snoRNAs were identified years ago 
because they are present in large quantities (about 10° copies per cell). 
Subsequently, a different class of snoRNAs present at lower concentra- 
tion (about 10* copies per cell) was discovered. These low-abundance 
snoRNas can be divided into two groups based on their function and 
similarities in nucleotide sequence. Members of one group (called the 
box C/D snoRNAs) determine which nucleotides in the pre-rRNA will 
have their ribose moieties methylated, whereas members of the other 
group (called the box H/ACA snoRNAs) determine which uridines are 
converted to pseudouridines. The structures of nucleotides modified 
by these two reactions are shown in FIGURE 11.13a. 

Both groups of snoRNAs contain relatively long stretches (10 to 
21 nucleotides) that are complementary to sections of the rRNA tran- 
script. These snoRNAs provide an excellent example of the principle 
that single-stranded nucleic acids having complementary nucleotide 
sequences are capable of forming double-stranded hybrids. In this 
case, each snoRNA binds to a specific portion of the pre-rRNA to 
form an RNA-RNA duplex. The bound snoRNA then guides an 
enzyme—either a methylase or a pseudouridylase—within the 
snoRNP to modify a particular nucleotide in the pre-rRNA. Taken 
together, there are roughly 200 different snoRNAs, one for each site in 
the pre-rRNA that is ribose-methylated or pseudouridylated. If the 
gene encoding one of these snoRNAs is deleted, one of the nucleotides 
of the pre-rRNA fails to be enzymatically modified. The mechanism 
of action of the two types of snoRNAs is illustrated in Figure 11.130,c. 

The nucleolus is the site not only of rRNA processing, but also 
of assembly of the two ribosomal subunits. More than 200 different 
proteins have been found to be associated with the rRNAs at differ- 
ent stages of ribosome assembly. These proteins can be divided into 
two groups: ribosomal proteins that remain in the subunits, and 
accessory proteins that have a transient interaction with the rRNA 
intermediates and are required only for processing. Among this lat- 
ter group are more than a dozen RNA helicases, which are enzymes 
that unwind regions of double-stranded RNA. These enzymes are 
presumably involved in the many structural rearrangements that 
occur during ribosome formation, including the dissociation of 
snoRNAs from pre-ribosomal particles. 


Synthesis and Processing of the 5S rRNA 


A5S rRNA, about 120 nucleotides long, is present as part of the large 
ribosomal subunit of both prokaryotes and eukaryotes. In eukary- 
otes, the 5S rRNA molecules are encoded by a large number of iden- 
tical genes that are separate from the other rRNA genes and are 
located outside the nucleolus. Following its synthesis, the 5S rRNA is 
transported to the nucleolus to join the other components involved 
in the assembly of ribosomal subunits. 

The 5S rRNA genes are transcribed by RNA polymerase II]. 
RNA polymerase III is unusual among the three polymerases in that 
it can bind to a promoter site located within the transcribed portion 
of the target gene.” The internal position of the promoter was first 
clearly demonstrated by introducing modified 5S rRNA genes into 


°RNA polymerase III transcribes several different RNAs. It binds to an internal pro- 
moter when transcribing a pre-5S RNA or pre-tRNA, but binds to an upstream 
promoter when transcribing the precursors for several others, including U6 snRNA. 
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FIGURE 11.13 Modifying the pre-rRNA. (a) The most frequent 
modifications to nucleotides in a pre-rRNA are the conversion (isomeriza- 
tion) of a uridine to a pseudouridine and the methylation of a ribose at the 
2’ site of the sugar. To convert uridine to pseudouridine, the N1I—C1’ bond 
is cleaved, and the uracil ring is rotated through 120°, which brings the C5 
of the ring into place to form the new bond with C1’ of ribose. These 
chemical modifications are catalyzed by a protein component of the 
snoRNPs called dyskerin. (6) The formation of an RNA-RNA duplex 
between the U20 snoRNA and a portion of the pre-rRNA that leads to 2’ 
ribose methylation. In each case, the methylated nucleotide in the rRNA is 
hydrogen bonded to a nucleotide of the snoRNA that is located five base 
pairs from box D. Box D, which contains the invariable sequence CUGA, is 
present in all snoRNAs that guide ribose methylation. (c) The formation of 
an RNA-RNA duplex between Ué8 snoRNA and a portion of the pre-rRNA 
that leads to conversion of a uridine to pseudouridine (y). Pseudouridylation 
occurs at a fixed site relative to a hairpin fold in the snoRNA. The snoRNAs 
that guide pseudouridylation have a common ACA sequence. 

Source: (b) After J.-P. Bachellerie and J. Cavaillé, Trends in Bioch. Sci. 
22:258, 1997; Copyright 1997, with permission from Elsevier Science; 

(c) After P. Ganot et al., Cell 89:802, 1997. 


host cells and determining the ability of that DNA to serve as tem- 
plates for the host polymerase IIL. It was found that the entire 5’ flank- 
ing region could be removed and the polymerase would still transcribe 
the DNA starting at the normal initiation site. If, however, the dele- 
tion included the central part of the gene, the polymerase would not 
transcribe the DNA or even bind to it. If the internal promoter from 
a 5S rRNA gene is inserted into another region of the genome, the 
new site becomes a template for transcription by RNA polymerase III. 


Transfer RNAs 


Plant and animal cells are estimated to have approximately 50 differ- 
ent species of transfer RNA, each encoded by a repeated DNA 
sequence. The degree of repetition varies with the organism: Yeast 
cells have a total of about 275 tRNA genes, fruit flies about 850, and 
humans about 1300. Transfer RNAs are synthesized from genes that 
are found in small clusters scattered around the genome. A single 
cluster typically contains multiple copies of different tRNA genes, 
and conversely, the DNA sequence encoding a given tRNA is typi- 
cally found in more than one cluster. The DNA within a cluster 
(tDNA) consists largely of nontranscribed spacer sequences with the 
tRNA coding sequences situated at irregular intervals in a tandemly 
repeated arrangement (FIGURE 11.14). 

Like the 5S rRNA, tRNAs are transcribed by RNA polymerase 
III and the promoter sequence lies within the coding section of the 
gene rather than being located at its 5’ flank. The primary transcript 
of a transfer RNA molecule is larger than the final product, and 
pieces on both the 5’ and 3’ sides of the precursor tRNA (and a small 
interior piece in some cases) must be trimmed away. In addition, 
numerous bases must be modified (see Figure 11.40). One of the 
enzymes involved in pre-tRNA processing is an endonuclease called 
ribonuclease P, which is present in both bacterial and eukaryotic cells 
and consists of both RNA and protein subunits. It is the RNA subunit 
of ribonuclease P that catalyzes cleavage of the pre-tRNA substrate, a 
subject discussed in the Experimental Pathways on page 451. 
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1. Describe the differences between a primary 
transcript, a transcription unit, a transcribed spacer, 
and a mature rRNA. 


2. Draw a representation of an electron micrograph of 
rDNA being transcribed. Label the nontranscribed 
spacer, the transcribed spacer, the RNA polymerase 
molecules, the U3 snRNP, and the promoter. 


3. Compare the organization of the genes that code for 
the large rRNAs and the tRNAs within the vertebrate 
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FIGURE 11.14 The arrangement of genes that code for transfer RNAs 
in Xenopus. A 3.18-kilobase fragment of DNA, showing the arrangement 
of various tRNA genes and spacers. 

Source: From S. G. Clarkson et al., in D. D. Brown, ed., Developmental 
Biology Using Purified Genes, Academic Press, 1981. 


11.5 Synthesis and Structure 
of Eukaryotic Messenger RNAs 


The first step in expression of DNA as a protein is to synthesize mes- 
senger RNA that can carry the genetic instructions from the nucleus 
to the cytoplasm. Transcription of RNA begins when RNA polymer- 
ase binds to the DNA at a region called the promoter. The process is 
facilitated by a group of proteins called general transcription factors. 


The Formation of Heterogeneous 
Nuclear RNA (hnRNA) 


When eukaryotic cells are incubated for a short period in [*H]uri- 
dine or [*’P]phosphate and immediately killed, most of the radioac- 
tivity is incorporated into RNA molecules that have the following 
properties: (1) they have large molecular weights (up to about 80S, 
or 50,000 nucleotides); (2) as a group, they are represented by RNAs 
of diverse (heterogeneous) nucleotide sequence; and (3) they are 
found only in the nucleus. Because of these properties, these RNAs 
are referred to as heterogeneous nuclear RNAs (hnRNAs), and 
they are indicated by the red radioactivity line in FIGURE 11.15a. 
When cells that have been incubated in [*H]Juridine or [*?P]phos- 
phate for a brief pulse are placed into unlabeled medium, chased for 
several hours before they are killed and the RNA extracted, the 
amount of radioactivity in the large nuclear RNAs drops sharply and 
appears instead in much smaller mRNAs found in the cytoplasm 
(red line of Figure 11.15). These early experiments, begun initially 
by James Darnell, Jr., Klaus Scherrer and colleagues, suggested that 
the large, rapidly labeled hnRNAs were primarily precursors to the 
smaller cytoplasmic mRNAs. This interpretation has been fully sub- 
stantiated by a large body of research. 

It is important to note that the blue and red lines in Figure 11.15 
follow a very different course. The blue lines, which indicate the optical 
density (i.e., the absorbance of UV light) of each fraction, provide infor- 
mation about the amount of RNA in each fraction following centrifu- 
gation. It is evident from the blue lines that most of the RNA in the cell 
is present as 18S and 28S rRNA (along with various small RNAs that 
stay near the top of the tube). The red lines, which indicate the radioac- 
tivity in each fraction, provide information about the number of radio- 
active nucleotides incorporated into different-sized RNAs during the 
brief pulse. It is evident from these graphs that neither the hnRNA 
(Figure 11.15a) nor the mRNA (Figure 11.15b) constitutes a significant 
fraction of the RNA in the cell. If they did, there would be a greater cor- 
respondence between the blue and red lines. Thus, even though the 
mRNAs (and their heterogeneous nuclear RNA precursors) constitute 
only a small percentage of the total RNA of most eukaryotic cells, they 
constitute a large percentage of the RNA that is being synthesized by 
that cell at any given moment (Figure 11.15a). The reason that there is 
little or no evidence of the hnRNAs and mRNAs in the optical density 
plots of Figure 11.15 is that these RNAs are degraded after relatively 
brief periods of time. This is particularly true of the hnRNAs, which are 
processed into mRNAs (or completely degraded) even as they are being 
synthesized. In contrast, the rRNAs and tRNAs have half-lives that are 
measured in days or weeks and thus gradually accumulate to become 
the predominant species in the cell. Some accumulation of radioactivity 
in the mature 28S and 18S rRNAs can be seen after a 3-hour chase 
(Figure 11.15b). The half-lives of mRNAs vary depending on the par- 
ticular species, ranging from about 15 minutes to a period of days. 
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FIGURE 11.15 The formation of heterogeneous nuclear RNA (hnRNA) 
and its conversion into smaller mRNAs. (a) Curves showing the sedimen- 
tation pattern of total RNA extracted from duck blood cells after exposure 
to [3*P]phosphate for 30 minutes. The larger the RNA, the farther it travels 
during centrifugation and the closer to the bottom of the tube it ultimately 
lies. The absorbance (blue line) indicates the total amount of RNA in 
different regions of the centrifuge tube, whereas the red line indicates the 
corresponding radioactivity. It is evident that most of the newly synthesized 
RNA is very large, much larger than the stable 18S and 28S rRNAs. These 
large RNAs are the hnRNAs. (b) The absorbance and radioactivity profiles 
of the RNA were extracted from cells that were pulse-labeled for 

30 minutes as in part a, but then chased for 3 hours in the presence of 
actinomycin D, which prevents the synthesis of additional RNA. It is evident 
that the large hnRNAs have been processed into smaller RNA products. 
Source: From G. Attardi et al., J. Mol. Biol. 20:160, 1966; Copyright 1966, 
by permission of the publisher Academic Press. 


The Machinery for mRNA Transcription 


All eukaryotic mRNA precursors are synthesized by RNA polymer- 
ase II, an enzyme composed of a dozen different subunits that is 
remarkably conserved from yeast to mammals. RNA polymerase II 
binds the promoter with the cooperation of a number of general 
transcription factors (GTFs) to form a preinitiation complex 
(PIC). These proteins are referred to as general transcription factors 
because the same ones are required for accurate initiation of tran- 
scription of a diverse array of genes in a wide variety of different 
organisms. The promoter elements that nucleate PIC assembly lie to 
the 5’ side of each transcription unit, although the enzyme makes 
contact with the DNA on both sides of the transcription start site 
(see FIGURE 11.16b). We will restrict the discussion to the best stud- 
ied promoters, which are those associated with highly expressed, tis- 
sue-specific genes such as the ovalbumin gene, which encodes the 
white of a chicken egg, and the globin genes, which encode the poly- 
peptides of hemoglobin. A critical portion of the promoter of such 
genes lies between 24 and 32 bases upstream from the site at which 
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TTGGG GGAGAGCAGGGCAGCTGCTGCTAACAC 


GAGCA GAGGTAGGATCAGTTGCTCCTCACATT 
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TFIID [ (RNAPII) — TFIIF 
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TFIIF 


Start site 


Preinitiation complex (PIC) 


(b) 


FIGURE 11.16 Initiation of transcription from a eukaryotic polymerase 
Il promoter. (a) Nucleotide sequence of the region just upstream from the 
site where transcription is initiated in three different eukaryotic genes. The 
TATA box is indicated by the blue shading. Many eukaryotic promoters 
contain a second, conserved core promoter element called the initiator 
(Inr), which includes the site where transcription is initiated (shown in 
orange). Other promoter elements are depicted in Figure 12.45. It should 
be noted that (1) most eukaryotic promoters lack a recognizable TATA box 
and (2) numerous other promoter elements, for example, the DPE shown in 
part b, have been identified that lie downstream of the transcription start 
site. Different genes contain different combinations of promoter elements 
so that only a subset are required to nucleate PIC assembly. (b) A highly 
schematic model of the steps in the assembly of the preinitiation complex 
for RNA polymerase II. The polymerase itself is denoted RNAPII; the other 
components are the various general transcription factors required in 
assembly of the complete complex. TFIID includes the TBP subunit, which 
specifically binds to the TATA box, and a number of other subunits, which 
collectively are called TBP-associated factors (TAFs). TFIIB is thought to 
provide a binding site for RNA polymerase. TFIIF, which contains a subunit 
homologous to the bacterial o factor, is bound to the entering polymerase. 
TFIIH contains 10 subunits, 3 of which possess enzymatic activities. 


transcription is initiated (Figure 11.16a). This region often contains 
a consensus sequence that is either identical or very similar to the 
oligonucleotide 5’-TATAAA-3’ and is known as the TATA box. 

The first step in assembly of the preinitiation complex is bind- 
ing of a protein, called TATA-binding protein (TBP), that recognizes 
the TATA box of these promoters (Figure 11.16b). Thus, as in bacte- 
rial cells, a purified eukaryotic polymerase is not able to recognize a 
promoter directly and cannot initiate accurate transcription on its 
own. TBP is present as a subunit of a much larger protein complex 
called TFIID.* X-ray crystallography has revealed that binding of 
TBP to a polymerase II promoter causes a dramatic distortion in 
conformation of the DNA. As shown in FIGURE 11.17a, TBP inserts 
itself into the minor groove of the double helix, bending the DNA 
molecule more than 80° at the site of DNA-protein interaction. 
While TBP binds TATA, other subunits of the TFIID complex bind 
to other regions of the DNA, including elements that lie downstream 
of the transcription start site. 

Binding of TFIID sets the stage for the assembly of the complete 
preinitiation complex, which is thought to occur in a stepwise man- 
ner as depicted in Figure 11.16b. Interaction of three of the GTFs 
(TBP of TFHD, TFIIA, and TFIIB) with DNA is shown in Figure 
11.17a. The presence of these three GTFs bound to the promoter 
provides a platform for the subsequent binding of the huge, multi- 
subunit RNA polymerase with its attached TFIIF (Figure 11.160). 
Once the RNA polymerase-TFIIF is in position, another pair of 
GTFs (TFIIE and TFIIH) join the complex and transform the poly- 
merase into an active, transcribing machine. A three-dimensional 
model of the preinitiation complex is shown in Figure 11.17). 

TFIIH is the only GTF known to possess enzymatic activities. 
One of the subunits of TFIJH functions as a protein kinase to phos- 
phorylate RNA polymerase (discussed below), whereas two other 
subunits of this protein complex function as DNA unwinding 
enzymes (helicases). DNA helicase activity is required to separate the 
DNA strands of the promoter, allowing the template strand to find its 
way into the active site of the polymerase. Once transcription begins, 
certain of the GTFs (including TFIID) may be left behind at the pro- 
moter, while others are released from the complex (FIGURE 11.18). 
As long as TFIID remains bound to the promoter, additional RNA 
polymerase molecules may be able to attach to the promoter site and 
rapidly initiate additional rounds of transcription. 

The carboxyl-terminal domain (CTD) of the largest subunit of 
RNA polymerase II has an unusual structure; it consists of a sequence 
of seven amino acids (-Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7-) that 
is repeated over and over. In humans, the CTD consists of 52 repeats 
of this heptapeptide. All seven residues of the heptapeptide can be 
enzymatically modified in one way or another; we will limit the dis- 
cussion to serines two and five, which are prime candidates for phos- 
phorylation by protein kinases. The RNA polymerase that assembles 
into the preinitiation complex is not phosphorylated, whereas the same 
enzyme engaged in transcription is heavily phosphorylated; all of 
the added phosphate groups are localized in the CTD (Figure 11.18). 
CTD phosphorylation can be catalyzed by at least four different pro- 
tein kinases, including TFIIH, which phosphorylates the serine resi- 
dues at position #5. Phosphorylation of the polymerase by TFITH 


ST BP is actually a universal transcription factor that mediates the binding of all three 
eukaryotic RNA polymerases. TBP is present as one of the subunits of three different 
protein complexes. As a subunit of TFIID, TBP promotes the binding of RNA polymer- 
ase II. As subunits of SL1 or TFIIIB, TBP promotes the binding of RNA polymerases I 
and III, respectively. Polymerase I and III promoters lack a TATA box, as do many poly- 
merase II promoters, yet all of these regions of the DNA bind TBP. 


TFIIB 


(a) (b) 


FIGURE 11.17 Structural models of the formation of the preinitiation complex. (a) Model of the complex formed by DNA and three of the GTFs, 
TBP of TFIID, TFIIA, and TFIIB. Interaction between the TATA box and TBP bends the DNA approximately 80° and allows TFIIB to bind to the DNA 
both upstream and downstream of the TATA box. (b) Top and bottom view of a model of the preinitiation complex. Unlike the schematic model of 
Figure 11.166, DNA (shown in white) is thought to wrap around the preinitiation complex so that GTFs can contact the DNA on both sides of the 


transcription start site. 


Source: (a) From Gourisankar Ghosh and Gregory D. Van Duyne, Structure 4:891, 1996, Fig. 2, with permission from Elsevier; (b) From M. Douziech et al., 
Mol. Cell Biol. 20:8175, 2000, Fig. 7a. Courtesy of Benoit Coulombe. Reproduced with permission from American Society for Microbiology. 


may act as the trigger that uncouples the enzyme from the GTFs 
and/or the promoter DNA, allowing the enzyme to escape from the 
preinitiation complex and move down the DNA template. At the 
earliest stages of transcription, the CTD is phosphorylated on 
the Ser5 positions, which provide binding sites for proteins involved 
in the earliest stages of mRNA processing, such as 5’ cap formation 
(see Figure 11.32). As the RNA polymerase moves along the gene 
being transcribed, another kinase (P-TEFb) phosphorylates the CTD 
on the serine residues at position #2 (as in Figure 11.32). This change 
in phosphorylation pattern is thought to facilitate the recruitment of 


TFIID TFIIE 
TFIIF 


Start site 


P-TEFb 


FIGURE 11.18 Initiation of transcription by RNA polymerase II is 
associated with phosphorylation of the C-terminal domain (CTD). 

The initiation of transcription is associated with TFIIH-catalyzed 
phosphorylation of serine residues at the 5 position of each heptad repeat 
of the CTD. Phosphorylation is thought to provide the trigger for the 
separation of the transcriptional machinery from the general transcription 
factors and/or the promoter DNA. TFIIS, ELL, and P-TEFb are three of a 
number of elongation factors that may become associated with the 
polymerase as it moves along the DNA. TFIIS helps the polymerase get 
moving again after it pauses, whereas P-TEFb is a kinase that 
phosphorylates the Ser2 residues of the CTD after elongation begins. 
Phosphorylation of Ser2 residues is thought to promote the recruitment of 
RNA splicing and polyadenylation factors, whose activities are discussed in 
the following sections (see Figure 11.32). 


additional protein factors involved in RNA splicing and addition of 
a poly(A) tail, as disucussed below. In this way, the CTD acts as a 
platform for the dynamic gain and loss of factors required for the 
formation of a mature mRNA. According to some estimates, an elon- 
gating RNA polymerase II may contain over 50 components and con- 
stitute a total mass of more than 3 million daltons. 

Termination of transcription by RNA polymerase II is not well 
understood. There is no evidence that the DNA of protein-coding 
genes contains well-defined termination sequences as in bacterial 
cells. In fact, a transcribing RNA polymerase II can travel a variable 
and extensive distance past the point that will ultimately give rise to 
the 3’ terminus of the processed mRNA. Thus, unlike bacteria, 
where the 3’ end of the mRNA is generated simply by transcription 
termination, formation of the 3’ end of a eukaryotic mRNA is deter- 
mined by a separate series of processing steps (page 423). 

Together, RNA polymerase II and its GTFs are sufficient to pro- 
mote a low, basal level of transcription from most promoters under 
in vitro conditions. As will be discussed at length in Chapter 12, a 
variety of specific transcription factors are able to bind at numerous 
sites in the regulatory regions of the DNA. These specific transcription 
factors can determine (1) whether a preinitiation complex assembles 
at a particular promoter and/or (2) the rate at which the polymerase 
initiates new rounds of transcription from that promoter. Before dis- 
cussing the pathway by which mRNAs are produced, let us first 
describe the structure of mRNAs so that the reasons for some of the 
processing steps will be clear. 


The Structure of mRNAs 


Messenger RNAs share certain properties: 


1. They contain a continuous sequence of nucleotides encoding a 
specific polypeptide. 

2. They are found in the cytoplasm. 

3. They are attached to ribosomes when they are translated. 


4. Most mRNAs contain a significant noncoding segment, that is, a 
portion that does not direct the assembly of amino acids. For 
example, approximately 25 percent of each globin mRNA consists 
of noncoding, nontranslated regions (FIGURE 11.19). Noncoding 
portions are found at both the 5’ and 3’ ends of a messenger RNA 
and contain sequences that have important regulatory roles 
(Section 12.19). 
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FIGURE 11.19 Structure of the human B-globin mRNA. The mRNA 
contains a 5’ methylguanosine cap, a 5’ and 3’ noncoding region that flanks 
the coding segment, and a 3’ poly(A) tail. The lengths of each segment are 
given in numbers of nucleotides. The length of the poly(A) tail is variable. It 
typically begins at a length of about 250 nucleotides and is gradually reduced 
in length, as shown in Figure 12.62. The structure of the 5’ cap is shown. 


5. Eukaryotic mRNAs have special modifications at their 5’ and 3’ 
termini that are not found on either bacterial mRNAs or on 
tRNAs or rRNAs. The 3’ end of nearly all eukaryotic mRNAs has 
a string of 50 to 250 adenosine residues that form a poly(A) tail, 
whereas the 5’ end has a methylated guanosine cap (Figure 11.19). 


We will return shortly to describe how mRNAs are provided with 
their specialized 5’ and 3’ termini. First, however, it is necessary to 
take a short detour to understand how mRNAs are formed in the cell. 


REVIEW | 


1. What is the relationship between hnRNAs and 


mRNAs? How was this relationship uncovered? 
oe 


11.6 Split Genes: An 
Unexpected Finding 


Almost as soon as hnRNAs were discovered, it was proposed that 
this group of rapidly labeled nuclear RNAs were precursors to cyto- 
plasmic mRNAs (page 417). The major sticking point was the differ- 
ence in size between the two RNA populations: The hnRNAs were 
several times the size of the mRNAs (FIGURE 11.20). Why would 
cells synthesize huge molecules that were precursors to much smaller 
versions? Early studies on the processing of ribosomal RNA had 


shown that mature RNAs were carved from larger precursors. Recall 
that large segments were removed from both the 5’ and 3’ sides of 
various rRNA intermediates (Figure 11.12) to yield the final, mature 
rRNA products. It was thought that a similar pathway might account 
for the processing of hnRNAs to mRNAs. But mRNAs constitute 
such a diverse population that it was impossible to follow the steps in 
the processing of a single mRNA species as had been attempted for 
the rRNAs. The problem was solved by an unexpected discovery. 

Until 1977, molecular biologists assumed that a continuous lin- 
ear sequence of nucleotides in a messenger RNA is complementary 
to a continuous sequence of nucleotides in one strand of the DNA of 
a gene. Then in that year, a remarkable new finding was made by 
Phillip Sharp, Susan Berget, and their colleagues at MIT and Richard 
Roberts, Louise Chow, and their colleagues at the Cold Spring 
Harbor Laboratories in New York. These groups found that the 
mRNAs they were studying were transcribed from segments of DNA 
that were separated from one another along the template strand. 

The first observations of major importance were made during 
the analysis of transcription of the adenovirus genome. Adenovirus is 
a pathogen capable of infecting a variety of mammalian cells. It was 
found that a number of different adenovirus mRNAs had the same 
150- to 200-nucleotide 5’ terminus. One might expect that this leader 
sequence represents a repeated stretch of nucleotides located near the 
promoter region of each of the genes for these mRNAs. However, fur- 
ther analysis revealed that the 5’ leader sequence is not complemen- 
tary toa repeated sequence and, moreover, is not even complementary 
to a continuous stretch of nucleotides in the template DNA. Instead, 
the leader is transcribed from three distinct and separate segments of 
DNA (represented by blocks x, y, and z in the top line of FIGURE 11.21). 
The regions of DNA between these blocks, referred to as interven- 
ing sequences (I, to I; in Figure 11.21), are somehow missing in the 
corresponding mRNA. It could have been argued that the presence of 
intervening sequences is a peculiarity of viral genomes, but the basic 
observation was soon extended to cellular genes themselves. 

The presence of intervening sequences in nonviral, cellular 
genes was first reported in 1977 by Alec Jeffreys and Richard Flavell 
in The Netherlands and Pierre Chambon in France. Jeffreys and 
Flavell discovered an intervening sequence of approximately 600 
bases located directly within a part of the globin gene that coded for 
the amino acid sequence of the globin polypeptide (FIGURE 11.22). 
The basis for this finding is discussed in the legend accompanying 
the figure. Intervening sequences were soon found in other genes, 
and it became apparent that the presence of genes with intervening 
sequences—called split genes—is the rule not the exception. 
Those parts of a split gene that contribute to the mature RNA prod- 
uct are called exons, whereas the intervening sequences are called 
introns. Split genes are widespread among eukaryotes, although the 
introns of simpler eukaryotes (e.g., yeast and nematodes) tend to be 
fewer in number and smaller in size than those of more complex 
plants and animals. Introns are found in all types of genes, including 
those that encode tRNAs, rRNAs, as well as mRNAs. 

The discovery of genes with introns immediately raised the ques- 
tion as to how such genes were able to produce messenger RNAs lacking 
these sequences. One likely possibility was that cells produce a primary 
transcript that corresponds to the entire transcription unit, and that 
those portions of the RNA corresponding to the introns in the DNA are 
somehow removed. If this were the case, then the segments correspond- 
ing to the introns should be present in the primary transcript. Such an 
explanation would also provide a reason why hnRNA molecules are so 
much larger than the mRNA molecules they ultimately produce. 
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FIGURE 11.20 The difference in size between hnRNAs and mRNAs. 

(a-b) Electron micrographs of metal-shadowed preparations of poly(A)-mRNA (a) and 
poly(A)-hnRNA (6) molecules. Representative size classes of each type are shown. 
Reference molecule is @ X-174 viral DNA. (c) Size distribution of hnRNA and mRNA 
from mouse L cells as determined by density-gradient sedimentation. The red line 
represents rapidly labeled hnRNA, whereas the blue line represents mRNA that was 
isolated from polyribosomes after a 4-hour labeling period. The abscissa has been 
converted from fraction number (indicated by the points) to molecular size by 


Research on nuclear RNA had proceeded by this time to the 
point where the size of a few mRNA precursors (pre-mRNAs) had 
been determined. The globin sequence, for example, was found to be 
present within a nuclear RNA molecule that sediments at 15S, unlike 
the final globin mRNA, which has a sedimentation coefficient of 10S. 
An ingenious technique (known as R-loop formation) was employed 
by Shirley Tilghman, Philip Leder, and their co-workers at the 
National Institutes of Health to determine the physical relationship 
between the 15S and 10S globin RNAs and provide information on 
the transcription of split genes. 


calibration of the gradients. 


Source: (a-b) From John A. Bantle and William E. Hahn, Cell 8:145, 1976. Reproduced 
with permission from Elsevier. 


Recall from page 384 that single-stranded, complementary DNA 
strands can bind specifically to one another. Single-stranded DNA 
and RNA molecules can also bind to one another as long as their 
nucleotide sequences are complementary; this is the basis of the tech- 
nique of DNA-RNA hybridization discussed in Section 18.18 (the 
DNA-RNA complex is called a hybrid). Tilghman and her co-workers 
used the electron microscope to examine a fragment of DNA contain- 
ing the globin gene that had been hybridized to the 15S globin RNA. 
The hybrid was seen to consist of a continuous, double-stranded, 
DNA-RNA complex (the red dotted and blue lines of the inset to 
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Primary 5'ppp 3' 
transcript | 
X y z Hexon 
Processing § /mGppp =e seneee 
intermediate polyA FIGURE 11.21 The discovery of intervening sequences (introns). 
A portion of the adenovirus genome is shown at the top. The 
noncontinuous sequence blocks labeled x, y, and z appear in a 
lo continuous arrangement in the mature mRNAs that code for a variety 
[5 of polypeptides, such as the hexon protein. As discussed later in the 
text, the conversion of the primary transcript to the mRNA involves the 
removal (excision) of the intervening sequences, or introns (I; to |3), 
xyz Hexon and ligation of the remaining portions to produce a continuous 
Mature mRNA 7mGppp kX Ut anaes RNA molecule (bottom). The steps by which this occurs are shown 
poly A in Figure 11.30. 


422) FIGURE 11.22 The discovery of introns in a eukaryotic gene. As discussed in 


LL YaLd VHD 


UI91O1g 01 VNY OF VNG :ewbog je11UuaD eu e 


Chapter 18, bacteria contain restriction enzymes that recognize and cleave DNA 
molecules at the site of certain nucleotide sequences. The drawing shows a map of 
restriction enzyme cleavage sites in the region of the rabbit B-globin gene (upper) and 
the corresponding map of a cDNA prepared from the B-globin mRNA (lower). (A cDNA 
is a DNA made in vitro by reverse transcriptase using the mRNA as a template. Thus, 
the cDNA has the complementary sequence to the mRNA. cDNAs had to be used 

for this experiment because restriction enzymes don't cleave RNAs.) The letters 
indicate the sites at which various restriction enzymes cleave the two DNAs. The upper 
map shows that the globin gene contains a restriction site for the enzyme BamH1 (B) 
located approximately 700 base pairs from a restriction site for the enzyme EcoR1 (E). 
When the globin cDNA was treated with these same enzymes (lower map), the 
corresponding B and E sites were located only 67 nucleotides apart. It is evident that 
the DNA prepared from the genome has a sizable region that is absent from the 
corresponding cDNA (and thus absent from the mRNA from which the cDNA was 
produced). Complete sequencing of the globin gene later showed it to contain a 
second smaller intron. 


Source: From A. J. Jeffreys and R. A. Flavell, Cell 12:1108, 1977; by permission 
from Cell Press. 
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FIGURE 11.23 Visualizing an intron in the globin gene. 
Electron micrographs of hybrids formed between (a) 15S 
globin precursor RNA and the DNA of a globin gene and 


Tite dt 


(b) 10S globin mRNA and the same DNA as in a. The red 
dotted lines in the insets indicate the positions of the RNA 
molecules. The precursor mRNA is equivalent in length and 
sequence to the DNA of the globin gene, but the 10S 
mRNA is missing a portion that is present in the DNA of 
the gene. These results suggest that the 15S RNA is 
processed by removing an internal RNA sequence and 
rejoining the flanking regions. 

Source: (a—b) From Shirley M. Tilghman et al., Proc. Nat'l. 
Acad. Sci. U.S.A. 75:1312, 1978. 


ta 


rape Fp 
Pi rte 


se 
Bs 
2a 
ft 


Ee 


F 


FIGURE 11.24 Visualizing introns in the ovalbumin gene. Electron micrograph of a hybrid formed between ovalbumin mRNA and a fragment of 
genomic chicken DNA containing the ovalbumin gene. The hybrid shown in this micrograph is similar in nature to that of Figure 11.23b. In both cases, the 
DNA contains the entire gene sequence because it was isolated directly from the genome. In contrast, the RNA has been completely processed, and those 
portions that were transcribed from the introns have been removed. When the genomic DNA and the mRNA are hybridized, those portions of the DNA that 
are not represented in the mRNA are thrown into loops. The loops of the seven introns (A-G) can be distinguished. (Courtesy of Pierre Chambon.) 


Source: Courtesy of Pierre Chambon. 


FIGURE 11.23a). In contrast, when the same DNA fragment was incu- 
bated with the mature 10S globin mRNA, a large segment of the DNA 
in the center of the coding region was seen to bulge out to form a 
double-stranded loop (Figure 11.23b). The loop resulted from a large 
intron in the DNA that was not complementary to any part of the 
smaller globin message. It was apparent that the 15S RNA does indeed. 
contain segments corresponding to the introns of the genes that are 
removed during formation of the 10S mRNA. 

At about the same time, a similar type of hybridization experi- 
ment was performed between the DNA that codes for ovalbumin, a 
protein found in hen’s eggs, and its corresponding mRNA. The oval- 
bumin DNA and mRNA hybrid contains seven distinct loops cor- 
responding to seven introns (FIGURE 11.24). Taken together, the 
introns account for about three times as much DNA as that present 
in the eight combined coding portions (exons). Subsequent studies 
have revealed that individual exons average about 165 nucleotides. 
In contrast, individual introns average more than 3500 nucleotides, 
which is why hnRNA molecules are much longer than mRNAs. To 
cite two extreme cases, the human dystrophin gene extends for 
roughly 100 times the length needed to code for its corresponding 
message, and the type I collagen gene contains over 50 introns. The 
average human gene contains about nine introns making up more 
than 95 percent of the transcription unit. 

These and other findings provided strong evidence for the 
proposition that mRNA formation in eukaryotic cells occurs by the 
removal of internal sequences of ribonucleotides from a much larger 
pre-mRNA. Let us turn to the steps by which this occurs. 


REVIEW | 


1. What is an intron? How was the existence of introns 
and exons discovered? 


11.7 The Processing of 
Eukaryotic Messenger RNAs 


RNA polymerase II assembles a primary transcript that is comple- 
mentary to the DNA of the entire transcription unit. Electron micro- 
scopic examination of transcriptionally active genes indicates that 


RNA transcripts become associated with proteins and larger parti- 
cles while they are still in the process of being synthesized 
(FIGURE 11.25). These particles, which consist of proteins and ribo- 
nucleoproteins, include the agents responsible for converting the 
primary transcript into a mature messenger. This conversion process 
requires addition of a 5’ cap and 3’ poly(A) tail to the ends of the 
transcript, and removal of any intervening introns. Once processing 
is completed, the mRNP, which consists of mRNA and associated 
proteins, is ready for export from the nucleus. 


5’ Caps and 3’ Poly(A) Tails 


The 5’ ends of all RNAs initially possess a triphosphate derived from 
the first nucleoside triphosphate incorporated at the site of initiation 
of RNA synthesis. Once the 5’ end of an mRNA precursor has been 
synthesized, several enzyme activities act on this end of the molecule 
(FIGURE 11.26). In the first step, the last of the three phosphates is 
removed, converting the 5’ terminus to a diphosphate (step 1, 
Figure 11.26). Then, a GMP is added in an inverted orientation so 
that the 5’ end of the guanosine is facing the 5’ end of the RNA chain 
(step 2, Figure 11.26). Asa result, the first two nucleosides are joined by 
an unusual 5’—5’ triphosphate bridge. Finally, the terminal, inverted 
guanosine is methylated at the 7 position on its guanine base, while 
the nucleotide on the internal side of the triphosphate bridge is 
methylated at the 2’ position of the ribose (step 3, Figure 11.26). The 
5’ end of the RNA now contains a methylguanosine cap (shown 
in greater detail in Figure 11.19). These enzymatic modifications at 
the 5’ end of the primary transcript occur very quickly, while the 
RNA molecule is still in its very early stages of synthesis. In fact, 
the capping enzymes are recruited by the phosphorylated CTD of the 
polymerase (see Figure 11.32). The methylguanosine cap at the 5’ 
end of an mRNA serves several functions: it prevents the 5’ end of 
the mRNA from being digested by exonucleases, it aids in transport 
of the mRNA out of the nucleus, and it plays an important role in the 
initiation of mRNA translation. 

As noted above, the 3’ end of an mRNA contains a string of 
adenosine residues that forms a poly(A) tail. Asa number of mRNAs 
were sequenced, it became evident that the poly(A) tail typically 
begins 10 to 30 nucleotides downstream from the sequence AAUAAA. 
This sequence in the primary transcript serves as a recognition site 
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FIGURE 11.25 Pre-mRNA transcripts are processed 
as they are synthesized (i.e., cotranscriptionally). 

(a) Electron micrograph of a nonribosomal transcription 
unit showing the presence of ribonucleoprotein particles 
attached to the nascent RNA transcripts. (b) Interpretive 
tracing of the micrograph shown in part a. The dotted 
line represents the chromatin (DNA) strand, the solid 
lines represent ribonucleoprotein (RNP) fibrils, and solid 
circles represent RNP particles associated with the fibrils. 
Individual transcripts are numbered, beginning with 1, 
which is closest to the point of initiation. The RNP 
particles are not distributed randomly along the nascent 
transcript, but rather are bound at specific sites where 
RNA processing is taking place. 


Source: From Ann L. Beyer, Oscar L. Miller, Jr., and 
Steven L. Mcknight, Cell 20:78, 1980, reproduced with 
permission from Elsevier. 


(b) 


Primary transcript 


for the assembly of a large complex of proteins that carry out the SKI ™ = 


processing reactions at the 3’ end of the mRNA (Figure 11.26). aa ae 
The poly(A) processing complex is also physically associated with the 


phosphorylated CTD of RNA polymerase II as it synthesizes the / 
primary transcript (see Figure 11.32). of WKS "Np MN 
Included among the proteins of the processing complex is an ; 


endonuclease (Figure 11.26, top) that cleaves the pre-mRNA down- ‘ COTES es , ra. Sb sie 
stream from the recognition site. Following cleavage by the nuclease, RNA Triphosphatase 
an enzyme called poly(A) polymerase adds 250 or so adenosines 
without the need of a template (steps a—c, Figure 11.26). As dis- 
cussed in Section 12.6, the poly(A) tail together with an associated 
protein protects the mRNA from premature degradation by exonu- GTP * 
cleases. It is important to note that many genes possess more than 

one poly(A) recognition sequence in their 3’ noncoding segments. 

These genes can be transcribed into mRNAs whose 3’ noncoding 1) 
segment (3’ UTR) have different lengths and thus can be subject to 

different regulatory influences. 


FIGURE 11.26 Steps in the addition of a 5’ methylguanosine cap and a 3’ poly(A) tail to 
a pre-mRNA. The 5’ end of the nascent pre-mRNA binds to a capping enzyme, which in Or 
mammals has two active sites that catalyze different reactions: a triphosphatase that removes PP; 

: : : : 2) Guanylyltransferase 
the terminal phosphate group (step 1) and a guanylyltransferase that adds a guanine residue in 
a reverse orientation, by means of a 5’-to-5’ linkage (step 2). In step 3, different methyltrans- 
ferases add a methyl group to the terminal guanosine cap and to the ribose of the nucleotide RNA Methyltransferases 
that had been at the end of the nascent RNA. A protein complex (called CBC) binds to the Methyl 
completed cap (not shown). A very different series of events occurs at the 3’ end of the pre- 
mRNA, where a large protein complex is assembled. First, an endonuclease cleaves the 
primary RNA transcript, generating a new 3’ end upstream from the original 3’ terminus. In 
steps a—c, poly(A) polymerase adds adenosine residues to the 3’ end without the involvement 
of a DNA template. A typical mammalian mRNA contains 200 to 250 adenosine residues in its 
completed poly(A) tail; the number is considerably less in lower eukaryotes. 


Source: After D. A. Micklos and G. A. Freyer, DNA Science, Carolina Biological Supply Co. © Methyiguanosine cap .) Poly(A) tail 


RNA Splicing: Removal of Introns 
from a Pre-RNA 


The key steps in the processing of a pre-mRNA are shown in 
FIGURE 11.27. In addition to formation of the 5’ cap and poly(A) tail, 
which have already been discussed, those parts of a primary tran- 
script that correspond to the intervening DNA sequences (the introns) 
must be removed by a complex process known as RNA splicing. To 
splice an RNA, breaks in the strand must be introduced at the 5’ and 
3’ ends (the splice sites) of each intron, and the exons situated on 
either side of the splice sites must be covalently joined (ligated). It is 
imperative that the splicing process occur with absolute precision, 
because the addition or loss of a single nucleotide at any of the splice 
junctions would cause the resulting mRNA to be mistranslated. 

How does the same basic splicing machinery recognize the 
exon-intron boundaries in thousands of different pre-mRNAs? 
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FIGURE 11.27 Overview of the steps during the processing of the 
globin mRNA. Introns are shown in brown, whereas green portions of the 
gene indicate the positions of the exons, that is, the DNA sequences that 
are represented in the mature messenger RNA. 
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Examination of hundreds of junctions between exons and introns in 425 


eukaryotes ranging from yeast to insects to mammals revealed the 
presence in splice sites of a conserved nucleotide sequence of ancient 
evolutionary origin. The sequence most commonly found at the 
exon-intron borders within mammalian pre-mRNA molecules is 
shown in Figure 11.28. The G/GU at the 5’ end of the intron (the 5’ 
splice site), the AG/G at the 3’ end of the intron (the 3’ splice site), and 
the polypyrimidine tract near the 3’ splice site are present in the 
vast majority of eukaryotic pre-mRNAs.’ In addition, the adjacent 
regions of the intron contain preferred nucleotides, as indicated in 
FIGURE 11.28, which play an important role in splice site recognition. 
The sequences depicted in Figure 11.28 are necessary for splice-site 
recognition, but they are not sufficient. Introns typically run for 
thousands of nucleotides and often contain internal segments that 
match the consensus sequence shown in Figure 11.28—but the cell 
doesn't recognize them as splicing signals, and consequently ignores 
them. The additional clues that allow the splicing machinery to dis- 
tinguish between exons and introns are provided by specific 
sequences, most notably the exonic splicing enhancers, or ESEs, situ- 
ated within exons (see Figure 11.30, inset A). Changes in DNA 
sequence within either a splice site or an ESE can lead to the inclu- 
sion of an intron or the exclusion of an exon. It is estimated that 
approximately 15 percent of inherited human disease results directly 
from mutations that alter pre-mRNA splicing. In addition, much of 
the “normal” genetic variation in susceptibility to common diseases 
that is present in the human population (page 400) may result from 
the effects of this variation on RNA splicing efficiency. 

Understanding the mechanism of RNA splicing has come about 
through an appreciation of the remarkable capabilities of RNA mol- 
ecules. The first evidence that RNA molecules were capable of cata- 
lyzing chemical reactions was obtained in 1982 by Thomas Cech 
and colleagues at the University of Colorado. As discussed at length 
in the Experimental Pathways (Section 11.20), these researchers 
found that the ciliated protozoan Tetrahymena synthesized an rRNA 
precursor (a pre-rRNA) that was capable of splicing itself. In addi- 
tion to revealing the existence of RNA enzymes, or ribozymes, 
these experiments changed the thinking among biologists about the 
relative roles of RNA and protein in the mechanism of RNA 
splicing. 


“Approximately 1 percent of introns have AT and AC dinucleotides at their 5’ and 3’ 
ends (rather than GU and AG). These AT/AC introns are processed by a different type 
of spliceosome, called a U12 spliceosome, because it contains a U12 snRNA in place 
of the U2 snRNA of the major spliceosome. 
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FIGURE 11.28 Nucleotide sequences at the splice sites of pre-mRNAs. In addition to encoding the information to construct a polypeptide, a pre- 
mRNA must also contain information that directs the machinery responsible for RNA splicing. The nucleotide sequences shown in the regions of the splice 
sites are based on analysis of a large number of pre-mRNAs and are therefore referred to as consensus sequences. The bases shown in red are virtually 
invariant; those in black represent the preferred base at that position. N represents any of the four nucleotides; Y represents a pyrimidine. The polypyrimi- 
dine tract near the 3’ splice site typically contains between 10 and 20 pyrimidines. The branch point sequence shown is that found in human pre-mRNAs and 
is typically about 30 bases upstream of the 3’ end of the intron. 
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The intron in the Tetrahymena pre-rRNA is an example of a 
group I intron, which wont be discussed. Another type of self- 
splicing intron, called a group II intron, was subsequently discovered. 
in fungal mitochondria, plant chloroplasts, and a variety of bacteria 
and archaea. Group II introns fold into a complex structure 
shown two-dimensionally in FIGURE 11.29a. Group II introns undergo 
self-splicing by passing through an intermediate stage, called a lariat 
(Figure 11.29b) because it resembles the type of rope used by cow- 
boys to catch runaway calves. The first step in group II intron splic- 
ing is the cleavage of the 5’ splice site (step 1, Figure 11.29b), followed. 
by formation of a lariat by means of a covalent bond between the 5’ 
end of the intron and an adenosine residue near the 3’ end of the 
intron (step 2). The subsequent cleavage of the 3’ splice site releases 
the lariat and allows the cut ends of the exon to be covalently joined 
(ligated) (step 3). 

The steps that occur during the removal of introns from pre- 
mRNA molecules in eukaryotic cells are quite similar to those fol- 
lowed by group II introns. The primary difference is that the 
pre-mRNA is not able to splice itself, requiring instead a host of 
small nuclear RNAs (snRNAs) and their associated proteins. As 
each large hnRNA molecule is transcribed, it becomes associated 
with a variety of proteins to form a heterogeneous nuclear ribo- 
nucleoprotein (hnRNP), which represents the substrate for the 
processing reactions that follow. Processing occurs as each intron of 
the pre-mRNA becomes associated with a dynamic macromolecular 
machine called a spliceosome. Each spliceosome consists of a variety 
of proteins and a number of distinct ribonucleoprotein particles, 
called small nuclear ribonucleoproteins (snRNPs) because they 
are composed of snRNAs bound to specific proteins. Spliceosomes 
are not present within the nucleus in a prefabricated state, but rather 
are assembled as their component snRNPs bind to the pre-mRNA. 
Once the spliceosome machinery is assembled, the snRNPs carry 
out the reactions that cut the introns out of the transcript and paste 
the ends of the exons together. The excised introns, which constitute 
more than 95 percent of the average mammalian pre-mRNA, are 
simply degraded within the nucleus. 

Our understanding of the steps in RNA splicing has been achieved 
largely through studies of cell-free extracts that can accurately splice 
pre-mRNAs in vitro. Some of the major steps in the assembly ofa splice- 
osome and removal of an intron are indicated in FIGURE 11.30 and 
described in some detail in the accompanying legend. Taken together, 
removal of an intron requires several snRNP particles: the U1 snRNP, 
U2 snRNP, U5 snRNP, and the U4/U6 snRNP, which contains the U4 
and U6 snRNAs bound together. In addition to its snRNA, each 
snRNP contains a dozen or more proteins. One family, called Sm pro- 
teins, are present in all of the snRNPs. Sm proteins bind to one another 
and to a conserved site on each snRNA (except U6 snRNA) to form 
the core of the snRNP. FIGURE 11.31a shows a structural model of the 
U1 snRNP, with the locations of the snRNA, Sm proteins, and other 
proteins within the particle indicated. Figure 11.31b shows just the U1 
snRNA. Sm proteins were first identified because they are the targets 
of antibodies produced by patients with the autoimmune disease sys- 
temic lupus erythematosus. The other proteins of the snRNPs are 
unique to each particle. 

The events described in Figure 11.30 provide excellent examples 
of the complex and dynamic interactions that can occur between 
RNA molecules. The multiple rearrangements among RNA mole- 
cules that occur during the assembly of a spliceosome are primarily 
mediated by ATP-consuming RNA helicases present within the 
snRNPs. RNA helicases can unwind double-stranded RNAs, such as 
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FIGURE 11.29 The structure and self-splicing pathway of group II 
introns. (a) Two-dimensional structure of a group II intron (shown in red). 
The intron folds into six characteristic domains that radiate from a central 
structure. The asterisk indicates the adenosine nucleotide that bulges out 
of domain VI and forms the lariat structure as described in the text. The two 
ends of the intron become closely applied to one another as indicated by 
the proximity of the two intron—-exon boundaries. (b) Steps in the self- 
splicing of group II introns. In step 1, the 2’ OH of an adenosine within the 
intron (asterisk in domain VI of part a) carries out a nucleophilic attack on 
the 5’ splice site, cleaving the RNA and forming an unusual 2’-5’ phospho- 
diester bond with the first nucleotide of the intron (step 2). This branched 
structure is described as a lariat. Also shown in step 2, the free 3’ OH of the 
displaced exon attacks the 3’ splice site, which cleaves the RNA at the 
other end of the intron. As a result of this reaction, the intron is released as 
a free lariat, and the 3’ and 5’ ends of the two flanking exons are ligated 
(step 3). A similar pathway is followed in the splicing of introns from 
pre-mRNAs, but rather than occurring by self-splicing, these steps require 
the aid of a number of additional factors. 
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FIGURE 11.30 Schematic model of 
the assembly of the splicing machinery 6 ] 
and some of the steps that occur 
during pre-mRNA splicing. Step 1 
shows the portion of the pre-mRNA to 
be spliced. In step 2, the first of the 
splicing components, U1 snRNP, has become attached at the 5’ splice site of the intron. The nucleotide sequence of U1 snRNA is complementary to the 5’ 
splice site of the pre-mRNA, and evidence indicates that U1 snRNP initially binds to the 5’ side of the intron by the formation of specific base pairs between 
the splice site and U1 snRNA (see inset A). The U2 snRNP is next to enter the splicing complex, binding to the pre-mRNA (as shown in inset A) in a way that 
causes a specific adenosine residue (dot) to bulge out of the surrounding helix (step 3). This is the site that later becomes the branch point of the lariat. U2 is 
thought to be recruited by the protein U2AF, which binds to the polypyrimidine tract near the 3’ splice site. U2AF also interacts with SR proteins that bind to 
the exonic splicing enhancers (ESEs). These interactions play an important role in recognizing intron/exon borders. The next step is the binding of the U4/Ué 
and U5 snRNPs to the pre-mRNA with the accompanying displacement of U1 (step 4). The assembly of a spliceosome involves a series of dynamic interac- 
tions between the pre-mRNA and specific saRNAs and among the snRNAs themselves. As they enter the complex with the pre-mRNA, the U4 and Ué 
snRNAs are extensively base-paired to one another (inset B). The U4 snRNA is subsequently stripped away from the duplex, and the regions of Ué that were 
paired with U4 become base-paired to a portion of the U2 snRNA (inset C). Another portion of the Ué snRNA is situated at the 5’ splice site (inset C), having 
displaced the U1 snRNA that was previously bound there (inset A). It is proposed that Ué is a ribozyme and that U4 is an inhibitor of its catalytic activity. 
According to this hypothesis, once the U1 and U4 snRNA have been displaced, the U6 snRNA is in position to catalyze the two chemical reactions required 
for intron removal. According to an alternate view, the reactions are catalyzed by the combined activity of U6 snRNA and a protein of the U5 snRNP. 
Regardless of the mechanism, the first reaction (indicated by the arrow in inset C) results in the cleavage of the 5’ splice site, forming a free 5’ exon and a 
lariat intron—3’ exon intermediate (step 5). The free 5’ exon is thought to be held in place by its association with the U5 snRNA of the spliceosome, which 
also interacts with the 3’ exon (step 5). The first cleavage reaction at the 5’ splice site is followed by a second cleavage reaction at the 3’ splice site (arrow, 
step 5), which excises the lariat intron and simultaneously joins the ends of the two neighboring exons (step 6). Following splicing, the snRNPs must be 
released from the pre-mRNA, the original associations between snRNAs must be restored,and the snRNPs must be reassembled at the sites of other introns. 
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the U4—U6 duplex shown in Figure 11.30, inset B, which allows the 
displaced RNAs to bind new partners. Spliceosomal helicases are 
also thought to strip RNAs from bound proteins, including the 
U2AF protein of Figure 11.30, inset A. At least eight different heli- 
cases have been implicated in splicing of pre-mRNAs in yeast. 

The fact that (1) pre-mRNAs are spliced by the same pair of 
chemical reactions that occur as group II introns splice themselves and 
(2) the snRNAs that are required for splicing pre-mRNAs closely 
resemble parts of the group II introns suggested that the snRNAs are 
the catalytically active components of the snRNPs, not the proteins. 
According to this scenario, the spliceosome would act as a ribozyme 
and the proteins would serve various supplementary roles, such as 
maintaining the proper three-dimensional structure of the snRNA 
and selecting the splice sites to be used during the processing of a par- 
ticular pre-mRNA. Of the various snRNAs that participate in RNA 
splicing, U6 is considered the most likely candidate for the catalytic 
species. However, recent studies have placed at least one of the proteins 
of the U5 snRNP (namely, a conserved protein called Prp8) very close 
to the catalytic site of the spliceosome. In addition, Prp8 contains an 
RNase domain that might be well suited for cleaving the pre-mRNA. 
This finding has revived the proposal that the combined action of both 
an RNA and a protein component of the spliceosome is responsible for 
catalyzing the two chemical reactions required for RNA splicing. 

It was mentioned above that sequences situated within exons, 
called exonic splicing enhancers (ESEs), play a key role in recogni- 
tion of exons by the splicing machinery. ESEs serve as binding sites 
for a family of RNA-binding proteins, called SR proteins—so named 
because of their large number of arginine (R)-serine (S) dipeptides. 
SR proteins are thought to form interacting networks that span the 
intron/exon borders and help recruit snRNPs to the splice sites (see 
Figure 11.30, inset A). Positively charged SR proteins may also bind 
electrostatically to the negatively charged phosphate groups that are 
added to the CTD of the polymerase as transcription is initiated 
(Figure 11.18). As a result, the assembly of the splicing machinery at 
an intron occurs in conjunction with the synthesis of the intron by 
the polymerase. The CTD is thought to recruit a wide variety of pro- 
cessing factors. In fact, most of the machinery required for mRNA 
processing and export to the cytoplasm travels with the polymerase 
as part of a giant “mRNA factory” (FIGURE 11.32). 

Because most genes contain a number of introns, the splicing 
reactions depicted in Figure 11.30 must occur repeatedly on a single 


FIGURE 11.31 The structure of an snRNP. (a) Model of a U1 snRNP 
particle based on biochemical data and structural information 
obtained by cryoelectron microscopy. At the core of the particle is a 
ring-shaped protein complex composed of the seven different Sm 
proteins that are common to all U snRNPs. Three other proteins are 
unique to the U1 snRNP (named 70K, U1-A, and U1-C). Stems |, Il and 
IV are parts of the 165-base U1 snRNA. The snRNP is assembled in the 
cytoplasm and imported into the nucleus, where it carries out its 
function. (b) A model of the U1 snRNA in the same orientation as in a. 
A higher resolution model can be found in Nature 458: 475, 2009. 
Source: (a—b) From Holger Stark et al., Nature 409:541, 2001. 
Reproduced with permission from Macmillan Publishers Limited. 


primary transcript. Evidence suggests that introns are removed in a 
preferred order, generating specific processing intermediates whose 
size lies between that of the primary transcript and the mature 
mRNA. An example of the intermediates that form during the 
nuclear processing of the ovomucoid mRNA in cells of the hen’s 
oviduct is shown in FIGURE 11.33. 
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1. What are the general steps in the processing of a 
pre-mRNA into an mRNA? What is the role of the 
snRNAs and the spliceosome? 
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FIGURE 11.32 Schematic representation of a mechanism for the 
coordination of transcription, capping, polyadenylation, and splicing. 

In this simplified model, the C-terminal domain (CTD) of the large subunit of 
the RNA polymerase (page 418) serves as a flexible scaffold for the 
organization of factors involved in processing pre-mRNAs, including those 
for capping, polyadenylation, and intron removal. In addition to the proteins 
depicted here, the polymerase is probably associated with a host of 
transcription factors, as well as enzymes that modify the chromatin template. 
The proteins bound to the polymerase at any particular time may depend 
on which of the serine residues of the CTD are phosphorylated. The pattern 
of phosphorylated serine residues changes as the polymerase proceeds 
from the beginning to the end of the gene being transcribed (compare to 
Figure 11.18). The phosphate groups linked to the #5 residues are largely 
lost by the time the polymerase has transcribed the 3’ end of the RNA. 


Source: After E. J. Steinmetz, Cell 89:493, 1997; by permission of Cell Press. 
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FIGURE 11.33 Processing the ovomucoid pre-mRNA. The photograph 
shows a Northern blot, a technique in which extracted RNA (in this case 
from the nuclei of hen oviduct cells) is fractionated by gel electrophoresis 
and blotted onto a membrane filter. The immobilized RNA on the filter is 
then incubated with a radioactively labeled cDNA (in this case a cDNA 
made from the ovomucoid mRNA) to produce bands that reveal the 
positions of RNAs containing the complementary sequence. The mature 
mRNA encoding the ovomucoid protein is 1100 nucleotides long and is 
shown at the bottom of the blot. It is evident that the nucleus contains a 
number of larger-sized RNAs that also contain the nucleotide sequence of 
the ovomucoid mRNA. The largest RNA on the blot has a length of 5450 
nucleotides, which corresponds to the size of the ovomucoid transcription 
unit; this RNA is presumably the primary transcript from which the mRNA is 
ultimately carved. Other prominent bands contain RNAs with lengths of 
3100 nucleotides (which corresponds to a transcript lacking introns 5 and 6), 
2300 nucleotides (a transcript lacking introns 4, 5, 6, and 7), and 1700 
nucleotides (a transcript lacking all introns except number 3). 


Source: Courtesy of Bert O'Malley. 


11.8 Evolutionary Implications 
of Split Genes and RNA Splicing 


The discovery that RNAs are capable of catalyzing chemical reac- 
tions has had an enormous impact on our view of biological evolu- 
tion. Ever since the discovery of DNA as the genetic material, 
biologists have wondered which came first, protein or DNA. The 
dilemma arose from the seemingly nonoverlapping functions of 
these two types of macromolecules. Nucleic acids store information, 
whereas proteins catalyze reactions. With the discovery of ribozymes 
in the early 1980s, it became apparent that one type of molecule— 
RNA—could do both. 

The discovery that RNA can be a catalyst led to the idea, intro- 
duced in the chapter opener, of an RNA World, in which DNA and 
protein were absent at the early stages of the evolution of life. During 
this period, RNA molecules performed double duty: They served as 


genetic material, and they catalyzed chemical reactions including 429 


those required for RNA replication. Only at a later stage in evolution 
were the functions of catalysis and information storage taken over by 
protein and DNA, respectively, leaving RNA to function primarily as 
a go-between in the flow of genetic information. Many researchers 
believe that splicing provides an example of a legacy from an ancient 
RNA world. 

Although the presence of introns creates an added burden for 
cells, because they have to remove these intervening sequences from 
their transcripts, introns are not without their virtues. As we will see 
in the following chapter, RNA splicing is one of the steps along the 
path to mRNA formation that is subject to cellular regulation. Many 
primary transcripts can be processed by two or more pathways so that 
a sequence that acts as an intron in one pathway becomes an exon in 
an alternate pathway. As a result of this process, called alternative 
splicing, the same gene can code for more than one polypeptide. 

The presence of introns is also thought to have had a major 
impact on biological evolution. When the amino acid sequence of a 
protein is examined, it is often found to contain sections that are 
homologous to parts of several other proteins (see Figure 2.38 for an 
example). Proteins of this type are encoded by genes that are almost 
certainly composites made up of parts of other genes. The move- 
ment of genetic “modules” among unrelated genes—a process called 
exon shuffling—is greatly facilitated by the presence of introns, 
which act like inert spacer elements between exons. Genetic rear- 
rangements require breaks in DNA molecules, which can occur 
within introns without introducing mutations that might impair the 
organism. Over time, exons can be shuffled independently in vari- 
ous ways, allowing a nearly infinite number of combinations in 
search for new and useful coding sequences. As a result of exon shuf- 
fling, evolution need not occur only by the slow accumulation of 
point mutations but might also move ahead by “quantum leaps” with 
new proteins appearing in a single generation. 
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1. What is meant by the term RNA world? What type of 


evidence argues for its existence? 
. 


11.9 Creating New Ribozymes 
in the Laboratory 


The main sticking point in the minds of many biologists concerning 
the feasibility of an RNA world in which RNA acted as the sole 
catalyst is that, to date, only a few reactions have been found to be 
catalyzed by naturally occurring RNAs. These include the cleavage 
and ligation of phosphodiester bonds required for RNA splicing and 
the formation of peptide bonds during protein synthesis. Are these 
the only types of reactions that RNA molecules are capable of cata- 
lyzing, or has their catalytic repertoire been sharply restricted by the 
evolution of more efficient protein enzymes? Several groups of 
researchers have explored the catalytic potential of RNA by creating 
new RNA molecules in the laboratory. Although these experiments 
can never prove that such RNA molecules existed in ancient organ- 
isms, they provide proof of the principle that such RNA molecules 
could have evolved through a process of natural selection. 
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In one approach, researchers have created catalytic RNAs from 
scratch without any preconceived design as to how the RNA should 
be constructed. The RNAs are produced by allowing automated 
DNA-synthesizing machines to assemble DNAs with random nucle- 
otide sequences. Transcription of these DNAs produces a population 
of RNAs whose nucleotide sequences are also randomly determined. 
Once a population of RNAs is obtained, individual members can be 
selected from the population by virtue of particular properties they 
possess. This approach has been described as “test-tube evolution.” 

In one group of studies, researchers initially selected for RNAs 
that bound to specific amino acids and, subsequently, for a subpopu- 
lation that would transfer a specific amino acid onto the 3’ end of a 
targeted tRNA. This is the same basic reaction carried out by ami- 
noacyl-tRNA synthetases, which are enzymes that link amino acids 
to tRNAs as required for protein synthesis (page 441). It is specu- 
lated that amino acids may have been used initially as adjuncts 
(cofactors) to enhance catalytic reactions carried out by ribozymes. 
Over time, ribozymes presumably evolved that were able to string 
specific amino acids together to form small proteins, which were 
more versatile catalysts than their RNA predecessors. As we will see 
later in this chapter, ribosomes—the ribonucleoprotein machines 
responsible for protein synthesis—are essentially ribozymes at heart, 
which provides strong support for this evolutionary scenario. 

As proteins took over a greater share of the workload in the 
primitive cell, the RNA world was gradually transformed into an 
“RNA-protein world” At a later point in time, RNA was presumably 
replaced by DNA as the genetic material, which propelled life forms 
into the present “DNA-RNA-protein world.” The evolution of DNA 
may have required only two types of enzymes: a ribonucleotide 
reductase to convert ribonucleotides into deoxyribonucleotides and 
a reverse transcriptase to transcribe RNA into DNA. The fact that 
RNA catalysts do not appear to be involved in either DNA synthesis 
or transcription supports the idea that DNA was the last member of 
the DNA-RNA-protein triad to appear on the scene. 

Somewhere along the line of evolutionary progress, a code had 
to evolve that would allow the genetic material to specify the 
sequence of amino acids to be incorporated into a given protein. The 
nature of this code is the subject of a later section of this chapter. 
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1. What is meant by the phrase “test-tube evolution” as 


it applies to the catalytic activity of RNA molecules? 
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11.10 RNA Interference 


The idea that RNA molecules are directly involved in the regulation 
of gene expression began with a puzzling observation. The petals of 
a petunia plant are normally light purple. In 1990, two groups of 
investigators reported on an attempt to deepen the color of the flow- 
ers by introducing extra copies of a gene encoding a pigment- 
producing enzyme. To the surprise of the researchers, the presence 
of the extra genes caused the petals to lose their pigmentation rather 
than become more darkly pigmented as expected (FIGURE 11.34a). 
Subsequent studies indicated that, under these experimental condi- 
tions, both the added genes and their normal counterparts within 
the genome were being transcribed, but the resulting mRNAs were 


(b) 


(a) (c) 


FIGURE 11.34 RNA interference. (a) Petunia plants normally have purple 
flowers. The flowers of this plant appear white because the cells contain an 
extra gene (a transgene) that encodes an enzyme required for pigment 
production. The added gene has triggered RNA interference leading to the 
specific destruction of mRNAs transcribed from both the transgene and the 
plant's own genes, causing the flowers to be largely unpigmented. (b) A 
nematode worm containing a gene encoding a GFP fusion protein (page 260) 
that is expressed specifically in the animal’s pharynx. (c) This worm devel- 
oped from a parent containing the same genotype as that shown in b, whose 
gonad had been injected with a solution of dsRNA that is complementary to 
the mRNA encoding the GFP fusion protein. The absence of visible staining 
reflects the destruction of the mRNA by RNA interference. 

Source: (a) From David Baulcombe, Curr. Biol. 12:R82, 2002, with permission 
from Elsevier. Courtesy of David Baulcombe, Sainsbury Laboratory, 
Norwich, UK.; (b) Reprinted from www.neb.com (2012) with permission from 
New England BioLabs. 


somehow degraded. The phenomenon became known as posttran- 
scriptional gene silencing (PTGS). 

It wasn't until 1998 that the molecular basis for this form of gene 
silencing became understood. In that year, Andrew Fire of Carnegie 
Institute of Washington and Craig Mello of the University of 
Massachusetts and their colleagues conducted an experiment on the 
nematode C. elegans. They injected these worms with several differ- 
ent preparations of RNA hoping to stop production of a particular 
muscle protein. One of the preparations contained “sense” RNA, that 
is, an RNA having the sequence of the mRNA that encoded the pro- 
tein being targeted; another preparation contained “antisense” RNA, 
that is, an RNA having the complementary sequence of the mRNA 
in question; and a third preparation consisted of a double-stranded 
RNA containing both the sense and antisense sequences bound to 
one another. Neither of the single-stranded RNAs had much of an 
effect, but the double-stranded RNA was very effective in stopping 
production of the encoded protein. Fire and Mello described the 
phenomenon as RNA interference (RNAi). They demonstrated 
that double-stranded RNAs (dsRNAs) were taken up by cells where 
they induced a response leading to the selective destruction of 
mRNAs having the same sequence as the added dsRNA. Let's say, for 
example, one sought to stop the production of the enzyme glycogen 
phosphorylase within the cells of a nematode worm so that the effect 
of this enzyme deficiency on the phenotype of the worm could 
be determined. Remarkably, this result can be attained by simply 


placing the worm in a solution of dsRNA that shares the same 
sequence as the target mRNA. A similar experiment is shown in 
Figure 11.34b,c. This phenomenon is similar in effect, although 
entirely different in mechanism, to the formation of knockout mice 
that are lacking a particular gene that encodes a particular protein. 
The phenomenon of dsRNA-mediated RNA interference is an 
example of the broader phenomenon of RNA silencing, in which 
small RNAs, typically working in conjunction with protein machin- 
ery, act to inhibit gene expression in various ways. RNAi is thought 
to have evolved as a type of “genetic immune system” to protect 
organisms from the presence of foreign or unwanted genetic mate- 
rial. To be more specific, RNAi probably evolved as a mechanism to 
block the replication of viruses and/or to suppress the movements of 
transposons within the genome because both of these potentially 
dangerous processes can involve the formation of double-stranded 
RNAs. Cells can recognize dsRNAs as “undesirable” because such 
molecules are not produced by the cell’s normal genetic activities. 
The steps involved in the RNAi pathway are shown in 
FIGURE 11.35a. The double-stranded RNA that initiates the response 
is first cleaved into small (21-23 nucleotide), double-stranded frag- 
ments, called small interfering RNAs (siRNAs), by a particular 
type of ribonuclease, called Dicer. Enzymes of the class to which 
Dicer belongs act specifically on dsRNA substrates and generate 


FIGURE 11.35 The formation and mechanism of action of siRNAs 
and miRNAs. (a) In step 1, both strands of a double-stranded RNA 
are cleaved by the endonuclease Dicer to form a small (21-23 
nucleotide) siRNA, which has overhanging ends (step 2). In step 3, the 
siRNA becomes associated with a protein complex, a pre-RISC, that 
contains an Argonaute protein (typically Ago2) capable of cleaving 
and removing the passenger strand of the siRNA duplex. In step 4, 
the single-stranded guide siRNA, in association with proteins of the 
RISC complex, binds to a target RNA that has a complementary 
sequence. The target RNA might be a viral RNA, a transcript from a 
transposon, or an MRNA, depending on circumstances. In step 5, the 
target RNA is cleaved at a specific site by the Argonaute protein and 
subsequently degraded. (b) MicroRNAs are derived from single- 
stranded precursor RNAs that contain complementary sequences that 
allow them to fold back on themselves to form a double-stranded 
RNA with a stem-loop at one end (step a). This pseudo-dsRNA (or 
pri-miRNA) is cleaved at a specific site near its terminal loop by a 
protein complex containing an endonuclease named Drosha to 
generate a pre-miRNA that has a 3’ overhang at one end. The pre- 
miRNA is exported to the cytoplasm (step 1) where it is cleaved by 
Dicer into a small duplex miRNA (step 2) that has a 3’ overhang at 
both ends. In step 3, the double-stranded RNA becomes associated 
with a protein complex containing an Argonaute protein (typically 
Ago), leading to the separation of the strands and removal of the 
passenger strand (called miRNA*). The single-stranded guide miRNA 
then binds to a complementary region on an mRNA (step 4) and 
inhibits translation of the message as shown in step 5 (or alternatively 
leads to deadenylation and degradation of the mRNA as discussed in 
Section 12.20). Unlike siRNAs, miRNAs that inhibit translation are only 
partially complementary to the target mRNA, hence the bulge. Most 
plant miRNAs and a handful of animal miRNAs are precisely comple- 
mentary to the mRNA, or nearly so; in these cases, the outcome of 
the interaction tends to be cleavage of the mRNA by Ago2 in the 
same manner shown in a. (A certain class of miRNAs derived from 
introns do not form long hairpin pri-miRNAs and do not require 
Drosha for processing.) 


small dsRNAs that have 3’ overhangs, as shown in Figure 11.35a. 431 


These small dsRNAs are then loaded into a complex (identified as a 
pre-RISC in Figure 11.35a) that contains a member of the Argonaute 
family of proteins. Argonaute proteins play a key role in all of the 
known RNA silencing pathways. In the RNAi pathway, one of the 
strands of the RNA duplex (called the passenger strand) is cleaved 
in two and then dissociates from the pre-RISC. The other strand of 
the RNA duplex (called the guide strand) is incorporated, together 
with its Argonaute partner, into a related protein complex named 
RISC. Several different varieties of RISCs have been identified, dis- 
tinguished by the particular Argonaute protein they contain. In ani- 
mal cells, RISCs that contain an siRNA typically include the 
Argonaute protein Ago2. The RISC provides the machinery for the 
tiny single-stranded siRNA to bind to an RNA having a comple- 
mentary sequence. Once bound, the target RNA is cleaved at a spe- 
cific site by the ribonuclease activity of the Ago2 protein. Thus, the 
siRNA acts like a guide to direct the complex toward a complemen- 
tary target RNA, which is then destroyed by a protein partner. Each 
siRNA can orchestrate the destruction of numerous copies of the 
target RNA, which might be a viral transcript, an RNA transcribed 
from a transposon or retrotransposon, or a host mRNA that is being 
targeted by a researcher as described in the beginning of this 
section. 
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RNA interference has been studied largely in plants and nema- 
tode worms. Vertebrates do not seem to utilize RNAi as a defense 
against viruses, relying instead on a well-developed immune sys- 
tem.” In fact, when a dsRNA is added to mammalian cells growing 
in culture, or injected directly into the body of a mammal, rather 
than stopping the translation of a specific protein, it usually initiates 
a global response that inhibits protein synthesis in general. This 
global shutdown of protein synthesis (discussed in Section 17.1) is 
thought to have evolved as a means to protect cells from infection by 
viruses. To get around this large-scale global response, researchers 
turned to the use of very small dsRNAs. They found that introduc- 
tion into mammalian cells of synthetic dsRNAs that were 21 nucleo- 
tides long (ie., equivalent in size to the siRNAs produced as 
intermediates during RNA interference in other organisms) did 
not trigger global inhibition of protein synthesis. Moreover, such 


5Mammals possess all of the components needed to generate siRNAs. Recent studies have 
indicated that endogenous siRNAs (or endo-siRNAs, as they are called) are produced in 
mammalian oocytes and embryonic stem cells, in part as a mechanism of defense against 
the movement of transposable elements in these cells. Formation of endo-siRNA in 
mammalian oocytes is accomplished by an oocyte-specific Dicer which, in contrast to the 
Dicer found in somatic cells, is extremely good at producing siRNA from dsRNA. 


dsRNAs were capable of RNAi, that is, they inhibited synthesis of the 
specific protein encoded by an mRNA having a matching nucleotide 
sequence. Proteins encoded by other mRNAs were generally not 
affected. This technique has become a major experimental strategy 
to learn more about the function of specific genes. One can simply 
knock out (or, more accurately, knock down) the activity of the spe- 
cific gene in question, using dsRNAs to destroy the transcribed 
mRNAs, and look for any cellular abnormalities that result from a 
deficiency of the encoded protein (see Figure 8.8 for an example). 
Libraries are now available containing siRNAs targeting each gene in 
the entire genome. With the assistance of laboratory robots, it is pos- 
sible to use these libraries to identify all the genes in the entire 
genome that are required for a process of interest in a single study. 
The potential clinical importance of RNAi is discussed in the accom- 
panying Human Perspective. 


REVIEW | 


1. What are the steps by which double stranded RNA is 
converted into siRNA and then used to destroy a 
target RNA? 
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11.11 THE HUMAN PERSPECTIVE 


Clinical Applications of RNA Interference 


Drug development is a long and difficult process. As discussed in 
Chapter 2, drugs often work by inhibiting proteins involved in a dis- 
ease. It is sometimes possible to use the three-dimensional struc- 
ture of a protein involved in a disease to develop chemicals that will 
specifically inhibit that protein. However, despite successes such 
as Gleevac, discussed in Figure 2.51, in general this approach has 
proven to be extremely difficult and failure-prone. The discovery of 
RNAi, with its ability to reduce the expression of any gene of interest 
just by introducing a short siRNA into a cell, suggested a different 
approach to developing therapeutics: Instead of making a small 
chemical that inhibits a target enzyme by binding in its active site, 
one could potentially get the same result by using RNAi to target 
the gene encoding the enzyme. This would lead to less enzyme 
activity, just as a chemical inhibitor would, but the therapeutic agent 
would be much easier to design, just by looking at the sequence of 
the gene. In addition to being easier to design, siRNA is less likely to 
suffer from the bane of all drug discovery efforts: off-target effects. 
An off-target effect happens when a drug designed to inhibit one 
protein accidentally inhibits one or more other proteins. Because 
chemical drugs are small, it is easy for them to fit into a variety of 
binding pockets on different proteins, making it extremely hard to 
ensure specific binding only to a target of interest. Often, the off- 
target effects are not noticed until clinical trials take place in human 
patients. This lack of specificity is a major problem in the drug dis- 
covery industry. RNAi is potentially much more specific because any 
potential siRNA can be compared to the entire human genome to 
ensure it is only complementary to a single gene of interest and 
not to any others. For these reasons, there has been tremendous 
investment in developing siRNA-based therapeutics. 

There are two ways to produce an RNAi response in human 
cells. The first is addition of synthetic siRNA molecules. When 
cells are incubated with a 21- to 23-nucleotide synthetic siRNA, 


the molecules are taken up by the cells, and incorporated into an 
mRNA-cleaving ribonucleoprotein complex (as in Figure 11.35a), 
which attacks the complementary mRNA. The second approach 
is to expose cells to viruses that can insert a DNA construct with 
inverted repeats into the host genome. When the construct 
is transcribed, the RNA product folds back on itself to form a 
hairpin-shaped (i.e., double-stranded) siRNA precursor (similar to 
that in Figure 11.356) that is processed into an active siRNA. The 
use of synthetic siRNAs is likely to have a transient effect, which 
may or may not be beneficial depending on the condition being 
treated. In contrast, the use of viral vectors that incorporate DNA 
into the cell’s genome has the potential to generate a long-lasting 
therapeutic effect with a single application. Unfortunately, viruses 
can trigger massive immune reactions, and they have fallen out of 
favor as a method for delivering RNAi-based therapy. 

As with all potential drug technologies, the first stages in 
development of RNAi therapeutics involved preclinical studies 
in which siRNAs were tested in cells from diseased patients or 
in animal models (i.e., animals with induced diseases similar to 
those that afflict humans). Many of these preclinical studies sug- 
gested that RNAi held great promise in the treatment of a wide 
range of diseases and viral infections. We can consider just a few 
examples. As noted above, cancer cells typically contain one or 
more mutated genes that encode abnormal proteins responsible 
for producing the cancerous phenotype. A type of leukemia, for 
example, is caused by a gene, called BCR-ABL, that is formed by 
the fusion of two normal genes. An siRNA against the mRNA pro- 
duced by the BCR-ABL fusion gene proved capable of converting 
the malignant cells to a normal phenotype in culture. Similarly, an 
siRNA against a gene carrying an expanded CAG tract of the type 
that causes Huntington's disease (page 388) was shown to block 
the production of the abnormal protein. 


A large number of preclinical studies probing the therapeutic 
value of siRNAs have focused on viral infection. Administration 
of siRNAs complementary to influenza, HIV, or hepatitis viral 
sequences has prevented or eliminated infections by these 
viruses in laboratory animals. In the case of AIDS, it is hoped that 
stem cells can be isolated from a patient, transfected with a vector 
carrying an siRNA that targets an HIV-encoded mRNA, and then 
reinfused back into the patient's bloodstream. Such cells have the 
potential to produce the siRNA more or less continuously, caus- 
ing the cell and its descendants to be resistant to destruction by 
the virus. There are roadblocks, however, for using RNAi to treat 
viral infections. The hallmark of RNAi is its extraordinary sequence 
specificity, which can be both a virtue and a handicap. Ultimately, 
RNAi may prove to be ineffective against viruses, because these 
pathogens tend to mutate rapidly. Mutations result in changes in 
genome sequence, leading to the production of mRNAs that are 
no longer fully complementary to the therapeutic siRNA. 

siRNA technology is now the basis of a growing list of clinical 
trials. The first test of RNAi therapeutics in human patients came 
against a form of macular degeneration that is caused by the 
overgrowth of blood vessels behind the retina of elderly patients, 
causing progressive loss of vision. The excessive growth of these 
blood vessels is spurred by the production of a growth factor, 
VEGF, which induces its effect by binding to the cell-surface 
receptor VEGFR1. In clinical trials, siRNA targeting the mRNA 
encoding VEGF was injected directly into the eyes of patients 
with this disease. Early-stage trials suggested that the siRNA was 
having a beneficial effect but later-stage phase Ill trials failed to 
support the findings and work on the drug (called Bevasiranib) 
was suspended. Another initially promising siRNA for treating 
macular degeneration, ANG-745, was found to be ineffective 
in phase Il trials and further development has also been halted. 
Macular degeneration and other ophthalmological diseases con- 
tinue to be viewed as promising candidates for siRNA therapy 
because it is so easy to apply the siRNA molecules directly to the 
eye, either by injection or in eye drops. 

Another series of clinical tests were performed using siRNA 
therapy against a respiratory virus, RSV, which is a common cause 
of infant hospitalization. In these trials, adult subjects inhaled 
an aerosol that contains an siRNA (named ALN-RSVO1) directed 
against mRNAs encoding a key viral protein. An initial phase II trial 
suggested that the treatment was effective in reducing virus levels 
without causing dangerous side effects, but a second phase II trial 
failed, and the treatment has not yet been studied in further trials. 

Age-related macular degeneration and RSV respiratory infec- 
tion are not typical diseases in the sense that they can be treated 
with local application of an RNAi therapeutic., making it easy to 
deliver the siRNA directly to the cells. Indeed, eyes, lungs, and 
skin have been the main targets for siRNA therapies developed 
thus far, simply for reasons of easy accessibility to the tissue. For 
diseases affecting other tissues and organs, the difficulty in deliv- 
ering siRNAs to affected tissues is a major obstacle. In an attempt 
to meet this challenge, a variety of nanosized particles have been 


FIGURE 1 PLK1 as a target for cancer therapy. Tumors from a 
mouse in which cells were inserted that carry genetic mutations 
predisposing the cells to make tumors. The tumor on the left was from 
cells treated with an inhibitor of PLK1, showing that growth of the tumor 
is greatly reduced compared to untreated tumors as shown on the right. 


Source: From Luo et al., Cell 137:835-848, 2009. 


developed that (1) either bind to siRNAs or encapsulate them and 
(2) target the siRNAs to the appropriate tissue or organ following 
their injection into the bloodstream. Such nanoparticles can be 
injected into the bloodstream where they travel to organs such as 
the kidney or liver. One such nanoparticle-encapsulated siRNA, 
TKM-PLK1, targets an enzme called polo-like kinase | (PLK1) that 
is important in regulating cell division. Laboratory experiments 
show that reducing PLK1 activity leads to cell death in cancer cells, 
and can reduce tumor growth (FIGURE 1), thus making PLK1 a 
promising target for siRNA-based cancer therapy. The TKM-PLK1 
siRNA-nanoparticle travels to the kidney to deliver the siRNA and is 
currently being evaluated in phase II clinical trials for the treatment 
of adrenocortical carcinoma (ACC), a cancer that affects cells of the 
adrenal gland, which is located near the kidney. Another nanopar- 
ticle-encapsulated siRNA called Patisiran is designed to treat a 
genetic disease called transthyretin mediated amyloidosis, or 
ATTR. ATTR is caused by a mutation in the gene encoding the pro- 
tein transthyretin (TTR), normally involved in vitamin A transport. 
In ATTR, the mutation causes the TTR protein to fold improperly 
and accumulate in protein tangles in both the nervous system and 
heart, leading to neuropathy and heart disease. Patisiran siRNA 
targets TTR, reducing its production with the goal of reducing the 
level of misfolded protein. Patisiran was found safe and effective in 
phase Il trials and is now undergoing phase III clinical trials. 

To date, more than 30 siRNA-based therapeutics have been 
tested in clinical trials, many of which are still ongoing. Thus 
far, not one of these has yet obtained FDA approval for use in 
patients, but the promise of this approach has not diminished. 
Clinical trials of siRNAs aimed at the treatment of high blood cho- 
lesterol, certain cancers, hepatitis, pandemic flu, and other dis- 
eases are either planned or underway.The studies described here 
are only the beginning steps in a long research pathway, but they 
suggest that RNA interference may one day become a valuable 
therapeutic strategy against a diverse array of disorders. 


11.12 Small RNAs: miRNAs 
and piRNAs 


By 1993, it was known that C. elegans embryos lacking the gene lin-4 
were unable to develop into normal late-stage larvae. In that year, 
Victor Ambros, Gary Ruvkun, and their colleagues at Harvard 


University reported that the lin-4 gene encodes a small RNA that is 
complementary to segments in the 3’ untranslated region of a spe- 
cific mRNA encoding the protein LIN-14. They proposed that, dur- 
ing larval development, the lin-4 RNA binds to the complementary 
mRNA, blocking translation of the message, which triggers a transi- 
tion to the next stage of development. Mutants that are unable to 
produce the small lin-4 RNA possess an abnormally high level of the 
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This was the first clear example of RNA silencing of gene expression, 
but several years passed before the broader importance of these 
findings was appreciated. In 2000, it was shown that one of these tiny 
worm RNAs—a 21-nucleotide species called let-7—is highly con- 
served throughout evolution. Humans, for example, encode several 
RNAs that are either identical or nearly identical to let-7. This obser- 
vation led to an explosion of interest in these RNAs. 


miRNAs: A Class of Small RNAs that 
Regulate Gene Expression 


It has become evident in recent years that both plants and animals 
produce a collection of tiny RNAs named microRNAs (miRNAs) 
that, because of their small size, had been overlooked for decades. As 
first discovered in nematodes, specific miRNAs such as lin-4 and 
let-7 are synthesized only at certain times during development, or in 
certain tissues of a plant or animal, and are thought to play a regula- 
tory role. An example of the selective expression of specific miRNAs 
during the development of the zebrafish is shown in FIGURE 11.36. 
The size of miRNAs, at roughly 21-24 nucleotides in length, places 
them in the same size range as the siRNAs involved in RNAi. This 
observation is more than coincidence, because miRNAs are pro- 
duced by a similar processing machinery to that responsible for the 
formation of siRNAs. miRNAs and siRNAs might be considered to 
be “cousins,” as both act in posttranscriptional RNA silencing path- 
ways. There are, however, important differences. An siRNA is 


miR-124a 
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FIGURE 11.36 MicroRNAs are synthesized in specific tissues during 
embryonic development. These micrographs of zebrafish embryos show 
the specific expression of three different miRNAs whose localization is 
indicated by the blue stain. miR-124a is expressed specifically in the nervous 
system (a), miR-206 in skeletal muscle (6) and miR-122 in the liver (c). 
Source: (a—c) From Erno Wienholds, Wigard Kloosterman, et al., Science 
309:311, 2005, courtesy of Ronald H. A. Plasterk; © 2005, reprinted with 
permission from AAAS. 


derived from the double-stranded product of a virus or transposable 
element (or a dsRNA provided by a researcher), and it targets the 
same transcripts from which it arose. In contrast, an miRNA is 
encoded by a conventional segment of the genome and targets a spe- 
cific mRNA as part of a normal cellular program. In other words, 
siRNAs serve primarily to maintain the integrity of the genome, 
whereas miRNAs serve primarily to regulate gene expression. 

The pathway for synthesis of a typical miRNA is shown in 
Figure 11.35b. Most miRNAs are synthesized by RNA polymerase II 
as a primary transcript with a 5’ cap and poly(A) tail. This primary 
transcript folds back on itself to form a long, double-stranded, hair- 
pin-shaped RNA called a pri-miRNA shown in step a of Figure 
11.35b. The pri-miRNA is cleaved within the nucleus by an enzyme 
called Drosha into a shorter, double-stranded, hairpin-shaped pre- 
cursor (or pre-miRNA) shown in step 1. The pre-miRNA is exported 
to the cytoplasm where it gives rise to the small double-stranded 
miRNA. The miRNA is carved out of the pre-miRNA by Dicer, the 
same ribonuclease responsible for the formation of siRNAs. As with 
siRNAs, the double-stranded miRNA becomes associated with an 
Argonaute protein in whose company the RNA duplex is disassem- 
bled, and one of the single strands is incorporated into a RISC com- 
plex as shown in Figure 11.350. 

A typical miRNA in animal cells is partially complementary to 
a region in the 3’ UTR of the mRNA target (as in Figure 11.350). In 
these cases, base-pairing involves six or seven nucleotides near the 
5’ end of the miRNA (typically nucleotides 2-8 of the miRNA), 
which is referred to as the “seed” region, as well as a few additional 
base pairs elsewhere in the miRNA. The miRNA of the RISC com- 
plex guides the associated Argonaute protein into close proximity to 
the bound mRNA. As discussed in Section 12.20, the Argonaute 
protein causes either degradation of the mRNA or repression of its 
translation (see Figure 12.63). It has been hypothesized that transla- 
tional repression might itself lead to subsequent degradation of the 
mRNA and that the main function of the miRNA-bound to 
Argonaute would be to block translation, which then triggers degra- 
dation of the mRNA. Some miRNAs, however, particularly those 
found in plants, direct the cleavage of a specific phosphodiester 
bond within the mRNA backbone. As with siRNAs shown in Figure 
11.35a, mRNA cleavage is catalyzed by a “slicer” activity of the Ago2 
protein present in the RISC complex. MicroRNAs that direct mRNA 
cleavage tend to be fully complementary to their mRNA target 
rather than complementary to only a portion of their target. 

MicroRNA genes have been identified in a number of different 
ways: most often from computer analysis of genomic DNA sequences, 
but also through isolation of mutants or by direct sequencing of 
small cellular RNAs. Estimates suggest that humans may encode 
over a thousand distinct miRNA species. Roughly one-third of 
human messenger RNAs contain sequences that are complementary 
to likely miRNAs, which provides a hint of the degree to which these 
small regulatory RNAs may be involved in the control of gene 
expression in higher organisms. A single miRNA may bind to doz- 
ens or even hundreds of different mRNAs. Conversely, many mRNAs 
contain sequences that are complementary to several different 
miRNAs, which suggests that these miRNAs may act in various 
combinations to fine-tune the level of gene expression. This predic- 
tion is supported by experiments in which cells are forced to take up 
and express specific miRNA genes. Under these conditions, large 
numbers of mRNAs in the genetically engineered cells are negatively 
affected. When different miRNA genes are introduced into these 


cells, different groups of mRNAs are downregulated, indicating that 
the effect is sequence-specific. This type of fine-tuning of mRNA 
expression levels by miRNAs contrasts sharply with the much more 
dramatic effects of certain miRNAs, such as lin-4, which reduce the 
expression of a target mRNA to the point that it triggers a major 
change in the course of development (page 433). 

MicroRNAs have been implicated in many processes, including 
the patterning of the nervous system, control of cell proliferation 
and cell death, leaf and flower development in plants, and differen- 
tiation of various cell types. The roles of miRNAs in the develop- 
ment of cancer are explored at the end of Section 16.11. Dissecting 
the roles of individual miRNAs during mammalian development 
and tissue homeostasis is currently the focus of intense research. 


piRNAs: A Class of Small RNAs 
that Function in Germ Cells 


We have seen in Chapter 10 that the movement of transposable ele- 
ments poses a threat to the genome because it can disrupt the activ- 
ity of a gene into which the element happens to insert. If this type of 
genomic jumping were to occur during the life of an adult liver or 
kidney cell, the consequences would likely be minimal because the 
transposition event affects only that particular cell and its daughters 
(if the somatic cell is still capable of cell division). If, however, a 
transposition event were to occur in a germ cell, that is, a cell that has 
the capability to give rise to gametes, then the event has the potential 
to affect every cell in an individual of the next generation. It is not 
surprising, therefore, that specialized mechanisms have evolved to 
suppress the movement of transposable elements in germ cells. 

It was noted on page 431 that one of the apparent functions of 
siRNAs is to prevent the movement of transposable elements. Recent 
studies indicate that germ cells of animals express a distinct class of 
small RNAs, called piwi-interacting RNAs (or piRNAs) that sup- 
press the movement of transposable elements in the germ line. 
piRNAs get their name from the proteins with which they specifically 
associate. These proteins are called PIWIs, and they are a subclass of 
the Argonaute family, the same family of proteins that associate with 
siRNAs and miRNAs as part of their RISC complexes. The roles of 
the PIWI proteins have been best studied in fruit flies, where deletion 
of these proteins leads to defects in the suppression of transposon 
movement in germ cells and ultimately to failure of gamete formation. 
piRNAs and their associated PIWI proteins are also required for 
successful gamete formation in male mice, although not in female 
mice (where protection against transposable elements may be primarily 
afforded by endo-siRNAs discussed in the footnote on page 432). 

There are a number of important differences between piRNAs 
on one hand and si/miRNAs on the other: (1) piRNAs are longer 
than these other small RNAs, measuring about 24-32 nucleotides in 
length; (2) the majority of mammalian piRNAs can be mapped to a 
small number of huge genomic loci, some of which can encode 
thousands of different piRNAs; (3) piRNAs can be subject to an 
amplification process that generates additional copies of piRNAs, 
and (4) the formation of piRNAs does not involve the formation of 
dsRNA precursors or cleavage by the Dicer ribonuclease. Instead, 
piRNA biogenesis appears to depend on a single-stranded RNA 
endonuclease called Zucchini, which acts on a long, single-stranded 
primary transcript to release the short piRNAs. Those piRNAs that 
are active in the cell against transposable elements are then amplified 


in a subsequent step. The mechanism by which piRNAs act to silence 435 


target RNA expression remains unclear, although in some cell types 
the silencing is achieved by modifying the state of chromatin as dis- 
cussed in Chapter 12. 


REVIEW | 


1. What is an siRNA, an miRNA, and a piRNA? How is 
each of them formed in the cell? What are their 
proposed functions? 


——— 


11.13 CRISPR and other 
Noncoding RNAs 


Both siRNA and piRNA evolved as defense against dangerous 
genomic elements, either viral genomes or transposons. The main 
difference is in how they arise—siRNAs are produced by any dou- 
ble stranded RNA in the cell, which is a hallmark of a viral genome, 
while piRNAs are encoded in the host genome, and provide the host 
with a genetically encoded defense, ready to attack any transcript 
with a similar sequence. This latter strategy is also employed by 
many bacteria through an RNA-based pathway known as CRISPR. 


CRISPR: Noncoding RNA in Bacteria 


Within the genome of many bacteria there are genomic loci contain- 
ing short DNA segments that are identical to sequences from the 
DNA of bacteriophage viruses, separated by a repeated DNA 
sequence. These repeat loci are called Clustered Regularly Interspaced 
Short Palindromic Repeats, or CRISPRs. The viral sequences are 
obtained from viruses that invaded the ancestors of the bacterial 
cell, and their presence in the bacterial genome constitutes a mem- 
ory of such prior attacks. The CRISPR locus is transcribed as a long 
single-stranded RNA molecule, much like the piRNA precursor. 
Then, individual small RNA sequences, called single-guide RNA 
(sgRNA) or CRISPR RNA (crRNA) are cut out and loaded onto a 
protein called cas9. Cas9 uses the bound sgRNA to locate targets and 
cleaves any DNA with sequence homology to the guide RNA. 
Although this system evolved in bacteria to defend themselves 
against viruses by cleaving the viral DNA genomes, the CRISPR- 
cas9 system has been engineered to work in eukaryotic cells. By 
expressing cas9 in a cell along with a suitable single guide sequence, 
it is possible to create double-stranded DNA breaks in the genome of 
the cell. When the cell tries to repair these breaks, it can lead to 
insertions or deletions of DNA, producing a mutant gene. The 
CRISPR-cas9 system thus allows mutants to be made in any gene of 
interest. Moreover, the repair process can lead to chromosome rear- 
rangements or the insertion of new DNA sequences that share 
homology with the region of the cut, allowing the genome to be 
edited or modified at will. The ability to create mutants and perform 
genome editing, in virtually any organism, using the CRISPR-cas9 
system has led to a revolution in genetic manipulation of cells, par- 
ticularly in organisms such as mammals and plants where targeted 
genetic modification had previously been extremely difficult. 
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As the field of cell and molecular biology has matured, we have 
gradually learned to appreciate the remarkable structural and func- 
tional diversity of RNA molecules. New types of RNAs have been 
discovered frequently over the past few decades, as described in the 
previous pages of this text. But researchers have been unprepared for 
one flurry of findings, which indicate that at least two-thirds of a 
mouse or human genome is normally transcribed, far more than 
would be expected based on the number of “meaningful” DNA 
sequences thought to be present. In fact, most of the sequences 
included among this transcribed DNA are normally thought of as 
“junk,” including large numbers of transposable elements that con- 
stitute a sizable fraction of mammalian genomes (page 394). Some 
studies raise the question whether there is any basic difference 
between a gene and an intergenic region; they report that transcrip- 
tion can begin at all kinds of unexpected sites within the genome and 
that transcripts overlap extensively with one another. Other studies 
reveal that cells often transcribe both strands of a DNA element, 
producing both a sense RNA and an antisense RNA. Some antisense 
RNAs are synthesized by polymerases that had initially bound to the 
promoter of a protein-coding gene and then moved upstream from 
that site along the opposite strand of the chromosome, that is, in the 
opposite direction from polymerase molecules that are transcribing 
the gene itself. 

Just as the DNA ofa cell or organism constitutes its genome and 
the proteins it produces constitutes it proteome, the RNAs that are 
synthesized by a cell or organism constitutes its transcriptome. 
Why is the transcriptome of a mammalian cell so large? In other 
words, why does a cell transcribe all of these various types of DNA 
sequences? We dont know the answer to this basic question. 
According to one viewpoint, much of this pervasive transcription 
activity is simply the “background noise” that accompanies the com- 
plex process of gene expression. Proponents of this viewpoint cite 
the results of studies in mice in which blocks of the genome that lack 
protein-coding genes have been deleted. It was found in these stud- 
ies that healthy mice can develop even though they cannot synthe- 
size the noncoding RNAs (ncRNAs) that would normally have been 
produced from the deleted DNA sequences. According to a con- 
trasting viewpoint, much of the noncoding RNA being produced is 
involved in diverse regulatory activities that have yet to be identi- 
fied. Supporters of this position cite results suggesting that this 
pervasive transcription is not random; many of the noncoding tran- 
scripts display distinct and reproducible patterns of tissue-specific 
distribution, and their origin can be traced to specific loci in the 
genome. Keep in mind that it has only been in the last decade or so 
that the existence of siRNAs, miRNAs, and piRNAs has come to 
light, and it is likely that other types of small ncRNAs remain unde- 
tected. Not all noncoding transcripts are small. A number of evolu- 
tionarily conserved, long (>200 bases) ncRNAs (e.g., XIST, HOTAIR, 
and AIRE) that function as regulators of chromatin structure or gene 
transcription have also been discovered, as discussed in Sections 
12.4 and 12.17, respectively. Genomic studies show that there are 
thousands of conserved long ncRNAs (IncRNAs), and the few whose 
biological functions have been identified are only the tip of the ice- 
berg. It is even possible that the vast transcriptional output of the 
mammalian genome holds the key to explaining why we have 
approximately the same number of genes as organisms we believe to 
be much less complex (see Figure 10.28). Regardless of the actual 


explanation, one point is clear: There is a great deal that we do not 
understand about the many roles of RNAs in eukaryotic cells. 


REVIEW | 


1. Compare and contrast CRISPR RNA with piRNA, in 
terms of how the RNA is produced and what proteins 
they associate with. 


2. Why is the ability of the CRISPR system to cut DNA 
so useful for researchers? Why can't RNAi be used for 
the same purpose? 


11.14 Encoding Genetic 
Information 


Once the structure of DNA had been revealed in 1953, it became 
evident that the sequence of amino acids in a polypeptide was speci- 
fied by the sequence of nucleotides in the DNA of a gene. It seemed. 
very unlikely that DNA could serve as a direct, physical template for 
the assembly of a protein. Instead, it was presumed that the informa- 
tion stored in the nucleotide sequence was present in some type of 
genetic code. With the discovery of messenger RNA as an inter- 
mediate in the flow of information from DNA to protein, attention 
turned to the manner in which a sequence written in a ribonucleo- 
tide “alphabet” might code for a sequence in an “alphabet” consisting 
of amino acids. 


The Properties of the Genetic Code 


One of the first models of the genetic code was presented by the 
physicist George Gamow, who proposed that each amino acid in a 
polypeptide was encoded by three sequential nucleotides. In other 
words, the code words, or codons, for amino acids were nucleotide 
triplets. Gamow arrived at this conclusion by a bit of armchair logic. 
He reasoned that it would require at least three nucleotides for each 
amino acid to have its own unique codon. Consider the number of 
words that can be spelled using an alphabet containing four different 
letters corresponding to the four possible bases that can be present at 
a particular site in DNA (or mRNA). There are four possible one- 
letter words, 16 (4”) possible two-letter words, and 64 (4°) possible 
three-letter words. Because there are 20 different amino acids (words) 
that have to be specified, codons must contain at least three succes- 
sive nucleotides (letters). The triplet nature of the code was soon 
verified in a number of insightful genetic experiments conducted 
by Francis Crick, Sydney Brenner, and colleagues at Cambridge 
University.° 

In addition to proposing that the code was triplet, Gamow also 
suggested that it was overlapping. Even though this suggestion proved 


Anyone looking to read a short research paper that conveys a feeling for the induc- 
tive power and elegance of the early groundbreaking studies in molecular genetics 
can find these experiments on the genetic code in Nature 192:1227, 1961. (See Cell 
128:815, 2007 for discussion of this classic experiment.) 


to be incorrect, it raises an interesting question concerning the 
genetic code. Consider the following sequence of nucleotides: 


—AGCAUCGCAUCGA— 


If the code is overlapping, then the ribosome would move along the 
mRNA one nucleotide at a time, recognizing a new codon with each 
move. In the preceding sequence, AGC would specify an amino acid, 
GCA would specify the next amino acid, CAU the next, and so on. In 
contrast, if the code is nonoverlapping, each nucleotide along the 
mRNA would be part of one, and only one, codon. In the preceding 
sequence, AGC, AUC, and GCA would specify successive amino acids. 

A conclusion as to whether the genetic code was overlapping or 
nonoverlapping could be inferred from studies of mutant proteins, 
such as the mutant hemoglobin responsible for sickle cell anemia. In 
sickle cell anemia, as in most other cases that were studied, the 
mutant protein was found to contain a single amino acid substitu- 
tion. If the code is overlapping, a change in one of the base pairs in 
the DNA would be expected to affect three consecutive codons 
(FIGURE 11.37) and, therefore, three consecutive amino acids in the 
corresponding polypeptide. If, however, the code is nonoverlapping 
and each nucleotide is part of only one codon, then only one amino 
acid replacement would be expected. These and other data indicated 
that the code is nonoverlapping. 

Given a triplet code that can specify 64 different amino acids 
and the reality that there are only 20 amino acids to be specified, the 
question arises as to the function of the extra 44 triplets. If any or all 
of these 44 triplets code for amino acids, then at least some of the 
amino acids would be specified by more than one codon. A code of 
this type is said to be degenerate. As it turns out, the code is highly 
degenerate, as nearly all of the 64 possible codons specify amino 
acids. Those that do not (three of the 64) have a special “punctua- 
tion” function—they are recognized by the ribosome as termination 
codons and cause reading of the message to stop. 


Identifying the Codons 


By 1961, the general properties of the code were known, but not one 
of the coding assignments of the specific triplets had been discovered. 
At the time, most geneticists thought it would take many years to 
decipher the entire code. But a breakthrough was made by Marshall 
Nirenberg and Heinrich Matthaei, who used an enzyme polynucleo- 
tide phosphorylase to synthesize their own artificial genetic messages 
and then determine what kind of protein they encoded. The first 
message they tested was a polyribonucleotide consisting exclusively 
of uridine; the message was called poly(U). When poly(U) was added 


Base Sequence 


Original sequence Overlapping code 


...AGCATCG... 


Sequence after 
single-base 
substitution 
...AGAATCG... 


Overlapping code 
..., AGA GAA AATATCTCG... 


...,AGC GCACATATCTCG... 
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and the materials necessary for protein synthesis (ribosomes and 
various soluble factors), the system followed the artificial messen- 
ger’s instructions and manufactured a polypeptide. The assembled 
polypeptide was analyzed and found to be polyphenylalanine—a 
polymer of the amino acid phenylalanine. Nirenberg and Matthaei 
had thus shown that the codon UUU specifies phenylalanine. 

Over the next four years, the pursuit was joined by a number of 
laboratories, and synthetic mRNAs were constructed to test the 
amino acid specifications for all 64 possible codons. The result was 
the universal decoder chart for the genetic code shown in FIGURE 11.38. 
The decoder chart lists the nucleotide sequence for each of the 64 
possible codons in an mRNA. Instructions for reading the chart are 
provided in the accompanying figure legend.” 

The first exceptions to the assignments of the codons shown in 
Figure 11.38 were found to occur in mitochondrial mRNAs. For 
example, in human mitochondria, UGA is read as tryptophan rather 
than stop, AUA is read as methionine rather than isoleucine, and 
AGA and AGG are read as stop rather than arginine. More recently, 
exceptions have been found, here and there, in the nuclear DNA 
codons of protists and fungi. It is evident, however, that these minor 
deviations have evolved as secondary changes from the standard 
genetic code and that the genetic code shown in Figure 11.38 can be 
considered to be universal, that is, present in all organisms. We can 
further conclude that all known organisms present on Earth today 
share a common evolutionary origin. 

Examination of the codon chart in Figure 11.38 indicates that 
the amino acid assignments are distinctly nonrandom. If one looks 
for the codon boxes for a specific amino acid, they tend to be clus- 
tered within a particular portion of the chart. Clustering reflects the 
similarity in codons that specify the same amino acid. As a result of 
this similarity in codon sequence, spontaneous mutations causing 
single base changes in a gene often will not produce a change in the 
amino acid sequence of the corresponding protein. A change in 
nucleotide sequence that does not affect amino acid sequence is said 
to be synonymous, whereas a change that causes an amino acid sub- 
stitution is said to be nonsynonymous. These two types of mutations 
are depicted in FIGURE 11.39, which describes several of the differ- 
ent types of mutations discussed in this chapter. Synonymous 
changes (Figure 11.39a) are much less likely to change an organ- 
ism’s phenotype than are nonsynonymous changes (Figure 11.39b). 


7As described on page 442, ribosomes always identify an AUG as the codon to start poly- 
peptide synthesis. In these early experiments with synthetic polynucleotides, the steps 
using bacterial extracts to synthesize polypeptides were carried out at unusually high 
Mg** concentrations, which allowed the ribosome to initiate translation at any codon. 


Nonoverlapping code 
...,AGCATC... 


Nonoverlapping code 
...,AGAATC... 


FIGURE 11.37 The distinction between an overlapping and a nonoverlapping genetic code. The effect on the information content of an mRNA by a 
single base substitution depending on whether the code is overlapping or nonoverlapping. The affected codons are underlined in red. 
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1st letter 


2nd 
a < letter 


Examples of tRNAs 


\ 
4 
\ 
\ 
ai 
3 
\ 
\ 


Se ACG 
~~ ._ |Codon: UGC 


te 


eo GUG 
~~. _ |Codon: CAC 


CCU 
aes, Codon: GGA 


3rd letter 


FIGURE 11.38 The genetic code. This universal decoder chart lists each of the 64 possible mRNA codons and the corresponding amino acid specified by 
that codon. To use the chart to translate the codon UGC, for example, find the first letter (U) in the indicated row on the left. Follow that row to the right 
until you reach the second letter (G) indicated at the top; then find the amino acid that matches the third letter (C) in the row on the right. UGC specifies the 
insertion of cysteine. Each amino acid (except two) has two or more codons that order its insertion, which makes the code degenerate. A given amino acid 
tends to be encoded by related codons. This feature reduces the likelihood that base substitutions will result in changes in the amino acid sequence of a 
protein. Amino acids with similar properties also tend to be clustered. Amino acids with acidic side chains are shown in red, those with basic side chains in 
blue, those with polar uncharged side chains in green, and those with hydrophobic side chains in brown. As discussed in the following section, decoding in 
the cell is carried out by tRNAs, a few of which are illustrated schematically in the right side of the figure. UAA, UAG, and UGA are read as stop codons. 


Base Sequence of RNA 


Original sequence 
...AGC AUC UGU... 


Synonymous mutation 
FIGURE 11.39 Genes can be subject to several types of mutations. Red lines denote Nonsynonymous mutation 
codons. (a) Synonymous mutations (in this case, AUC to AUA) do not affect an amino acid (c) ... AGC AUCUGA... — Nonsense mutation 
coding assignment, but they can still have a phenotypic effect by altering pre-mRNA — 
splicing events, translation efficiency, mRNA stability, and so on (b) Nonsynonymous (or 
missense) mutations (in this case, AUC to AUG) cause a change of a single amino acid in 
the polypeptide sequence. (c) Nonsense mutations (or premature termination codons) (in Frameshift mutations 
this case UGU to UGA) cause the ribosome to stop translating the mRNA at the site of 
mutation, thereby terminating protein synthesis prematurely. (d) Frameshift mutations, 
which can be caused by the insertion or deletion of one or two bases, move the 
ribosome into an incorrect reading frame, causing an abnormal amino acid sequence 
from the point of mutation onward. 


Sequences after insertion or deletion of one or two bases 


Consequently, nonsynonymous changes are much more likely to be 
selected for or against by natural selection. Now that we have 
sequenced the genomes of related organisms, such as those of chim- 
panzees and humans, we can look directly at the sequences of 
homologous genes and see how many changes are synonymous or 
nonsynonymous. Genes possessing an excess of nonsynonymous 
substitutions in their coding regions are likely to have been influ- 
enced by natural selection (see the footnote on page 396). 

The “safeguard” aspect of the code goes beyond its degeneracy. 
Codon assignments are such that similar amino acids tend to be 
specified by similar codons. For example, codons of the various 
hydrophobic amino acids (depicted as brown boxes in Figure 11.38) 
are clustered in the first two columns of the chart. Consequently, a 
mutation that results in a base substitution in one of these codons is 
likely to substitute one hydrophobic residue for another. In addition, 
the greatest similarities between amino acid-related codons occur in 
the first two nucleotides of the triplet, whereas the greatest variabil- 
ity occurs in the third nucleotide. For example, glycine is encoded by 
four codons, all of which begin with the nucleotides GG. An expla- 
nation for this phenomenon is revealed in the following section in 
which the role of the transfer RNAs is described. 


REVIEW | 


1. Explain what is meant by stating that the genetic 
code is triplet and nonoverlapping? What did the 
finding that DNA base compositions varied greatly 
among different organisms suggest about the 
genetic code? How was the identity of the UUU 
codon established? 


. Distinguish between the effects of a base 
substitution in the DNA on a nonoverlapping and 
an overlapping code. 

. Distinguish between a synonymous and a 
nonsynonymous base change. 


11.15 Decoding the Codons: 
The Role of Transfer RNAs 


Nucleic acids and proteins are like two languages written with differ- 
ent types of letters. This is the reason protein synthesis is referred to 
as translation. Translation requires that information encoded in the 
nucleotide sequence of an MRNA be decoded and used to direct the 
sequential assembly of amino acids into a polypeptide chain. 
Decoding the information in an mRNA is accomplished by transfer 
RNAs, which act as adaptors. On one hand, each tRNA is linked to a 
specific amino acid (as an aa-tRNA), while on the other hand, that 
same tRNA is able to recognize a particular codon in the mRNA. 
The interaction between successive codons in the mRNA and spe- 
cific aa-tRNAs leads to the synthesis of a polypeptide with an 
ordered sequence of amino acids. To understand how this occurs, 
we must first consider the structure of tRNAs. 


The Structure of tRNAs 


In 1965, after seven years of work, Robert Holley of Cornell University 
reported the first base sequence of an RNA molecule, that of a yeast 
transfer RNA that carries the amino acid alanine (FIGURE 11.40a). 
This tRNA is composed of 77 nucleotides, 10 of which are modified 
from the standard four nucleotides of RNA (A, G, C, U), as indicated 
by the shading in the figure. 

Over the following years, other tRNA species were purified and 
sequenced, and a number of distinct similarities present in all the 
different tRNAs became evident (Figure 11.40b). All tRNAs were 
roughly the same length—between 73 and 93 nucleotides—and all 
had a significant percentage of unusual bases that were found to 
result from enzymatic modifications of one of the four standard 
bases after it had been incorporated into the RNA chain, that is, 
posttranscriptionally. In addition, all tRNAs had sequences of nucle- 
otides in one part of the molecule that were complementary to 
sequences located in other parts of the molecule. Because of these 


oe — ——. 
OH 
FIGURE 11.40 Two-dimensional structure of i \ site for amino OH 
transfer RNAs. (a) The nucleotide sequence of the ¢ acid attachment A 3 
cloverleaf form of a yeast tRNA‘. The amino acid A ¢ 
becomes linked to the 3’ end of the tRNA, 5'p-G - C ¢ 
whereas the opposite end bears the anticodon, in G - ¢ 5 Po i 
this case IGC. The function of the anticodon is G : U I _ | Amino acid 
discussed later. In addition to the four bases, A, U, ¢ = G 4_ | acceptor stem 
G, and C, this tRNA contains yw, pseudouridine i . ; {_ [ 
(see Figure 11.13a); T, ribothymidine; ml, methyl- ie & t_6 
inosine; |, inosine; me2G, dimethylguanosine; D, 2G. i) \ NUN, D arm “Ut -—¢ T arm 
dihydrouridine; and meG, methylguanosine. The C A~U-G-C-G-ti@> A-G-G-¢-C é ea, | 7+ N78N\y 
sites of the10 modified bases in this tRNA are JA “GCG rae ard re G Ke a Wat ail i R 
indicated by the color shading. (b) Generalized i c-0 C- GA “ a G UN’ Ro wre, ae yo Meee 
representation of tRNA in the cloverleaf form. The U -A GE ae ° . iN 
bases common to all tRNAs (both prokaryotic and C-G Anticodon | -7 ae Variable 
eukaryotic) are indicated by letters; R is an ae or. a arm 
invariant purine, Y is an invariant pyrimidine, w is uU e oN, 
an invariant pseudouridine. The greatest variability %® al y R 
among tRNAs occurs in the V (variable) arm, which G-C’ Sice® 
can range from 4 to 21 nucleotides. There are two i, Anticodon s Anticodon 
a 


sites of minor variability in the length of the D arm. 
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similar way to form a structure that can be drawn in two dimensions 
as a cloverleaf. The base-paired stems and unpaired loops of the 
tRNA cloverleaf are shown in Figures 11.42 and 11.43. The unusual 
bases, which are concentrated in the loops, disrupt hydrogen bond 
formation in these regions and serve as potential recognition sites 
for various proteins. All mature tRNAs have the triplet sequence 
CCA at their 3’ end. These three nucleotides may be encoded in the 
tRNA gene (in many prokaryotes) or added enzymatically (in eukar- 
yotes). In the latter case, a single talented enzyme, known as CCA 
Adding Enzyme, adds all three nucleotides in the proper order with- 
out the benefit of a DNA or RNA template. 

Up to this point we have considered only the secondary, or 
two-dimensional, structure of these adaptor molecules. Transfer 
RNAs fold into a unique and defined tertiary structure. X-ray dif- 
fraction analysis has shown that tRNAs are constructed of two dou- 
ble helices arranged in the shape of an L (FIGURE 11.41b). Those 
bases that are found at comparable sites in all tRNA molecules (the 
invariant bases of Figure 11.40b) are particularly important in gen- 
erating the L-shaped tertiary structure. The common shapes of 
tRNAs reflects the fact that all of them take part in a similar series 
of reactions during protein synthesis. However, each tRNA has 
unique features that distinguish it from other tRNAs. As discussed 
in the following section, it is these unique features that make it pos- 
sible for an amino acid to become attached enzymatically to the 
appropriate (cognate) tRNA. 

Transfer RNAs translate a sequence of mRNA codons into a 
sequence of amino acid residues. Information contained in the 
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mRNA is decoded through the formation of base pairs between 
complementary sequences in the transfer and messenger RNAs (see 
Figure 11.47). Thus, as with other processes involving nucleic acids, 
complementarity between base pairs lies at the heart of the transla- 
tion process. The part of the tRNA that participates in this comple- 
mentary interaction with the codon of the mRNA is a stretch of 
three sequential nucleotides, called the anticodon, that is located in 
the middle loop of the tRNA molecule (Figure 11.41a). This loop is 
invariably composed of seven nucleotides, the middle three of which 
constitute the anticodon. The anticodon is located at one end of the 
L-shaped tRNA molecule opposite from the end at which the amino 
acid is attached (Figure 11.410). 

Given the fact that 61 different codons can specify an amino 
acid, a cell might be expected to have at least 61 different tRNAs, 
each with a different anticodon complementary to one of the codons 
of Figure 11.38. But recall that the greatest similarities among codons 
that specify the same amino acid occur in the first two nucleotides of 
the triplet, whereas the greatest variability in these same codons 
occurs in the third nucleotide of the triplet. Consider the 16 codons 
ending in U. In every case, if the U is changed to a C, the same amino 
acid is specified (first two lines of each box in Figure 11.38). Similarly, 
in most cases, a switch between an A and a Gat the third site is also 
without effect on amino acid determination. The interchangeability 
of the base of the third position led Francis Crick to propose that the 
same transfer RNA may be able to recognize more than one codon. 
His proposal, termed the wobble hypothesis, suggested that the steric 
requirement between the anticodon of the tRNA and the codon of 
the mRNA is very strict for the first two positions but is more flexible 


FIGURE 11.41 The structure of a tRNA. (a) Two-dimensional structure of a yeast phenylalanyl-tRNA with the various regions of the molecule color-coded 
to match part b. (b) Three-dimensional structure of tRNA": derived from X-ray crystallography. The amino acceptor (AA) arm and the TwC (T) arm form a 
continuous double helix, and the anticodon (AC) arm and the D arm form a partially continuous double helix. These two helical columns meet to form an 


L-shaped molecule. 


Source: Courtesy of Michael Carson, University of Alabama at Birmingham. 


at the third position. As a result, two codons that specify the same 
amino acid and differ only at the third position should use the same 
tRNA in protein synthesis. Once again, a Crick hypothesis proved to 
be correct. 

The rules governing the wobble at the third position of the 
codon are as follows (FIGURE 11.42): U of the anticodon can pair 
with A or G of the mRNA; G of the anticodon can pair with U or C 
of the mRNA; and I (inosine, which is derived from guanine in the 
original tRNA molecule) of the anticodon can pair with U, C, or A of 
the mRNA. As a result of the wobble, the six codons for leucine, for 
example, require only three tRNAs. 


tRNA Charging 


It is critically important during polypeptide synthesis that each 
transfer RNA molecule be attached to the correct (cognate) amino 
acid, a process known as tRNA charging. Amino acids are covalently 
linked to the 3’ ends of their cognate tRNA(s) by an enzyme called 
an aminoacyl-tRNA synthetase (aaRS) (FIGURE 11.43). Although 
there are many exceptions, organisms typically contain 20 different 
aminoacyl-tRNA synthetases, one for each of the 20 amino acids 
incorporated into proteins. Each of the synthetases is capable of 
charging all of the tRNAs that are appropriate for that amino acid 
(i.e, any tRNA whose anticodon recognizes one of the various 
codons specifying that amino acid, as indicated in Figure 11.38). 
Aminoacyl-tRNA synthetases provide an excellent example of the 
specificity of protein-nucleic acid interactions. Certain common 
features must exist among all tRNA species specifying a given amino 
acid to allow a single aminoacyl-tRNA synthetase to recognize all of 
these tRNAs while, at the same time, discriminating against all of the 
tRNAs for other amino acids. Information concerning the structural 
features of tRNAs that cause them to be selected or rejected as sub- 
strates has come primarily from two sources: 


1. Determination of the three-dimensional structure of these 
enzymes by X-ray crystallography, which allows investigators to 
identify which sites on the tRNA make direct contact with the 
protein. As illustrated in Figure 11.43, the two ends of the tRNA— 
the acceptor stem and anticodon—are particularly important for 
recognition by most of these enzymes. 


2. Determination of the changes in a tRNA that cause the molecule 
to be aminoacylated by a noncognate synthetase. It is found, for 
example, that a specific base pair in a tRNA" (the G-U base pair 
involving the third G from the 5’ end of the molecule in Figure 
11.40a) is the primary determinant of its interaction with the ala- 
nyl-tRNA synthetase. Insertion of this specific base pair into the 
acceptor stem of a tRNA?” or a tRNA’ is sufficient to cause 


Leu Leu Leu 

5P 5'P 5'P 
ZAIN ZINN ZAIN 
GA U GA U GAG 
CUA C UG CG. UU 


mRNA codons 5| —4L—_L-_ —L_-1_ 1 3: 


FIGURE 11.43 Three-dimensional portrait of the interaction between 
a tRNA and its aminoacyl-tRNA synthetase. The crystal structure of E. coli 
glutaminyl-tRNA synthetase (in blue) complexed with tRNAS"" (in red and 
yellow). The enzyme recognizes this specific tRNA and discriminates against 
others through interactions with the acceptor stem and anticodon of the tRNA. 
Source: From Thomas A. Steitz, Science vol. 246, cover of 12/1/89 issue; 

© 1989, reprinted with permission from AAAS. 


these tRNAs to be recognized by alanyl-tRNA synthetase and to 
be aminoacylated with alanine. 


Aminoacyl-tRNA synthetases carry out the following two-step 
reaction: 


Ist step: ATP + amino acid — aminoacyl-AMP + PP, 
2nd step: aminoacyl-AMP +tRNA — aminoacyl-tRNA + AMP 


In the first step, the energy of ATP activates the amino acid by for- 
mation of an adenylated amino acid, which is bound to the enzyme. 
This is the primary energy-requiring step in the chemical reactions 
leading to polypeptide synthesis. Subsequent events, such as the 
transfer of the amino acid to the tRNA molecule (step 2 above), and 
eventually to the growing polypeptide chain, are thermodynamically 
favorable. The PP; produced in the first reaction is subsequently 
hydrolyzed to P;, further driving the overall reaction toward the for- 
mation of products (page 408). As we will see later, energy is 
expended during protein synthesis, but it is not used in the forma- 
tion of peptide bonds. In the second step, the enzyme transfers its 
bound amino acid to the 3’ end of a cognate tRNA. 


Leu lle lle lle 

5'P 5'P 5'P 5'P 
ZINN ZN ZINN ZAIN 
GAG UA | UA | UA | 
C UC A U U AUC A UA 


FIGURE 11.42 The wobble in the interaction between codons and anticodons. In some cases, the nucleotide at the 5’ end of the tRNA anticodon is 
capable of pairing with more than one nucleotide at the 3’ end (third position) of the mRNA codon. Consequently, more than one codon can use the same 
tRNA. The rules for pairing in the wobble scheme are indicated in the figure as well as in the text. 
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Among the 20 amino acids incorporated into proteins, some are 
quite similar structurally to others. The synthetases that catalyze 
reactions dealing with these “similar-looking” amino acids typically 
possess a second editing site in addition to the catalytic site. Should 
the synthetase happen to place an inappropriate amino acid on a 
tRNA in the catalytic site, a proofreading mechanism of the enzyme 
is activated and the bond between the amino acid and tRNA is sev- 
ered in the editing site. 

Several methods have been developed that allow investigators to 
synthesize proteins, either in the test tube or within cells, that contain 
unnatural amino acids, that is, amino acids other than the 20 nor- 
mally incorporated by the translation machinery. These unnatural 
amino acids are not generated by modification of normal amino 
acids after their incorporation into the polypeptide but are directly 
encoded within the mRNA. This “expansion” of the genetic code gen- 
erally involves the use of an experimentally modified tRNA that rec- 
ognizes one of the stop codons and a cognate amino acyl-tRNA 
synthetase that specifically recognizes the unnatural amino acid. The 
unnatural amino acid is then incorporated into the polypeptide chain 
wherever a particular stop codon is encountered within an mRNA. 
Using these methods, researchers can synthesize a protein containing 
chemical groups capable of reporting on the protein’s activities, or 
they can design proteins with novel structures and functions for use 
as potential drugs or for other commercial applications. 


REVIEW | 


1. Why are tRNAs referred to as adaptor molecules? 
What aspects of their structure do tRNAs have in 
common? 


2. Describe the nature of the interaction between 
tRNAs and aminoacyl-tRNA synthetases. Describe 
the nature of the interaction between tRNAs and 
mRNAs. What is the wobble hypothesis? 


11.16 Translating Genetic 
Information: Initiation 


Protein synthesis, or translation, may be the most complex syn- 
thetic activity in a cell. The assembly of a protein requires all the 
various tRNAs with their attached amino acids, ribosomes, a mes- 
senger RNA, numerous proteins having different functions, cations, 
and GTP. The complexity is not surprising considering that protein 
synthesis requires the incorporation of 20 different amino acids in 
the precise sequence dictated by a coded message written in a lan- 
guage that uses different characters. In the following discussion, we 
will draw most heavily on translation mechanisms as they operate 
in bacterial cells, which is simpler and better understood. The 
process is remarkably similar in eukaryotic cells. The synthesis of a 
polypeptide chain can be divided into three rather distinct activi- 
ties: initiation of the chain, elongation of the chain, and termination 
of the chain. 

Once it attaches to an MRNA, a ribosome always moves along 
the mRNA from one codon to the next, that is, in consecutive blocks 
of three nucleotides. To ensure that the proper triplets are read, the 
ribosome attaches to the mRNA at a precise site, termed the 


initiation codon, which is specified as AUG. Binding to this codon 
automatically puts the ribosome in the proper reading frame so 
that it correctly reads the entire message from that point on. For 
example, in the following case, 


—CUAGUUACAUGCUCCAGUCCGU— 


the ribosome moves from the initiation codon, AUG, to the next 
three nucleotides, CUC, then to CAG, and so on along the entire line. 


Initiation of Translation in Prokaryotes 


The basic steps in the initiation of translation in bacterial cells are 
illustrated in Figure 11.44. 


INITIATION STEP 1: BRINGING THE SMALL 
RIBOSOMAL SUBUNIT TO THE INITIATION CODON 
As seen in FIGURE 11.44, an mRNA does not bind to an intact ribo- 
some, but to the small and large subunits in separate stages. The first 
major step of initiation is the binding of the small ribosomal subunit 
to the first AUG sequence (or one of the first) in the message, which 
serves as the initiation codon.® How does the small subunit select 
the initial AUG codon as opposed to some internal one? Bacterial 
mRNAs possess a specific sequence of nucleotides (called the Shine- 
Dalgarno sequence after its discoverers) that resides five to 10 nucle- 
otides before the initiation codon. The Shine-Dalgarno sequence is 
complementary to a sequence of nucleotides near the 3’ end of the 
16S ribosomal RNA of the small ribosomal subunit. 


rRNA Gee ue eA 3’ 


mRNA( -=- GGAGGA--—-— ) 5’ 


Interaction between these complementary sequences on the mRNA 
and rRNA positions the 30S subunit at the AUG initiation codon. 

Several of the steps outlined in Figure 11.44 require the help of 
soluble proteins, called initiation factors (designated as IFs in bac- 
teria and eIFs in eukaryotes). Bacterial cells require three initiation 
factors—IF1, IF2, and IF3—which attach to the 30S subunit (step 1, 
Figure 11.44). IF2 is a GTP-binding protein required for attachment 
of the first aminoacyl-tRNA. IF3 may prevent the large (50S) sub- 
unit from joining prematurely to the 30S subunit, and also facilitate 
entry of the appropriate initiator aa-tRNA. IF1 stabilizes attachment 
of the 30S subunit to the mRNA and prevents the initiator aa-tRNA 
from entering the wrong site on the ribosome. 


INITIATION STEP 2: BRINGING THE FIRST AA-TRNA 
INTO THE RIBOSOME Tf one examines the codon assign- 
ments (Figure 11.38), it can be seen that AUG is more than just an 
initiation codon; it is also the only codon for methionine. In fact, 
methionine is always the first amino acid to be incorporated at the 
N-terminus of a nascent polypeptide chain. (In bacteria, the initial 
methionine bears a formyl group, making it N-formylmethionine.) 
The methionine (or N-formylmethionine) is subsequently removed 
enzymatically from the majority of newly synthesized proteins. Cells 
possess two distinct methionyl-tRNAs: one used to initiate protein 


8GUG is also capable of serving as an initiation codon and is found as such in a few 
natural messages. When GUG is used, N-formylmethionine is still utilized to form 
the initiation complex despite the fact that internal GUG codons specify valine. 


Figure 11.44), where the anticodon loop of the tRNA binds to the 443 
AUG codon of the mRNA. IF1 and IF3 are released. 


3 GTP 


INITIATION STEP 3: ASSEMBLING THE COMPLETE 
INITIATION COMPLEX Once the initiator tRNA is bound to 
the AUG codon and IF3 is displaced, the large subunit (50S) joins 
the complex and the GTP bound to IF2 is hydrolyzed (step 3, 
Figure 11.44). GTP hydrolysis drives a conformational change in the 
ribosome that is required for the release of IF2-GDP, allowing trans- 
lation to proceed. 


Initiation of Translation in Eukaryotes 


mRNA 

is. 

GTP 

5 

— if Eukaryotic ribosomes are larger than bacterial ribosomes, having 
large and small subunits that are 60S and 43S, contain larger rRNAs, 
Met and also contain additional proteins not found in bacterial ribosomes. 
Eukaryotic mRNAs possess specialized features, such as 5’ caps and 3’ 
tails. Not surprisingly, initiation of translation in eukaryotes is quite 
different and more complex than the corresponding process in bacte- 
UAC ria and, moreover, it is an important step at which control over gene 
e | expression is exerted (see Section 12.19). Eukaryotic cells require at 
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least 12 initiation factors, known as eIFs to denote eukaryotic IFs, 
comprising a total of more than 25 polypeptide chains. As indicated 
in FIGURE 11.45, several of these eIFs (e.g., eIFl, eIF1A, eIF5, and 
eIF3) bind to the 40S subunit, which prepares the subunit for binding 
to the mRNA. The initiator tRNA linked to a methionine also binds 
to the 40S subunit prior to its interaction with the mRNA. The initia- 
E tor tRNA enters the P site of the subunit in association with eIF2- 
GTP. Once these events have occurred, the small ribosomal subunit 
with its associated initiation factors and charged tRNA (which 
together form a 43S preinitiation complex) is ready to find the 5’ end 
of the mRNA, which bears the methylguanosine cap (page 420). 

The 43S complex is originally recruited to the mRNA with the 
help of a cluster of initiation factors that are already bound to 
the mRNA (Figure 11.45). Among these factors: (1) eIF4E binds to the 
5‘ cap of the eukaryotic mRNA; (2) eIF4A moves along the 5’ end of 
the message removing any double-stranded regions that would 
interfere with the movement of the 43S complex along the mRNA; 
and (3) eIF4G serves as a linker between the 5’ capped end and the 


. P; 
mn 3 tRNA 
a GDP 
elF3 


FIGURE 11.44 Initiation of protein synthesis in bacteria. In step 1, 


elF2-GTP 


MeL ie 
elF1 


Poly (A) binding 
protein (PABP) 


elF4G 


mRNA 


initiation of translation begins with the association of the 30S ribosomal 


subunit with the mRNA at the AUG initiation codon, a step that requires IF1 


and IF3. The 30S ribosomal subunit binds to the mRNA at the AUG 
initiation codon as the result of an interaction between a complementary 
nucleotide sequence on the rRNA and mRNA, as discussed in the text. In 


AOS subunit 


elF4A AUG 


mRNA complex 


43S complex elF4E 


FIGURE 11.45 Initiation of protein synthesis in eukaryotes. As 


discussed in the text, initiation begins with the union of two complexes. 
One (called the 43S complex) contains the 40S ribosomal subunit bound to 
several initiation factors (elFs) and the initiator tRNA, whereas the other 
contains the mRNA bound to a separate group of initiation factors. This 
union is mediated by an interaction between elF3 on the 43S complex and 
elF4G on the mRNA complex. elF1 and elF1A are thought to induce a 
conformational change in the small ribosomal subunit that opens a “latch” 
to accommodate the entry of the mRNA. Once the 43S complex has bound 
to the 5’ end of the mRNA, it scans along the message until it reaches the 
appropriate AUG initiation codon. 


step 2, the formylmethionyl-tRNA™* becomes associated with the mRNA 
and the 30S ribosomal subunit complex by binding to IF2-GTP. In step 3, 
the 50S subunit joins the complex, GTP is hydrolyzed, and IF2-GDP is 
released. The initiator tRNA enters the P site of the ribosome, whereas all 
subsequent tRNAs enter the A site (see Figure 11.47). 


synthesis and a different one to incorporate methiony] residues into 
the interior of a polypeptide. The initiator aa-tRNA is positioned by 
the IF2 initiation factor within the P site of the ribosome (step 2, 


444 3’ polyadenylated end of the mRNA (Figure 11.45). Thus EIF4G, in 
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effect, converts a linear mRNA into a circular message. 

Once the 43S complex binds to the 5’ end of the mRNA, the 
complex then scans along the message until it reaches a recognizable 
sequence of nucleotides (typically 5° —-CCACCAUGC—33’) that 
contains the AUG initiation codon. Once the 43S complex reaches 
the appropriate AUG codon, the GTP bound to elIF2 is hydrolyzed, 
and the large (60S) subunit joins the complex. The formation of the 
complete 80S ribosome requires the activity of another GTP-bind- 
ing protein, called eIF5B-GTP, whose bound GTP is also hydrolyzed. 
Hydrolysis of the two bound GTPs and formation of the complete 
ribosome are accompanied by the release of all of the initiation fac- 
tors. These events leave the anticodon of the initiator tRNA bound to 
the AUG start codon in the P site of the assembled ribosome. The 
complex is now ready for the first step in elongation.” '° 


The Role of the Ribosome 


Now that we have reached a point where we have assembled a 
complete ribosome, we can look more closely at the structure and 
function of this multisubunit structure. Ribosomes are molecular 
machines, similar in some respects to the molecular motors 
described in Chapter 9. During translation, a ribosome undergoes a 
repetitive cycle of mechanical changes that is driven with energy 
released by GTP hydrolysis. Unlike myosin or kinesin, which simply 
move along a physical track, ribosomes move along an mRNA “tape” 
containing encoded information. Ribosomes, in other words, are 
programmable machines: The information stored in the mRNA 
determines the sequence of aminoacyl-tRNAs that the ribosome 
accepts during translation. Another feature that distinguishes ribo- 
somes from many other cellular machines is the importance of their 
component RNAs. Ribosomal RNAs play major roles in selecting 
tRNAs, ensuring accurate translation, binding protein factors, and 
polymerizing amino acids (discussed in the Experimental Pathways 
in Section 11.20). 

Early studies using high-resolution cryoelectron microscopic 
imaging techniques (Section 18.16) revealed the ribosome to be a 
highly irregular structure with bulges, lobes, channels, and bridges 
(see Figure 2.57). These studies, carried out primarily by Joachim 
Frank and colleagues at Columbia University, also provided descrip- 
tions of major conformational changes occurring within the ribo- 
some during translation. During the 1990s, major advances were 
made in the crystallization of ribosomes, and by the end of the dec- 
ade the first reports of the X-ray crystallographic structure of 
prokaryotic ribosomes had appeared. Figures 11.48a and b show 
the overall structure of the two ribosomal subunits of a bacterial 
ribosome as revealed by X-ray crystallography. More recently, X-ray 


°Not all mRNAs are translated following attachment of the small ribosomal subunit to 
the 5’ end of the message. Many viral mRNAs and a relatively small number of cellular 
mRNAs, most notably those utilized during mitosis or during periods of stress, are 
translated as the result of the ribosome attaching to the mRNA at an internal ribosome 
entry site (IRES), which may be located at some distance from the 5’ end of the 
message. 


The description presented here pertains to steady-state translation in which an 
mRNA is translated repeatedly by numerous ribosomes. The first (or pioneer) round 
of translation has several unique properties. The cap at the 5’ end of the mRNA, for 
example, is bound by a heterodimeric cap-binding complex (CBC) rather than by the 
protein elF4E. The differences between the pioneer and steady-state rounds of trans- 
lation are discussed in Cell 142:368, 2010. 


(a) (b) 
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(a’) (b’) 
Binding sites 
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Exit channel 


(c) 70S 


FIGURE 11.46 Model of the bacterial ribosome based on X-ray 
crystallographic data, showing tRNAs bound to the A, P, and E sites of 
the two ribosomal subunits. (a—b) Views of the 30S and 50S subunits, 
respectively, with the three bound tRNAs shown at the interface between 
the subunits. (a’—b’) Drawings corresponding to the structures shown in 
parts a and b. The drawing in a’ of the 30S subunit shows the approximate 
locations of the anticodons of the three tRNAs and their interaction with the 
complementary codons of the mRNA. The drawing in b’ of the 50S subunit 
shows the tRNA sites from the reverse direction. The amino acid acceptor 
ends of the tRNAs of the A and P sites are in close proximity within the 
peptidyl transfer site of the subunit, where peptide bond formation occurs. 
The binding sites for the elongation factors EF-Tu and EF-G are located on 
the protuberance at the right side of the subunit. (c) Drawing of the 70S 
bacterial ribosome showing the space between the two subunits that is 
spanned by each tRNA molecule and the channel within the 50S subunit 
through which the nascent polypeptide exits the ribosome. 

Source: (a-b) From Jamie H. Cate et al., Courtesy of Harry F. Noller, Science 
285:2100, 1999; (c) From A. Liljas, Science 285:2078, 1999; both © 1999, 
Reprinted with permission from AAAS. 


crystallography has been successfully applied to the study of the 
much larger eukaryotic ribosome, which shares the same core struc- 
ture and primary functions with its prokaryotic counterpart. At 
the same time, advances in cryoelectron microscopy have allowed 


the atomic structure of the ribosome to be determined without 
crystallization, making it easier to determine ribosome structure in 
a wide range of different states during the process of translation. 
These structural studies, in combination with biochemical analyses, 
have opened a window into the complex dynamic function of the 
ribosome. 

Each ribosome has three sites for association with transfer RNA 
molecules. These sites, termed the A (aminoacyl) site, the P 
(peptidyl) site, and the E (exit) site, receive each tRNA in succes- 
sive steps of the elongation cycle, as described in the following sec- 
tion. The positions of tRNAs bound to the A, P, and E sites of both 
the small and large ribosomal subunit are shown in FIGURE 11.46a,b. 
The tRNAs bind within these sites and span the gap between the two 
ribosomal subunits (Figure 11.46c). The anticodon ends of the bound 
tRNAs contact the small subunit, which plays a key role in decoding 
the information contained in the mRNA. In contrast, the amino 
acid-carrying ends of the bound tRNAs contact the large subunit, 
which plays a key role in catalyzing peptide bond formation. Other 
major features revealed by these high-resolution structural studies 
include the following: 


1. The interface between the small and large subunits forms a rela- 
tively spacious cavity (Figure 11.46c) that is lined almost exclu- 
sively by RNA. The side of the small subunit that faces this cavity 
is lined along its length by a single continuous double-stranded 
RNA helix. This helix is shaded in the two-dimensional structure 
of the 16S rRNA in Figure 11.3. The surfaces of the two subunits 
that face one another contain the binding sites for the mRNA and 
incoming tRNAs and, thus, are of key importance for the func- 
tion of the ribosome. The fact that these surfaces consist largely 
of RNA supports the proposal that primordial ribosomes were 
composed exclusively of RNA (page 430). 


2. The active site, where amino acids are covalently linked to one 
another, also consists of RNA. This catalytic portion of the large 
subunit resides in a deep cleft, which protects the newly formed 
peptide bond from hydrolysis by the aqueous solvent. 


3. The mRNA is situated in a narrow groove that winds around the 
neck of the small subunit, passing through the A, P, and E sites. 
Prior to entering the A site, the mRNA is stripped of any second- 
ary structure it might possess by an apparent helicase activity of 
the ribosome. 


4. A tunnel runs completely through the core of the large subunit 
beginning at the active site. This tunnel provides a passageway 
for translocation of the elongating polypeptide through the ribo- 
some (Figure 11.46c). 

5. Most of the proteins of the ribosomal subunits have multiple 
RNA-binding sites and are ideally situated to stabilize the com- 
plex tertiary structure of the rRNAs. 


REVIEW | 


1. How does the small ribosomal subunit know where to 
start initiation of translation in prokaryotes? How 
does this step differ in eukaryotes? 


2. Which initiation system, prokaryotic or eukaryotic, 
involves scanning? 


e@ 


11.17 Translating Genetic 
Information: Elongation and 
Termination 


After initiation, synthesis of a polypeptide chain proceeds to elonga- 
tion and then termination. The basic steps in the process of elonga- 
tion of translation in bacterial cells are illustrated in FIGURE 11.47. 
This series of steps is repeated over and over as amino acids are 
polymerized one after another into the growing polypeptide chain. 


Elongation Step 1: Aminoacyl-tRNA 
Selection 


With the charged initiator tRNA in place within the P site, the ribo- 
some is available for entry of the second aminoacyl-tRNA into the 
vacant A site, which is the first step in elongation (step 1, Figure 11.47a). 
Binding of the second aminoacyl-tRNA to the A site requires a 
GTPase, known as elongation factor EF-Tu (or Tu) in bacteria and 
eEF1A in eukaryotes. EF-Tu is required to deliver aminoacyl-tRNAs 
to the A site of the ribosome. Although any aminoacyl-tRNA—Tu- 
GTP complex can enter the site, only one whose anticodon is com- 
plementary to the mRNA codon situated in the A site will trigger the 
necessary conformational changes within the ribosome that cause 
the tRNA to remain bound to the mRNA in the decoding center. 
Once the proper aminoacyl-tRNA—Tu-GTP is bound to the mRNA 
codon, the GTP is hydrolyzed and the Tu-GDP complex released, 
leaving the newly arrived aa-tRNA situated in the ribosome’s A site. 


Elongation Step 2: Peptide 
Bond Formation 


At the end of the first step, the two amino acids, attached to their 
separate tRNAs, are juxtaposed to one another and precisely aligned 
to chemically interact (Figure 11.46a’, b’). In the case of the first 
peptide bond ofa new protein chain, the P site would hold the initia- 
tor tRNA, but for all other peptide bonds of the protein, the P site 
will hold a tRNA attached to the growing peptide chain. In all cases, 
the A site holds the newly arriving aminoacyl-tRNA that needs to be 
added in order for the chain to elongate. To remember that the P site 
holds the growing peptide chain, and the A site holds the newly 
arriving amino acid that will be added onto the chain, it is useful to 
note that “peptide chain” and “aminoacyl-tRNA” start with the let- 
ters P and A, respectively. The second step in the elongation cycle is 
the formation of a peptide bond between these two amino acids 
(step 2, Figure 11.47a). Peptide bond formation is accomplished as 
the amine nitrogen of the aa-tRNA in the A site carries out a nucleo- 
philic attack on the carbonyl carbon of the amino acid bound to the 
tRNA of the P site, displacing the P-site tRNA (Figure 11.47b). Asa 
result of this reaction, the tRNA bound in the A site has an attached 
dipeptide, whereas the tRNA in the P site is deacylated. Peptide 
bond formation occurs spontaneously without the input of external 
energy. The reaction is catalyzed by peptidyl transferase, a com- 
ponent of the large subunit of the ribosome. For years it was assumed 
that the peptidyl transferase was one of the proteins of the ribosome. 
Then, as the catalytic powers of RNA became apparent, attention 
shifted to the ribosomal RNA as the catalyst for peptide bond forma- 
tion. It has now been shown that the peptidyl transferase activity of 
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the ribosome is not provided by any of the protein subunits but is 
instead performed by the P-site tRNA with assistance from nucleo- 
tides of the large ribosomal RNA molecule. In other words, peptidyl 
transferase is a ribozyme, composed of nucleotide residues of the 


FIGURE 11.47 Schematic representation of the steps in elongation during translation in 
bacteria. (a) In step 1, an aminoacyl-tRNA whose anticodon is complementary to the second 
codon of the mRNA enters the empty A site of the ribosome. The binding of the tRNA is 
accompanied by the release of EF-Tu-GDP. In step 2, peptide bond formation is accomplished by 
the transfer of the nascent polypeptide chain from the tRNA in the P site to the aminoacyl-tRNA of 
the A site, forming a dipeptidyl-tRNA in the A site and a deacylated tRNA in the P site. The 
reaction is catalyzed by a part of the rRNA acting as a ribozyme. In step 3, the binding of EF-G 
and the hydrolysis of its associated GTP results in the translocation of the ribosome relative to the 
mRNA. Translocation is accompanied by the movement of the deacylated tRNA and peptidyl- 
tRNA into the E and P sites, respectively. In step 4, the deacylated tRNA leaves the ribosome, and 
a new aminoacyl-tRNA enters the A site. (b) Peptide bond formation and the subsequent 
displacement of the deacylated tRNA. A ribosome can catalyze the incorporation of 10 to 20 
amino acids into a growing polypeptide per second, which is roughly 10 million times greater than 
that observed in the uncatalyzed reaction using model substrates in solution. 


tRNA and ribosome, a subject discussed in the Experimental Pathways 
in Section 11.20. 


Elongation Step 3: Translocation 


The formation of the first peptide bond leaves one end of the tRNA 
molecule of the A site still attached to its complementary codon on 
the mRNA and the other end of the molecule attached to a dipeptide 
(step 2, Figure 11.47a). The tRNA of the P site is now devoid of any 
linked amino acid. The following step, called translocation, is 
characterized by a small (6°) ratchet-like motion of the small subunit 
relative to the large subunit, which is depicted in Figure 11.48a. As a 
result of this ratcheting motion, the ribosome moves three nucleo- 
tides (one codon) along the mRNA in the 5’3’ direction (step 3, 
Figure 11.47a). Translocation is accompanied by the movement of 
(1) the dipeptidyl-tRNA from the A site to the P site of the ribosome, 
and (2) the deacylated tRNA from the P site to the E site. As these 
movements occur, both of these tRNAs remain hydrogen-bonded to 
their codons in the mRNA. An intermediate stage in the transloca- 
tion process has been visualized by cryoelectron microscopy, which 
shows the tRNAs occupying partially translocated “hybrid states.” In 
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FIGURE 11.48 Structural model of the stages of translocation during translational elongation in bacteria. (a) Depiction of the shift from the classic to 
the ratcheted state of the prokaryotic ribosome, which involves a 6° rotation of the subunits relative to one another. (Detailed discussion of this rotation can 
be found in Ann. Rev. Biophys. 39:227, 2010) (b) The steps that occur during translocation as described in the text. 


Source: (ab) From T. Martin Schmeing and V. Ramakrishnan, Nature 461:1238, 2009. Reprinted with permission from Macmillan Publishers Ltd. Courtesy of 


Venki Ramakrishnan. 


these hybrid states, the anticodon ends of the tRNAs still reside in 
the A and P sites of the small subunit, while the acceptor ends of the 
tRNAs have moved into the P and E sites of the large subunit. The 
tRNAs are said to occupy A/P and P/E hybrid sites, respectively. The 
various steps that occur during the process of translocation are 
shown in FIGURE 11.48b. The shift from the “classic.” nonratcheted 
state in step 1, Figure 11.48), to the hybrid, ratcheted state (step 2) 
occurs spontaneously, that is, without the involvement of other fac- 
tors. It appears that the ribosome can spontaneously oscillate back 
and forth between these two states. Once it has shifted to the hybrid 
state, a GTP-bound elongation factor (EF-G in bacteria and eEF2 in 
eukaryotes) binds to the ribosome (step 3), stabilizing the ribosome 
in the ratcheted state and thereby preventing the movement of the 
tRNAs back to the classic A/A and P/P conformation. Then, hydroly- 
sis of the bound GTP generates a conformational change that moves 
the mRNA and associated anticodon loops of the tRNAs relative to 
the small ribosomal subunit, which places the bound tRNAs in the 
E/E and P/P states, leaving the A site empty (step 4). At the same 
time, the ribosome is reset to the nonratcheted state. Following this 
reaction, EF-G-GDP dissociates from the ribosome (step 5). 


Elongation Step 4: Releasing 
the Deacylated tRNA 


In the final step of elongation (step 4, Figure 11.47a), the deacylated 
tRNA leaves the ribosome, emptying the E site. 

For each cycle of elongation, at least two molecules of GTP are 
hydrolyzed: one during aminoacyl-tRNA selection and one during 
translocation. Each cycle of elongation takes about one-twentieth of 
a second, most of which is probably spent sampling aa-tRNAs from 


the surrounding cytosol. Once the peptidyl-tRNA has moved to the 
P site by translocation, the A site is once again open to the entry of 
another aminoacyl-tRNA, in this case one whose anticodon is com- 
plementary to the third codon (Figure 11.47a). Once the third 
charged tRNA is associated with the mRNA in the A site, the dipep- 
tide from the tRNA of the P site is displaced by the aa-tRNA of the A 
site, forming the second peptide bond and, as a result, a tripeptide 
attached to the tRNA of the A site. The tRNA in the P site is once 
again devoid of an amino acid. Peptide bond formation is followed 
by translocation of the ribosome to the fourth codon and release of 
the deacylated tRNA, whereupon the cycle is ready to begin again. 
Thus, each tRNA steps through three positions on the ribosome: 
First, it binds at the A site, then after peptide bond formation it shifts 
to the P site, and then after the next round of peptide bond formation 
it moves to the E site from which it exits the ribosome. This sequence, 
A to P to E, can be remembered by thinking of a monkey (the tRNA) 
swinging from tree to tree and remembering the acronym “Ape.” 

We have seen in this section how the ribosome moves three 
nucleotides (one codon) at a time along the mRNA. The particular 
sequence of codons in the mRNA that is utilized by a ribosome (i.e., 
the reading frame) is fixed at the time the ribosome attaches to the 
initiation codon at the beginning of translation. Some of the most 
destructive mutations are ones in which a single base pair is either 
added to or deleted from the DNA. Consider the effect of an addi- 
tion of one or two nucleotides or a deletion of one or two nucleotides 
in a given sequence (see Figure 11.39d). The ribosome moves along 
the mRNA in the incorrect reading frame from the point of muta- 
tion through the remainder of the coding sequence. Mutations of 
this type are called frameshift mutations. Frameshift mutations 
lead to the assembly of an entirely abnormal sequence of amino 
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448 acids from the point of the mutation. It can be noted that, after more 


LL YadLdVHD 


UI91O1g OL VNY OL VNG :ewbog jei1UueD eu e 


than two decades in which it was assumed that the ribosome always 
moved from one triplet to the next, several examples were discov- 
ered in which the mRNA contains a recoding signal that causes the 
ribosome to change its reading frame, either backing up one nucleo- 
tide (a shift to the —1 frame) or skipping a nucleotide (a shift to the 
+1 frame). 

A wide variety of antibiotics have their effect by interfering with 
various aspects of protein synthesis in bacterial cells (Section 3.8). 
Streptomycin, for example, selectively binds to the small ribosomal 
subunit of bacterial cells, causing certain of the codons of the mRNA 
to be misread, thus increasing the synthesis of aberrant proteins. 
Because the antibiotic does not bind to eukaryotic ribosomes, it has 
no effect on translation of the host cell’s mRNAs. Resistance by bac- 
teria to streptomycin can be traced to changes in ribosomal proteins, 
particularly $12. 


Termination 


As shown in Figure 11.38, three of the 64 trinucleotide codons func- 
tion as stop codons that terminate polypeptide assembly rather than 
encode an amino acid. No tRNAs exist whose anticodons are com- 
plementary to a stop codon." When a ribosome reaches one of these 
codons, UAA, UAG, or UGA, the signal is read to stop further elon- 
gation and release the polypeptide associated with the last tRNA. 


There are minor exceptions to this statement. It is stated in this chapter that codons 
dictate the incorporation of 20 different amino acids. In actual fact, there is a 21st 
amino acid, called selenocysteine, that is incorporated into a very small number of 
polypeptides. Selenocysteine is a rare amino acid that contains the metal selenium. In 
mammals, for example, it occurs in a dozen or so proteins involved in oxidation- 
reduction reactions. Selenocysteine has its own tRNA, called tRNA®©S, but lacks its 
own aa-tRNA synthetase. This unique tRNA is recognized by the seryl-tRNA syn- 
thetase, which attaches a serine to the 39 end of the tRNA*™. Following attachment, 
the serine is altered enzymatically to a selenocysteine. Selenocysteine is encoded by 
UGA, which is one of the three stop condons. Under most circumstances UGA is read 
as a termination signal. In a few cases, however, the UGA is followed by a folded 
region of the mRNA that binds a special elongation factor that causes the ribosome to 
recruit a tRNA‘ into the A site rather than a termination factor. A 22nd amino acid, 
pyrrolysine, is encoded by another stop codon (UAG) in the genetic code of some 
archaea. Pyrrolysine has its own tRNA and aa-tRNA synthetase. 
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FIGURE 11.49 Structural model of the first step of translational 
termination in bacteria. When the ribosome reaches a UAA or UAG stop 
codon, a class | RF enters the A site and becomes aligned in a manner 
similar to that of an incoming aa-tRNA. A domain of the RF that resides in 
the peptidyl transferase center of the ribosome promotes the hydrolysis of 
the ester bond that links the polypeptide to the peptidyl-tRNA in the P site, 
thereby releasing the completed polypeptide. 

Source: From H. S. Zaher and R. Green, Cell 136:747, 2009, with permission 
from Elsevier. Courtesy of Rachel Green, Johns Hopkins School of Medicine. 


Termination requires release factors. Release factors can be 
divided into two groups: class I RFs, which recognize stop codons in 
the A site of the ribosome, and class II RFs, which are GTP-binding 
proteins (G proteins) whose roles are not well understood. Bacteria 
have two class I RFs: RF1, which recognizes UAA and UAG stop 
codons, and RF2, which recognizes UAA and UGA stop codons. 
Eukaryotes have a single class I RE, eRF1, which recognizes all three 
stop codons. Class I RFs enter the A site of the ribosome as shown in 
FIGURE 11.49. Once in the A site, a conserved tripeptide at one end 
of the release factor is thought to interact with the stop codon in the 
A site and to trigger a crucial conformational change affecting sev- 
eral nucleotides of the mRNA of the small ribosomal subunit. The 
ester bond linking the nascent polypeptide chain to the tRNA is then 
hydrolyzed, and the completed polypeptide is released. At this point, 
hydrolysis of the GTP bound to the class II RF (RF3 or eRF3) leads to 
the release of the class I RF from the ribosome’s A site. The final steps 
in translation include the release of the deacylated tRNA from the P 
site, dissociation of the mRNA from the ribosome, and disassembly 
of the ribosome into its large and small subunits in preparation for 
another round of translation. These final steps require a number of 
protein factors and are quite different in eukaryotes and bacteria. In 
bacterial cells, these proteins include EF-G, IF3, and RRF (ribosome 
recycling factor), which promotes ribosomal subunit separation. 
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1. Describe some of the ways that the initiation step of 
translation differs from the elongation steps of 
translation. 


2. During translation elongation, it can be said that an 
aminocyl-tRNA enters the A site, a peptidyl-tRNA 
enters the P site, and a deacylated tRNA enters the E 


site. Explain how each of these events occurs. 
e—_____! 


11.18 mRNA Surveillance 
and Quality Control 


Because the three termination codons can be readily formed by sin- 
gle base changes from many other codons (see Figure 11.38), one 
might expect mutations to arise that produce stop codons within the 
coding sequence of a gene (Figure 11.39c). Mutations of this type, 
termed nonsense mutations, have been studied for decades and 
are responsible for roughly 30 percent of inherited disorders in 
humans. Premature termination codons, as they are also called, or 
PTCs for short, are also commonly introduced into mRNAs during 
splicing. Cells possess an mRNA surveillance mechanism capable of 
detecting messages with premature termination codons. In most 
cases, mRNAs containing such mutations are translated only once 
before they are selectively destroyed by a process called nonsense- 
mediated decay (NMD). NMD protects the cell from producing 
nonfunctional, shortened proteins. 

How is it possible for a cell to distinguish between a legitimate 
termination codon that is supposed to end translation of a message 
and a premature termination codon? To understand this feat, we 
have to reconsider events that occur during pre-mRNA processing in 
mammalian cells. It wasn’t mentioned earlier, but when an intron is 
removed by a spliceosome, a complex of proteins is deposited on the 


transcript 20-24 nucleotides upstream from the newly formed exon- 
exon junction. This landmark of the splicing process is called the 
exon-junction complex (EJC), which stays with the mRNA until 
it is translated. In a normal mRNA, the termination codon is typi- 
cally present in the last exon and so the last EJC is upstream from that 
site. As an mRNA undergoes its initial round of translation, the EJCs 
are thought to be displaced or inactivated by the advancing ribo- 
some. Any message with an EJC left on it when the ribosome finishes 
translation is marked for degradation by ribonucleases. Because nor- 
mal stop codons occur after the last intron, and hence downstream of 
the last EJC, by the time a ribosome finishes translation and dissoci- 
ates, the message will have had all its EJCs removed. But consider 
what would happen during translation of an mRNA that contained a 
premature termination codon. The ribosome would stop at the site of 
the mutation and then dissociate, leaving any EJCs that were attached 
to the mRNA downstream of the site of premature termination. 
These remaining EJCs set in motion a series of events leading to the 
enzymatic destruction of the abnormal message. 

NMD is best known for its role in eliminating mRNAs tran- 
scribed from mutant genes containing premature stop codons, such 
as those responsible for cystic fibrosis or muscular dystrophy. For 
heterozygotes that carry one normal and one disease-causing allele, 
NMD can eliminate the protein encoded by the mutant allele, thus 
preventing a potentially toxic effect. For homozygotes carrying two 
disease-causing alleles, NMD can actually prove very harmful, 
because it prevents the production of shortened, mutant proteins 
that often possess some activity. A number of biotechnology compa- 
nies are developing drugs to treat diseases caused by premature stop 
codons, either by causing the exon containing the stop codon to 
be skipped during splicing (a strategy called exon skipping), by 
interfering with NMD, or by allowing ribosomes to read through a 
PTC rather than terminating translation. All three types of drugs 


should allow RNAs with nonsense codons to be translated into pro- 449 


teins that might still retain some function, rather than having the 
entire message degraded by NMD. Even though the protein encoded 
by the mutant gene in the presence of such drugs will be abnormally 
short or have an abnormal amino acid sequence, it may still possess 
enough residual function to rescue the patients from an otherwise 
fatal disease. Two drugs under development, eteplirsen and 
SRP-4053, both of which promote exon skipping in the muscular 
dystrophy gene dystrophin to avoid NMD of the mutant gene, began 
new clinical trials in 2014. 

NMD serves as another reminder of the opportunistic nature of 
biological evolution. Just as evolution has taken advantage of the 
presence of introns to facilitate exon shuffling (page 429), it has also 
used the process by which these gene inserts are removed to establish 
a mechanism of quality control, which ensures that only untainted 
mRNAs advance to a stage where they can be translated. 


REVIEW | 


1. How does the effect of a nonsense mutation differ 
from that of a frameshift mutation? Why? 


oe | 


11.19 Polyribosomes 


When a messenger RNA in the process of being translated is exam- 
ined in the electron microscope, a number of ribosomes are invari- 
ably seen to be attached along the length of the mRNA thread. This 
complex of ribosomes and mRNA is called a polyribosome, 
or polysome (FIGURE 11.50a). Each of the ribosomes initially 
assembles from its subunits at the initiation codon and then moves 
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FIGURE 11.50 Polyribosomes. (a) Schematic drawing of a polyribosome 
(polysome). (b) This three-dimensional model was generated from cryoelectron 
tomograms of bacterial polysomes in the act of translation in vitro. To obtain the 
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tomograms, preparations were vitrified (frozen into glass-like solid ice without 
formation of ice crystals) in liquid ethane at -196°C. Electron micrographs were 
then taken with the specimen positioned at various tilt angles, providing the 
data to generate a three-dimensional reconstruction. (c) Electron micrograph of 
a grazing section through the outer edge of a rough ER cisterna. The ribosomes 
are aligned in loops and spirals, indicating their attachment to mRNA molecules 
to form polysomes. 

Source: (a-b) From Florian Brandt et al., Courtesy of Wolfgang Baumeister, Cell 
136:267, 2009; by permission of Elsevier; (c) Don W. Fawcett/Photo Researchers/ 
(b) Getty Images, Inc. 


450 from that point toward the 3’ end of the mRNA until it reaches a 
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termination codon. As each ribosome moves away from the initia- 
tion codon, another ribosome attaches to the mRNA and begins its 
translation activity. The rate at which translation initiation occurs 
varies with the mRNA being studied; some mRNAs have a much 
greater density of associated ribosomes than others. The simultane- 
ous translation of the same mRNA by numerous ribosomes greatly 
increases the rate of protein synthesis within the cell. Studies utiliz- 
ing cryoelectron tomography (Section 18.16) have suggested that the 
three-dimensional arrangement and orientation of ribosomes within 
a “free” (i.e, nonmembrane bound) polysome are quite highly 
ordered. A model of a polysome generated by this technique is 
shown in Figure 11.50b. The ribosomes that comprise this model 
polysome are densely packed and have adopted a “double-row” 
array. Moreover, each of the individual ribosomes is oriented so that 
its nascent polypeptide (red or green filaments) is situated at its 
outer surface facing the cytosol. It is suggested that this orientation 
maximizes the distance between nascent chains, thereby minimizing 
the likelihood that the nascent chains will interact with one another 
and, possibly, aggregate. Figure 11.50c shows an electron micro- 
graph of polysomes bound to the cytosolic surface of the ER mem- 
brane. It is presumed that these polysomes had been engaged in the 
synthesis of membrane and/or organelle proteins at the time the cell 
was fixed (page 268). The ribosomes within each polysome appear 
to be organized at the surface of the ER membrane into a circular 
loop or spiral. 

Now that we have described the basic events of transcription 
and translation, it is fitting to close the chapter with a picture show- 
ing both processes in a prokaryotic cell (FIGURE 11.51). Unlike 
eukaryotic cells, in which transcription occurs in the nucleus and 
translation occurs in the cytoplasm with many intervening steps, the 
corresponding activities in bacterial cells are tightly coupled. Thus 
protein synthesis in bacterial cells begins on mRNA templates well 
before the mRNA has been completely synthesized. The synthesis of 
an mRNA proceeds in the same direction as the movement of ribo- 
somes translating that message, that is, from the 5’ to 3’ end. 
Consequently, as soon as an RNA molecule has begun to be pro- 
duced, the 5’ end is available for attachment of ribosomes. The 
micrograph in Figure 11.51 shows DNA being transcribed, nascent 
mRNAs being synthesized, and ribosomes that are translating each 
of the nascent mRNAs. This single micrograph thus displays the 
entire central dogma in action. 


FIGURE 11.51 Visualizing transcription and translation. Electron 
micrograph of portions of an E. coli chromosome engaged in transcription. 
The DNA is seen as faint lines running the length of the photo, whereas the 
nascent MRNA chains are seen to be attached at one of their ends, 
presumably by an RNA polymerase molecule. The particles associated with 
the nascent RNAs are ribosomes in the act of translation; in bacteria, 
transcription and translation occur simultaneously. The RNA molecules 
increase in length as the distance from the initiation site increases. 

Source: From Oscar L. Miller, Jr., Barbara A. Hamkalo, and C. A. Thomas, 
Science 169:392, 1970; © 1970, reprinted with permission from AAAS. 
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1. What is a polyribosome? How does its formation 
differ in prokaryotes and eukaryotes? 
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11.20 EXPERIMENTAL PATHWAYS 


The Role of RNA as a Catalyst 


Research in biochemistry and molecular biology through the 
1970s had solidified our understanding of the roles of proteins 
and nucleic acids. Proteins were the agents that made things 
happen in the cell, and the enzymes that accelerated the rates of 
chemical reactions within organisms. Nucleic acids, on the other 
hand, were the informational molecules in the cell, storing genetic 
instructions in their nucleotide sequence. The division of labor 
between protein and nucleic acids seemed as well defined as any 
distinction that had been drawn in the biological sciences. Then, 
in 1981, a paper was published that began to blur this distinction. | 


Thomas Cech and his co-workers at the University of Colorado 
had been studying the process by which the ribosomal RNA 
precursor synthesized by the ciliated protozoan Tetrahymena 
thermophila was converted to mature rRNA molecules. The 
T. thermophila pre-rRNA contained an intron of about 400 nucleo- 
tides that had to be excised from the primary transcript before the 
adjoining segments could be linked together (ligated). 

Cech had previously found that nuclei isolated from the 
cells were able to synthesize the pre-rRNA precursor and carry 
out the entire splicing reaction. Splicing enzymes had yet to be 


isolated from any type of cell, and it seemed that Tetrahymena 
might be a good system in which to study these enzymes. The 
first step was to isolate the pre-rRNA precursor in an intact state 
and determine the minimum number of nuclear components 
that had to be added to the reaction mixture to obtain accurate 
splicing. It was found that when isolated nuclei were incubated in 
a medium containing a low concentration of monovalent cations 
(5mM NH,*), the pre-rRNA molecule was synthesized but the 
intron was not excised. This allowed the researchers to purify 
the intact precursor, which they planned to use as a substrate to 
assay for splicing activity in nuclear extracts. They found, however, 
that when the purified precursor was incubated by itself at higher 
concentrations of NH4* in the presence of Mg’* and a guanosine 
phosphate (e.g., GMP or GTP), the intron was spliced from the 
precursor (FIGURE 1).' Nucleotide-sequence analysis confirmed 
that the small RNA that had been excised from the precursor was 
the intron with an added guanine-containing nucleotide at its 5’ 
end. The additional nucleotide was shown to be derived from the 
GTP that was added to the reaction mixture. 

Splicing of an intron is a complicated reaction that requires 
recognition of the sequences that border the intron, cleavage of 
the phosphodiester bonds on both sides of the intron, and ligation 
of the adjoining fragments. All efforts had been made to remove 
any protein that might be clinging to the RNA before it was tested 
for its ability to undergo splicing. The RNA had been extracted 
with detergent-phenol, centrifuged through a gradient, and 
treated with a proteolytic enzyme. There were only two reasonable 
explanations: Either splicing was accomplished by a protein that 
was bound very tightly to the RNA, or the pre-rRNA molecule was 
capable of splicing itself. The latter idea was not easy to accept. 

To resolve the question of the presence of a protein contami- 
nant, Cech and co-workers utilized an artificial system that could not 
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FIGURE 1 Purified 32P-labeled Tetrahymena ribosomal RNA, 
transcribed at different (NH4)2SO, concentrations, was analyzed by 
polyacrylamide gel electrophoresis. The numbers at the top indicate 
the concentration of ammonium sulfate. Two groups of samples are 
shown, “native” and heat-denatured. Samples of the latter group were 
boiled for five minutes in buffer and cooled on ice to dissociate any 
molecules that might be held together by hydrogen bonding between 
complementary bases. The right two columns contain bacterial 16S and 
23S rRNAs, which provide markers of known size with which the other 
bands in the gel can be compared. It can be seen from the positions of 
the bands that as the ammonium sulfate concentration rises, the 
presence of a small RNA appears whose size is equal to that of the 
isolated intron (IVS, intervening sequence). These data provided the first 
indication that the rRNA was able to excise the intron without the aid of 
additional factors. 


Source: From T. R. Cech et al., Cell 27:488, 1981; by permission of 
Elsevier. 


possibly contain nuclear splicing proteins.* The DNA that encodes 
the rRNA precursor was synthesized in E. coli, purified, and used 
as a template for in vitro transcription by a purified bacterial RNA 
polymerase. The pre-rRNA synthesized in vitro was then puri- 
fied and incubated by itself in the presence of monovalent and 
divalent ions and a guanosine compound. Because the RNA had 
never been in a cell, it could not possibly be contaminated by cel- 
lular splicing enzymes. Yet the isolated pre-rRNA underwent the 
precise splicing reaction that would have occurred in the cell. The 
RNA had to have spliced itself. 

As a result of these experiments, RNA was shown to be 
capable of catalyzing a complex, multistep reaction. Calculations 
indicated that the reaction had been speeded approximately 10 
billion times over the rate of the noncatalyzed reaction. Thus, 
like a protein enzyme, the RNA was active in small concentration 
and was able to greatly accelerate a chemical reaction. The pri- 
mary distinction between this RNA and “standard proteinaceous 
enzymes” was that the RNA acted on itself rather than on an 
independent substrate. Cech called the RNA a “ribozyme.” 

In 1983, a second, unrelated example of RNA catalysis was 
discovered.° Sidney Altman of Yale University and Norman Pace 
of the National Jewish Hospital in Denver were collaborating on 
the study of ribonuclease P, an enzyme involved in the processing 
of a transfer RNA precursor in bacteria. The enzyme was unusual 
in being composed of both protein and RNA. When incubated in 
buffers containing high concentrations (60 mM) of Mg?*, the puri- 
fied RNA subunit was found capable of removing the 5’ end of the 
tRNA precursor (lane 7, FIGURE 2) just as the entire ribonuclease 
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FIGURE 2 The results of polyacrylamide gel electrophoresis of 
reaction mixtures that had contained the precursor of tyrosinyl-tRNA 
(pTyr) and the precursor of another RNA called 4.5S RNA (p4.5). We 
will focus only on pTyr, which is normally processed by ribonuclease P into 
two RNA molecules, Tyr and 5’-Tyr (which is the 5’ end of the precursor). 
The positions at which these three RNAs (pTyr, Tyr, and 5’-Tyr) migrate 
during electrophoresis are indicated to the left of the gel. Lane 1 shows 
the RNAs that appear in the reaction mixture when pTyr (and p4.5) are 
incubated with the complete ribonuclease P. Very little of the pTyr 
remains in the mixture; it has been converted into the two products (Tyr 
and 5’-Tyr). Lane 5 shows the RNAs that appear in the reaction mixture 
when pTyr is incubated with the purified protein component of 
ribonuclease P. The protein shows no ability to cleave the tRNA precursor 
as evidenced by the absence of bands where the two products would 
migrate. In contrast, when pTyr is incubated with the purified RNA 
component of ribonuclease P (lane 7), the pTyr is processed as efficiently 
as if the intact ribonucleoprotein had been used. 


Source: From Cecilia Guerrier-Tokada et al., Cell 35:850, 1983; by 
permission of Elsevier. 
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FIGURE 3 A molecular model of a portion of the catalytic RNA 
subunit of bacterial ribonuclease P (white) and its bound substrate, 
precursor tRNA (red). The site on the precursor tRNA where cleavage by 
the ribozyme occurs is indicated by a yellow sphere. It is interesting to 
note that RNase P in human mitochondria is a protein enzyme and does 
not require an RNA component for catalytic activity. This mitochondrial 
RNase P enzyme is presumably an example of the evolutionary takeover 
of a catalytic activity by a protein from an ancient RNA. 


Source: Courtesy of Michael E. Harris and Norman R. Pace. 


P molecule would have done inside the cell. The reaction products 
of the in vitro reaction included the processed, mature tRNA mol- 
ecule. In contrast, the isolated protein subunit of the enzyme was 
devoid of catalytic activity (lane 5, Figure 2). 

To eliminate the possibility that a protein contaminant was 
actually catalyzing the reaction, the RNA portion of ribonuclease 
P was synthesized in vitro from a recombinant DNA template. 
As with the RNA that had been extracted from bacterial cells, 
this artificially synthesized RNA, without any added protein, was 
capable of accurately cleaving the tRNA precursor.* Unlike the 
rRNA processing enzyme studied by Cech, the RNA of ribonu- 
clease P acted on another molecule as a substrate rather than on 
itself. Thus, it was demonstrated that ribozymes have the same 
catalytic properties as proteinaceous enzymes. A model of the 
interaction between the catalytic RNA subunit of ribonuclease P 
and a precursor tRNA substrate is shown in FIGURE 3. 

The demonstration that RNA could catalyze chemical 
reactions created the proper atmosphere to reinvestigate an 
old question: Which component of the large ribosomal subunit 
is the peptidyl transferase, that is, the catalyst for peptide bond 
formation? During the 1970s, a number of independent findings 
raised the possibility that ribosomal RNAs might be doing more 
than simply acting as a scaffold to hold ribosomal proteins in 
the proper position so they could catalyze translation. Included 
among the evidence were the following types of data: 


1. Certain strains of E. coli carry genes that encode bacteria- 
killing proteins called colicins. One of these toxins, colicin 
E3, was known to inhibit protein synthesis in sensitive bacte- 
rial cells. Ribosomes that had been isolated from colicin E3- 
treated cells appeared perfectly normal by most criteria, but 
they were unable to support protein synthesis in vitro. Further 
analysis of such ribosomes revealed that the defect resides 
in the rRNA, not the ribosomal proteins. The 16S RNA of the 
small subunit is cleaved by the colicin about 50 nucleotides 
from its 3’ end, which renders the entire subunit unable to 
support protein synthesis.° 


2. Treatment of large ribosomal subunits with ribonuclease 
Ty, an enzyme that cleaves bonds between accessible RNA 
nucleotides, destroyed the subunit’s ability to carry out the 
peptidyl transferase reaction.® 


3. A wide variety of studies on antibiotics that inhibited peptide 
bond formation, including chloramphenicol, carbomycin, and 
erythromycin, suggested that these drugs act on the ribosomal 
RNA, not the protein. For example, it was found that ribo- 
somes became resistant to the effects of chloramphenicol as 
the result of substitutions in the bases in the ribosomal RNA.’ 


4. Ribosomal RNAs were shown to have a highly conserved base 
sequence, much more so than the amino acid sequences of 
ribosomal proteins. Some of the regions of the ribosomal RNAs 
are virtually unchanged in ribosomes isolated from prokaryotes, 
plants, and animals, as well as ribosomes isolated from mito- 
chondria and chloroplasts. The fact that rRNA sequences are 
so highly conserved suggests that the molecules have a crucial 
role in the function of the ribosome.®*” In fact, a 1975 paper by 
C. R. Woese and co-workers states, “Since little or no correla- 
tion exists between these conserved regions and known ribo- 
somal protein binding sites, the implication is strong that large 
areas of the RNA are directly involved in ribosomal function.”® 


Following the discovery of catalytic RNAs by Cech’s and 
Altman's laboratories, investigation into the roles of ribosomal 
RNAs intensified. A number of studies by Harry Noller and his col- 
leagues at the University of California, Santa Cruz, pinpointed the 
site in the ribosomal RNA that resides in or around the peptidyl 
transferase center.’ In one study, it was shown that transfer RNAs 
bound to the ribosome protect specific bases in the rRNA of the 
large subunit from attack by specific chemical agents. Protection 
from chemical attack is evidence that the tRNA must be situated 
very close to the rRNA bases that are shielded.'° Protection is lost 
if the 3’ end of the tRNA (the end with the CCA bonded to the 
amino acid) is removed. This is the end of the tRNA involved in 
peptide bond formation, which would be expected to reside in 
close proximity to the peptidyl transferase site. 

Attempts to ascribe a particular function to isolated ribo- 
somal RNA had always met with failure. It was presumed that, 
even if ribosomal RNAs did have specific functions, the presence 
of ribosomal proteins was most likely required to keep the rRNAs 
in their proper conformation. Considering that the ribosomal 
proteins and rRNAs have coevolved for billions of years, the two 
molecules would be expected to be interdependent. Despite this 
expectation, the catalytic power of isolated rRNA was finally dem- 
onstrated by Noller and coworkers in 1992.'' Working with par- 
ticularly stable ribosomes from Thermus aquaticus, a bacterium 
that lives at high temperatures, Noller treated preparations of the 
large ribosomal subunit with a protein-extracting detergent (SDS), 
a protein-degrading enzyme (proteinase K), and several rounds of 
phenol, a protein denaturant. Together, these agents removed at 
least 95 percent of the protein from the ribosomal subunit, leav- 
ing the rRNA behind. Most of the 5 percent of the protein that 
remained associated with the rRNA was demonstrated to con- 
sist of small fragments of the ribosomal proteins. Yet, despite the 
removal of nearly all of the protein, the rRNA retained 80 percent of 
the peptidyl transferase activity of the intact subunit. The catalytic 
activity was blocked by chloramphenicol and by treatment with 
ribonuclease. When the RNA was subjected to additional treat- 
ments to remove the small amount of remaining protein, the prep- 
aration lost its catalytic activity. Because it is very unlikely that the 
remaining protein had any catalytic importance, it was generally 


agreed that these experiments demonstrated that the peptidyl 
transferase activity of the ribosome is carried out by a ribozyme. 
Final confirmation of this conclusion awaited results obtained 
by X-ray crystallographic studies. As might be expected from the 
complexity of ribosomal subunits, obtaining high-quality crystals 
of such particles was a very difficult endeavor. The greatest suc- 
cesses in this field were obtained by Ada Yonath and colleagues 
at the Weizmann Institute in Israel, who had turned to using ribo- 
somes prepared from prokaryotic extremophiles (page 13) that 
lived in waters of high temperature or high salinity. Efforts in the 
X-ray crystallographic analysis of ribosomal subunits finally culmi- 
nated with the publication in 2000 of high-resolution crystal struc- 
tures of (1) the large ribosomal subunit of Haloarcula marismortui, 
an archaebacterium that lives in the Dead Sea, by Thomas Steitz, 
Peter Moore, and their colleagues at Yale University; ' and (2) the 
30S subunit of the thermophilic bacterium, Thermus thermophi- 
lus, by Venkatraman Ramakrishnan and colleagues at Cambridge 
University'? and by Ada Yonath and colleagues in Germany and 
Israel.'4 High-resolution structures of the entire ribosome, both 
bacterial and eukaryotic, are now available, which allow the 
dynamic changes in ribosomal-substrate structure to be deter- 
mined at different stages in the process of translation.'° The 
key question, once these structures were solved, was whether 
any amino acids of the ribosomal proteins would be positioned 
to catalyze the peptidyl transfer reaction. To identify the pepti- 
dyl transferase site within the large ribosomal subunit, Steitz 
and colleagues soaked crystals of these subunits with CCdA- 
phosphate-puromycin, a substance that inhibits peptide bond 
formation by binding tightly to the peptidyl transferase active 
site. Determination of the structure of these subunits at atomic 
resolution reveals the location of the bound inhibitor, and thus the 
location of the peptidyl transferase site.'° Surprisingly at the time, 
it was found that the active-site inhibitor is bound within a cleft of 
the subunit that is surrounded entirely by conserved nucleotide 
residues of the 23S rRNA. In fact, there isn’t a single amino acid 
side chain of any of the ribosomal proteins within about 18 A of 
the site where a peptide bond is synthesized. This result seemed 
to demonstrate once and for all that the ribosome is a ribozyme. 
However, as in many other examples from molecular biology, one 
has to remain careful about coming to definitive conclusions. 
Careful mutagenesis studies of all 23S rRNA nucleotides located 
near the peptidyl transfer site failed to show a strong effect on 
catalysis, suggesting that the 23S RNA might not directly partici- 
pate in catalysis,'” a conclusion supported by molecular modeling 
analyses. But if neither the ribosomal proteins nor ribosomal RNA 
participate directly in catalysis, then what does? The 2'-OH group 
of the A76 reside (the last residue of the chain, contained in the 
CCA motif) of the P-site tRNA is located very close to the alpha 
amine group of the A-site aminoacyl tRNA, and experiments in 
which the A76 residue is replaced with 2’ deoxy derivatives sug- 
gest this residue indeed plays a major catalytic role.'® 23S rRNA 
residues appear to play a role predominantly in positioning the 
substrates for better catalysis,'? suggesting that the ribosome 
is an RNA-based workbench on which the tRNA is able to cat- 
alyze peptide bond formation. One initially confusing aspect of 


the experiments showing a role for the A76 2’-OH was that the 
difference in peptidyl transfer efficiency between a normal and 
a 2’ deoxy A76 was much larger than would be predicted for a 
simple chemical catalytic effect of the OH group. This paradox 
was resolved by the discovery that the 2’-OH of the tRNA A76 
residue seems to play a role in stabilizing the conformation of the 
ribosomal RNA active site, so that at least some of the effect of 
replacing the A76 with a 2’ deoxy derivative was because the con- 
formation of the peptidyl transfer site of the ribosome became 
unstable.!? 
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. Look at the codon chart of Figure 11.38. Which codons would 


you expect to have a unique tRNA, that is, one that is used only 
for that codon? Why is it that many codons do not have their 
own unique tRNA? 


. Proflavin is a compound that inserts itself into DNA and causes 


frameshift mutations (page 447). How would the effect on the 
amino acid sequence of a proflavin-induced mutation differ 
between an overlapping and a nonoverlapping code? 


. You have just isolated a new drug that has only one effect on cell 


metabolism; it totally inhibits the breakdown of pre-rRNA to 
ribosomal RNA. After treating a culture of mammalian cells with 
this drug, you give the cells [H]uridine for 2 minutes, and then 
grow the cells in the presence of the drug in unlabeled medium 
for 4 hours before extracting the RNA and centrifuging it through 
a sucrose gradient. Draw the curves you would obtain by plot- 
ting both absorbance at 260 nm and radioactivity against the 
fraction number of the gradient. Label the abscissa (X-axis) using 
S values of RNA. 


. Using the same axes as in the previous question, draw the profile 


of radioactive RNA that you would obtain after a culture of mam- 
malian cells had been incubated for 48 hours in [2H]uridine with- 
out any inhibiting drugs present. 


. Suppose you were to construct a synthetic RNA from a repeating 


dinucleotide (e.g., AGAGAGAG) and then use this RNA as a 
messenger to synthesize a polypeptide in an in vitro protein-syn- 
thesizing system, such as that used by Nirenberg and Matthaei 
to produce polyphenylalanine. What type of polypeptide would 
you make from this particular polynucleotide? Would you expect 
to have more than one type of polypeptide produced? Why or 
why not? 


. Suppose that you found an enzyme that incorporated nucleo- 


tides randomly into a polymer without the requirement for a 
template. How many different codons would you be able to find 
in synthetic RNAs made using two different nucleotide precur- 
sors (e.g., CTP and ATP)? (An enzyme called polynucleotide 
phosphorylase catalyzes this type of reaction and was used in 
studies that identified codons.) 


. Draw the parts of a 15S globin pre-mRNA, labeling the noncod- 


ing portions. 


. What is the minimum number of GTPs that you would need to 


synthesize a pentapeptide in a bacterium? 


. It was suggested on page 437 that synonymous codon changes 


do not generally alter an organism's phenotype. Can you think of 
an occasion where this might not be true? What does this say 
about the coding requirements of the genetic material? 

Would you agree with the following statement? The discovery that 
sickle cell anemia resulted from a single amino acid change was 
proof that the genetic code was nonoverlapping. Why or why not? 


. Thalassemia is a disease characterized by mutations that convert 


amino acid codons into stop codons. Suppose you were to com- 
pare the polypeptides synthesized in vitro from mRNAs purified 
from a wide variety of thalassemia patients. How would you 
expect these polypeptides to compare? Refer to the codon 
chart of Figure 11.38; how many amino acid codons can be con- 
verted into stop codons by a single base substitution? 

Do you think it would be theoretically possible to have a genetic 
code with only two letters, A and T? If so, what would be the 
minimum number of nucleotides required to make a codon? 
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On page 430, experiments are reported that led to the synthesis 
of ribozymes with unique catalytic properties. In 2001, an artificial 
ribozyme was isolated that was capable of incorporating up to 
14 ribonucleotides onto the end of an existing RNA using an 
RNA strand as a template. The ribozyme could use any RNA 
sequence as a template and would incorporate complementary 
nucleotides into the newly synthesized RNA strand with an accu- 
racy of 98.5 percent. If you were a proponent of an ancient RNA 
world, how would you use this finding to argue your case? Would 
this prove the existence of an ancient RNA world? If not, what do 
you think would provide the strongest evidence for such a world? 


How is it possible that mRNA synthesis occurs at a greater rate in 
bacterial cells than any other class, yet very little mRNA is pre- 
sent within the cell? Try to write an equation describing the 
steady-state concentration of mRNA in terms of the rate of tran- 
scription and the rate of mRNA decay. Assume that mRNA is 
produced at a constant rate and that mRNA decays at a rate 
proportional to its concentration. (See Quantitative Tutorial Video). 


If a codon for serine is 5‘-AGC-3’, the anticodon for this triplet 
would be 5’— — —3’ (write the nucleotides on the lines). How 
would the wobble phenomenon affect this codon-anticodon 
interaction? 


One of the main arguments that proteins evolved before DNA 
(i.e., that the RNA world evolved into an RNA-protein world 
rather than an RNA-DNA world) is based on the fact that the trans- 
lational machinery involves a large variety of RNAs (e.g., tRNAs, 
rRNAs), whereas the transcriptional machinery shows no evidence 
of RNA involvement. Can you explain how such an argument 
about the stages of early evolution might be based on these 
observations? 


Frameshift mutations and nonsense mutations were described 
on pages 447 and 448. It was noted that nonsense mutations 
often lead to the destruction by NMD of an mRNA containing a 
premature termination codon. Would you expect mRNAs con- 
taining frameshift mutations to be subject to NMD? 


The arrowheads in Figure 11.14 indicate the direction of 
transcription of the various tRNA genes. What does this drawing 
tell you about the template activity of each strand of a DNA 
molecule within a chromosome? 


Genes are usually discovered by finding an abnormal phenotype 
resulting from a genetic mutation. Alternatively, they can often 
be identified by examining the DNA sequence of a genome. 
Why do you suppose that the genes encoding miRNAs were not 
discovered until very recently? 


It has recently been discovered that under stress conditions, 
ribosomes can add amino acids onto the end of growing pro- 
tein chains independently of mRNA. When translation stalls in 
eukaryotes, the small ribosomal subunit dissociates, taking the 
mRNA with it. The large subunit, containing the nascent pep- 
tide chain, binds two proteins called Rqc2 and Ltn1 which medi- 
ate direct recruitment of alanine and threonine charged tRNA, 
causing the peptide chain to continue growing by addition of 
alanine and threonine. Does this ability of the ribosome to add 
amino acids independently of mRNA argue against the RNA 
world hypothesis? Does the fact that this elongation is mRNA- 
independent mean that it is independent of all RNA? What chal- 
lenges would a “protein world” scenario face compared to an 
RNA world, in terms of being able to propagate specific 
sequence information? 
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Control of Gene Expression 


CAN WE RE-CREATE A T-REX? 


lf the movie Jurassic Park, scientists isolated dinosaur 
DNA from amber-preserved mosquitoes that had fed on 
dinosaurs and inserted it into the genome of a lizard 
egg. The dinosaur genes caused the lizards to develop 
as dinosaurs. In reality, it isn't possible to obtain signifi- 
cant quantities of dinosaur DNA from ancient samples in 
amber, but even if we could, getting the dinosaur genes 
to be properly expressed and regulated in a lizard 
genome is a formidable challenge. As we will see in this 
chapter, gene expression is controlled by a large num- 
ber of DNA elements in the genome, and inserting a 
gene into a random spot in the genome often results in 
a gene that is improperly expressed, if it is expressed at 
all. In addition to its hypothetical applications in dino- 
saur reincarnation, control of gene expression is also 
important in the real world, for example in studies that 
attempt to fix genetic defects by inserting normal 
versions of the defective gene into a patient's genome 


Mosquito preserved in amber. 


Source: © The Natural History Museum / Alamy Inc. 


(gene therapy). 
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in Bacteria 


In spite of their obvious differences in form and function, the cells 
that make up a multicellular organism contain an identical set of 
genes. The genetic information present in all cells can be compared 
to a book of blueprints prepared for the construction of a large, mul- 
tipurpose building. At different times, all of the blueprints will be 
needed, but only a small subset is consulted during the work on a 
particular floor or room. The same is true for a fertilized egg, which 
contains a complete set of genetic instructions that is faithfully rep- 
licated and distributed to each cell of a developing organism, but 
only a subset of genes is expressed in any particular cell. Single cell 
organisms such as bacteria and protists also contain a complete set of 
genes, but only a subset is expressed at any one time, depending on 
environmental cues or food sources. Thus the cells of all organisms 
carry much more genetic information than they will use at any par- 
ticular moment. Cells possess mechanisms that allow them to pre- 
cisely regulate their genetic information, expressing genes only when 
they are needed. Changes in gene expression are thought to play a 
critical role in evolution by allowing genes normally active in one 
part of an embryo to be turned on in a different region, leading to a 
novel pattern of development to produce a new organism. Gene 
expression changes are seen in many diseases, notably cancer, and 
currently 10 percent of all prescribed drugs, including steroid hor- 
mone analogs, act at the level of gene expression. In this chapter, we 
will explore some of the ways in which prokaryotic and eukaryotic 
cells control gene expression and thereby ensure that certain RNAs 
and proteins are synthesized, while others are not. Much of what we 
know about the control of gene expression is based on studies exam- 
ining a single gene under different circumstances. However, with the 
advent of new technologies and the sequencing of entire genomes, 
we are beginning to understand how the entire repertoire of 
expressed genes is regulated. 


Organization of Bacterial Genomes 


Prokaryotic genomes are organized in a number of different ways, 
but circular double-stranded DNA genomes are a common arrange- 
ment. For such organisms, there is hardly any excess DNA; nearly all 
the DNA encodes RNAs or proteins with little spacing between indi- 
vidual transcription units. In addition, genes involved in the same 
biological process are often grouped together to allow coordinate 
regulation of the entire group. For example, there may be a set of 
genes involved in flagella formation directly adjacent to a group of 
genes involved in sugar metabolism. Because of this arrangement, it 
is essential that decisions on where and when to start and stop tran- 
scription or translation are precisely regulated. 


The Bacterial Operon 


A bacterial cell lives in direct contact with its environment, which 
may change dramatically in chemical composition or temperature 
from one moment to the next. At certain times, a particular food 
source may be present, while at other times that compound is absent. 
Consider the consequences of transferring a culture of bacteria 
from a minimal medium to one containing either (1) lactose or 
(2) tryptophan. 


1. Lactose is a disaccharide (see Figure 2.16) composed of glucose 
and galactose whose oxidation can provide the cell with meta- 
bolic intermediates and energy. The first step in the catabolism 
(ie., degradation) of lactose is the hydrolysis of the bond that 
joins the two sugars (a B-galactoside linkage), a reaction cata- 
lyzed by the enzyme f-galactosidase. When bacterial cells are 
growing under minimal conditions, the cells have no need for 
B-galactosidase. Under minimal conditions, an average cell contains 
fewer than five copies of B-galactosidase and a single copy of the 
corresponding mRNA. Within a few minutes after adding lactose 
to the culture medium, cells contain approximately 1000 times the 
number of B-galactosidase molecules. The presence of lactose has 
induced the synthesis of this enzyme (FIGURE 12.1). 


2. Tryptophan is an amino acid required for protein synthesis. In 
humans, tryptophan is an essential amino acid; it must come 
from the diet. In contrast, bacterial cells can synthesize trypto- 
phan in a series of reactions requiring the activity of multiple 
enzymes. Cells growing in the absence of tryptophan activate the 
genes encoding these enzymes. If, however, tryptophan should 
become available in the medium, the cells no longer have to syn- 
thesize this amino acid, and, within a few minutes, production of 
the enzymes needed to synthesize tryptophan is repressed. 


In bacteria, the genes that encode the enzymes needed to syn- 
thesize tryptophan are clustered together on the chromosome in a 
functional complex called an operon. All of the genes of an operon 
are coordinately controlled by a mechanism first described in 1961 
by Francois Jacob and Jacques Monod of the Pasteur Institute in 
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FIGURE 12.1 Kinetics of 6-galactosidase induction in E. coli. 

When a B-galactoside such as lactose is added to a culture of these 
bacteria, the production of mRNA for the enzyme B-galactosidase begins 
very rapidly, followed within a minute or so by the appearance of the 
protein, whose concentration increases rapidly. Removal of the inducer 
leads to a precipitous drop in the level of the mRNA, which reflects its rapid 
degradation. The amount of enzyme then levels off because new molecules 
are no longer synthesized. 
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FIGURE 12.2 Organization of a bacterial operon. The enzymes that make up a metabolic pathway are encoded by a series of structural genes that 
reside in a contiguous array within the bacterial chromosome. The structural genes of an operon are transcribed into a single polycistronic mRNA, which is 
translated into separate polypeptides. Transcription of the structural genes is controlled by a repressor protein that is synthesized by a regulatory gene, 
which is also part of the operon. When bound to the operator site of the DNA, the repressor protein blocks movement of RNA polymerase from the 


promoter to the structural genes. 


Paris. A typical bacterial operon consists of structural genes, 
a promoter region, an operator region, and a regulatory gene 
(FIGURE 12.2). 


e Structural genes encode the enzymes themselves. The 
structural genes of an operon usually lie adjacent to one another, 
and RNA polymerase moves from one structural gene to the 
next, transcribing all of the genes into a single mRNA. An 
mRNA containing information for more than one polypeptide is 
called a polycistronic mRNA. The polycistronic mRNA is then 
translated into the various individual enzymes of the metabolic 
pathway. 


e The promoter is the site where the RNA polymerase binds to 
the DNA prior to beginning transcription (discussed on page 
408). 


e The operator typically resides adjacent to or overlaps with the 
promoter (see Figure 12.4) and serves as the binding site for a 
protein, usually a repressor. The repressor is an example of a 
gene regulatory protein—a protein that recognizes a specific 
sequence of base pairs within the DNA and binds to that 
sequence with high affinity. As will be evident in the remaining 
sections of this chapter, DNA-binding proteins, such as 
bacterial repressors, play a prominent role in determining 
whether or not a particular segment of the genome is 
transcribed. 


e The regulatory gene encodes the repressor protein. 


The key to operon expression lies in the sequence of the opera- 
tor and the presence or absence of a repressor. When the repressor 
binds the operator (FIGURE 12.3), it prevents RNA polymerase from 
initiating transcription. The capability of the repressor to bind the 
operator and inhibit transcription depends on the conformation of 
the repressor, which is regulated allosterically by a key compound in 
the metabolic pathway, such as lactose or tryptophan, as described 
shortly. It is the concentration of this key metabolic substance 
that determines whether the operon is active or inactive at any 
given time. 


THE TRP OPERON In a repressible operon, such as the 
tryptophan (or trp) operon, the repressor is unable to bind to 
the operator DNA by itself. Instead, the repressor is active as a DNA- 
binding protein only when complexed with a specific factor, such as 
tryptophan (Figure 12.3a), which functions as a corepressor. In the 
absence of tryptophan, the conformation of the repressor does not 
allow binding to the operator sequence, thus permitting RNA poly- 
merase to bind to the promoter and transcribe the structural genes 
of the trp operon, leading to production of the enzymes that synthe- 
size tryptophan. Once tryptophan becomes available, the enzymes 
of the tryptophan biosynthetic pathway are no longer required. 
Under these conditions, the increased concentration of tryptophan 
leads to the formation of the tryptophan-repressor complex, which 
binds to the operator and blocks transcription. 


THE LAC OPERON The lac operon is a cluster of genes that 
regulates production of the enzymes needed to degrade lactose in 
bacterial cells. The Jac operon is an example of an inducible 
operon, in which the presence of a key metabolic substance (in this 
case, lactose) induces transcription of the operon, allowing synthesis 
of the proteins encoded by the structural genes (Figure 12.3b). The 
lac operon contains three tandem structural genes: the z gene, which 
encodes f-galactosidase; the y gene, which encodes galactoside per- 
mease, a protein that promotes entry of lactose into the cell; and the 
a gene, which encodes thiogalactoside transacetylase, an enzyme 
whose physiologic role is unclear. If lactose is present in the medium, 
the disaccharide enters the cell via limiting amounts of galactoside 
permease where it binds to the lac repressor, changing the confor- 
mation of the repressor and making it unable to attach to the DNA 
of the operator. This frees RNA polymerase to transcribe the operon 
followed by translation of the three encoded proteins. Thus, in an 
inducible operon the repressor protein binds to the DNA only in the 
absence of lactose, which functions as the inducer.’ As the 
concentration of lactose in the medium decreases, the disaccharide 


'The actual inducer is allolactose, which is derived from and differs from lactose by 
the type of linkage joining the two sugars. This feature is ignored in the discussion. 
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FIGURE 12.3 Gene regulation by operons. Inducible and repressible operons work on a similar principle: If the repressor is able to bind to the operator, 
genes are turned off; if the repressor is inactivated and unable to bind to the operator, genes are expressed. (a) In a repressible operon, such as the trp 
operon, the repressor, by itself, is unable to bind to the operator, and the structural genes encoding the enzymes are actively transcribed. The enzymes of 
the trp operon catalyze a series of reactions that result in the synthesis of the essential amino acid tryptophan. (1) When tryptophan is plentiful, tryptophan 
molecules act as corepressors by binding to the inactive repressor and (2) change its shape, allowing it to bind to the operator, (3) preventing transcription of 
the structural genes. Thus, when tryptophan concentration is high, the operon is repressed, preventing overproduction of tryptophan. (4) When the 
tryptophan concentration is low, the repressor molecules lack a corepressor and therefore fail to bind to the operator allowing transcription (5) and transla- 
tion (6) of enzymes (7) needed to synthesize tryptophan (8). (b) In an inducible operon, (1) the inducer (in this case, the disaccharide lactose) binds to the 
repressor protein and (2) prevents its binding to the operator (O). (3) Without the repressor in the way, RNA polymerase attaches to the promoter (P) and 
transcribes the structural genes. Thus, when the lactose concentration is high, the operon is induced, and the sugar-digesting enzymes encoded by the lac 
operon are transcribed and translated (4). As the sugar is metabolized (5), its concentration dwindles, causing bound inducer molecules to dissociate from 
the repressor, which then regains its ability to attach to the operator and prevent transcription (6). Thus, when the inducer concentration is low, the operon is 


repressed, preventing synthesis of unneeded enzymes. 


dissociates from its binding site on the repressor molecule, which 
allows the repressor to again bind to the operator and repress 
transcription. 


CATABOLITE REPRESSION Repressors, such as those of 
the lac and trp operons, exert their influence by negative control, as 
the interaction of the DNA with this protein inhibits gene expres- 
sion. The lac operon is also under positive control, as was discovered. 
during an early investigation of a phenomenon called the glucose 
effect. If bacterial cells are supplied with glucose as well as a variety 
of other substrates, such as lactose or galactose, the cells catabolize 
the glucose first and ignore the other sugars. The glucose in the 
medium acts to repress the production of various catabolic enzymes, 
such as B-galactosidase, that would allow utilization of the other sug- 
ars. In 1965, a surprising finding was made: cyclic AMP (cAMP), 
previously thought to be involved only in eukaryotic metabolism, 
was detected in cells of E. coli. It was found that the concentration of 
cAMP in the cells was related to the presence of glucose in the 
medium; the higher the glucose concentration, the lower the cAMP 
concentration. Furthermore, when cAMP was added to the medium 
in the presence of glucose, the catabolic enzymes that were normally 
absent were suddenly synthesized by the cells. 

Although the exact means by which glucose lowers the concen- 
tration of cAMP has still not been elucidated, the mechanism by 
which cAMP overcomes the effect of glucose is well understood. 
cAMP acts in prokaryotic cells by binding to a protein, the cAMP 
receptor protein (CRP). By itself, CRP is unable to bind to DNA. 
However, the cAMP-CRP complex recognizes and binds to a spe- 
cific site in the lac control region (FIGURE 12.4). The presence of the 
bound cAMP-CRP causes a change in the conformation of the DNA, 
which makes it possible for RNA polymerase to transcribe the lac 
operon. The binding site in the promoter of the lac operon for RNA 
polymerase is not a high-affinity binding site, so initiation of tran- 
scription is extremely inefficient except in the presence of the 
cAMP-CRP complex. Even when lactose is present and the lac 
repressor is inactivated, RNA polymerase cannot transcribe the lac 
operon unless the levels of cAMP-CRP are high. Because of the 
inverse relationship between cAMP levels and glucose levels, tran- 
scription of the Jac operon is thus regulated by glucose levels. As long 
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as glucose is abundant, cAMP concentrations remain below that 459 


required to promote transcription of the operon. 


ATTENUATION Because the trp repressor can only bind the 
operator in the presence of tryptophan, the concentration of trypto- 
phan serves as a feedback regulator controlling transcription of the 
operon. A second form of feedback regulation controls the trp operon 
by regulating transcription termination, a mechanism referred to as 
attenuation. In the presence of high concentrations of tryptophan, 
RNA polymerase ceases transcription shortly after initiation in a 
region called the leader sequence. If the concentration of tryptophan 
is low, transcription does not terminate until the entire operon is tran- 
scribed. The mechanism of attenuation links alternative RNA second- 
ary structures to transcription termination. Immediately after 
transcription, RNA from the leader region folds into one of two alter- 
native secondary structures. One of these structures is a transcription 
termination signal that stops RNA polymerase from continuing to the 
end of the operon. The alternative structure does not contain a tran- 
scription termination signal and allows transcription of a single 
mRNA that encodes all the structural genes. The decision as to which 
of the two alternative RNA structures formed is regulated by the con- 
centration of tryptophan. 


Riboswitches 


Over the past few years a different type of mechanism has captured 
the interest of researchers studying bacterial gene regulation. 
Proteins such as the lac and trp repressors are not the only gene regu- 
latory molecules that are influenced by interaction with small 
metabolites. A number of bacterial mRNAs have been identified that 
can bind a small metabolite, such as glucosamine or adenine, with 
remarkable specificity. The metabolite binds to a highly structured 
5’ noncoding region of the mRNA. Once bound to the metabolite, 
these mRNAs, or riboswitches, as they are called, undergo a change 
in their folded conformation that allows them to alter the expression 
of a gene involved in production of that metabolite. Thus ribo- 
switches act by means of a feedback mechanism similar to the alter- 
native RNA structures that regulate attenuation in the trp operon. 
Most riboswitches suppress gene expression by blocking either 
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sequence CTTTCGCCCGTCACTCGCGTTGCGTTAATTACACTCAATCGAGTGAGTAATCCGTGGGGTCCGAAATGTGAAATAC GAAGGCCGAGCATACAACACACCTTAACACTCGCCTATTGTTAAAGTGTGICCTTTGTCGATACTGGTAC , 
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FIGURE 12.4 Nucleotide sequence of the control regions of the lac operon. The binding site for the sigma subunit of RNA polymerase (page 411) is a 
low-affinity binding site in the lac operon. Transcription of the operon is highly inefficient except in the presence of a complex between cAMP and the CRP 
protein. Because the concentration of cAMP is inversely proportional to the levels of glucose, the lac operon will not be activated unless glucose levels are 
low. The site of initiation of transcription is denoted as +1, which is approximately 40 nucleotides upstream from the site at which translation is initiated. 


Regions of sequence symmetry in the CRP site and operator are indicated by the red horizontal line. 


Source: From R. C. Dickson et al., Science 187:32, 1975; Copyright 1975, reprinted with permissions from AAAS. 
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| repressors that function in conjunction with operons, riboswitches 

_allow cells to adjust their level of gene expression in response to 
changes in the available levels of certain metabolites. Given that they 
act without the participation of protein cofactors, riboswitches are 
likely another legacy from an ancestral RNA world (page 429). 


REVIEW | 


1. Describe the cascade of events responsible for the 
sudden changes in gene expression in a bacterial cell 
following the addition of lactose. How does this 
compare with the events that occur in response to 
the addition of tryptophan? 

2. What is the role of cyclic AMP in the synthesis of 
B-galactosidase? 


3. What is a riboswitch? 


12.2 Structure of the 
Nuclear Envelope 


A primary difference between prokaryotic and eukaryotic organ- 
isms is the presence of a nucleus in eukaryotic cells. Moreover, most 
eukaryotic organisms have much larger genomes, which are present 
as double-stranded DNA molecules arranged in linear chromo- 
somes that can be easily visualized during mitosis (as in Figure 
12.22b). Before discussing regulation of gene expression in eukary- 
otes, we will first discuss the consequences of compartmentalizing 
genomes within nuclei and how large genomes are packaged into 
DNA-protein complexes called chromatin. The DNA in prokary- 
otic organisms is not packaged into chromatin; as a result, DNA- 
binding proteinssuchas RNA polymerase, repressors,and CRP-cAMP 
complexes can directly bind to their preferred binding sites. In 
contrast, DNA-binding proteins in eukaryotes have to find their 


(a) Tum 


preferred binding sites in the context of a complicated DNA-protein 
complex. 

Considering its importance in the storage and utilization of 
genetic information, the nucleus of a eukaryotic cell has at first 
glance a rather undistinguished morphology (FIGURE 12.5). The 
contents of the nucleus are present as a viscous, amorphous mass of 
material enclosed by a complex nuclear envelope that forms a bound- 
ary between the nucleus and cytoplasm. Included within the nucleus 
of a typical interphase (i-e., nonmitotic) cell are (1) the chromo- 
somes, which are present as highly extended nucleoprotein fibers, 
termed chromatin; (2) one or more nucleoli, which are irregularly 
shaped electron-dense structures that function in the synthesis of 
ribosomal RNA and the assembly of ribosomes (discussed on page 
413); and (3) the nucleoplasm, the fluid substance in which the sol- 
utes of the nucleus are dissolved. 

The separation of a cell’s genetic material from the surrounding 
cytoplasm may be the single most important feature that distin- 
guishes eukaryotes from prokaryotes, which makes the appearance 
of the nuclear envelope a landmark in biological evolution. The 
nuclear envelope consists of two cellular membranes arranged 
parallel to one another and separated by 10 to 50 nm (FIGURE 12.6). 
Together these membranes contain upwards of 60 distinct trans- 
membrane proteins, including a number of species that link the 
outer nuclear membrane with elements of the cytoskeleton (Figure 
12.6a). The inner and outer nuclear membranes are fused at sites 
forming circular pores that contain complex assemblies of proteins. 
The average mammalian cell contains several thousand nuclear 
pores. The outer membrane is generally studded with ribosomes and 
is continuous with the membrane of the rough endoplasmic reticu- 
lum. The space between the membranes is continuous with the ER 
lumen (Figure 12.6a). 

The inner surface of the nuclear envelope of animal cells is 
bound by integral membrane proteins to a thin filamentous mesh- 
work, called the nuclear lamina (FIGURE 12.7). The nuclear lamina 
provides mechanical support to the nuclear envelope, serves as a site 
of attachment for chromatin fibers at the nuclear periphery 


(b) 


FIGURE 12.5 The cell nucleus. (a) Electron micrograph of an interphase HeLa cell nucleus. Heterochromatin (page 469) is evident around the entire 
inner surface of the nuclear envelope. Two prominent nucleoli are visible, and clumps of chromatin can be seen scattered throughout the nucleoplasm. 


| (b) Schematic drawing showing some of the major components of the nucleus. 


Source: (a) From Werner W. Franke, Int. Rev. Cytol. (Suppl.) 4:130, 1974., with permission from Elsevier. 
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FIGURE 12.6 The nuclear envelope. (a) Schematic drawing showing the 
double membrane, nuclear pore complex, nuclear lamina, and the continuity 
of the outer membrane with the rough endoplasmic reticulum (ER). Both 
membranes of the nuclear envelope contain their own distinct complement 
of proteins. The actin filaments and intermediate filaments of the cytoskel- 
eton are connected to the outer nuclear membrane by fibrous proteins 
(Nesprins). (6) Electron micrograph of a section through a portion of the 
nuclear envelope of an onion root tip cell. Note the double membrane (NM) 
with intervening space, nuclear pore complexes (NPC), and associated 

(a) heterochromatin (HC) that does not extend into the region of the nuclear pores. 


Source: (6) From Werner W. Franke et al., J. Cell Biol. 91:47s, 1981, Fig. 8. 
Reproduced with permission of The Rockefeller University Press. 
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(Figure 12.6b), and has a poorly understood role in DNA replication is characterized by premature aging and death during teenage years 


and transcription. The filaments of the nuclear lamina are approxi- _ from heart attack or stroke. Figure 12.7c shows the misshapen nuclei 
mately 10 nm in diameter and composed of polypeptides, called from the cells of a patient with HGPS, demonstrating the importance 
lamins. Lamins are members of the same superfamily of polypep- _ of the nuclear lamina as a determinant of nuclear architecture. It is 


tides that assemble into the 10-nm intermediate filaments of the interesting to note that the phenotype depicted in Figure 12.7c has 
cytoplasm (see Table 9.2). The disassembly of the nuclear lamina _ been traced to a synonymous mutation, that is, one that generated a 


prior to mitosis is induced by phosphorylation of the lamins. different codon for the same amino acid. In this case, the change in 

Mutations in one of the lamin genes (LMNA) are responsible for | DNA sequence alters the way the gene transcript is spliced, leading to 
a number of diverse human diseases, including a rare form of muscu- _ production of a shortened protein, causing the altered phenotype. 
lar dystrophy (called EDMD2) in which muscle cells contain excep- — This example illustrates how the sequence of a gene serves as a “mul- 


tionally fragile nuclei. Mutations in LMNA have also been linkedtoa _ tiple code”: one that directs the translation machinery and others that 
disease, called Hutchinson-Gilford progeria syndrome (HGPS), that —_—_ direct the splicing machinery and protein folding. 
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FIGURE 12.7 The nuclear lamina. (a) Nucleus of a cultured human cell that has been stained with fluorescently labeled antibodies against lamin A/C to 
reveal the nuclear lamina (red), which lies on the inner surface of the nuclear envelope. A protein that is proposed to be part of a nuclear matrix (or nuclear 
scaffold) appears green. (b) Electron micrograph of a freeze-dried, metal-shadowed nuclear envelope of a Xenopus oocyte that has been extracted with the 
nonionic detergent Triton X-100. The lamina appears as a rather continuous meshwork comprising filaments oriented roughly perpendicular to one another. 
Inset shows a well-preserved area from which nuclear pores have been mechanically removed. (c) These micrographs show the nucleus within a fibroblast 
that had been cultured from either a patient with HGPS (bottom row) or a healthy subject (top row). The cells are stained for the protein lamin A (left 
column), for DNA (middle column), or shown in a living state under the phase-contrast light microscope (right column). The cell nucleus from the HGPS 
patient is misshapen due to the presence in the nuclear lamina of a truncated lamin A protein. 

Source: (a) From H. Ma, A. J. Siegel, and R. Berezney, J. Cell Biol.146:535, 1999, Fig. 2. Reproduced with permission of the Rockefeller University Press; 

(b) From U. Aebi, J. Cohn, L. Buhle, and L. Gerace, Nature 323:561, 1986. Reprinted by permission from Macmillan Publishers Limited; (c) From Anna 
Mattout et al., Courtesy of Yosef Gruenbaum, Curr. Opin. Cell Biol. 18:338, 2006, with permission from Elsevier. 
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462) The Nuclear Pore Complex and Its Role 
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in Nucleocytoplasmic Trafficking 


The nuclear envelope is the barrier between the nucleus and cyto- 
plasm, and nuclear pores are the gateways across that barrier. Unlike 
the plasma membrane, which prevents passage of macromolecules 
between the cytoplasm and the extracellular space, the nuclear enve- 
lope is a hub of activity for the movement of RNAs and proteins in 
both directions between the nucleus and cytoplasm. The replication 
and transcription of genetic material within the nucleus require the 
participation of large numbers of proteins that are synthesized in the 
cytoplasm and transported across the nuclear envelope. Conversely, 
mRNAs, tRNAs, and ribosomal subunits that are manufactured in 
the nucleus must be transported through the nuclear envelope in the 
opposite direction. Some components, such as the snRNAs of the 
spliceosome (page 426), move in both directions; they are synthe- 
sized in the nucleus, assembled into RNP particles in the cytoplasm, 
and then shipped back to the nucleus where they function in mRNA 
processing. To appreciate the magnitude of the traffic between the 
two major cellular compartments, consider a HeLa cell, which is 
estimated to contain about 10,000,000 ribosomes. To support its 
growth, a single HeLa cell nucleus must import approximately 
560,000 ribosomal proteins and export approximately 14,000 ribo- 
somal subunits every minute. 

How do all of these materials pass through the nuclear enve- 
lope? In one early approach, a suspension of tiny gold particles was 
injected into cells and passage of the material through the nuclear 
envelope was observed with the electron microscope. As illustrated 
in FIGURE 12.8a,b, these particles move from the cytoplasm into the 
nucleus by passing single-file through the center of the nuclear 
pores. Electron micrographs of cells fixed in the normal course of 
their activities have also shown that particulate material can pass 
through a nuclear pore. An example is shown in Figure 12.8c, in 
which granular material presumed to consist of a ribosomal subunit 
is seen squeezing through one of these pores. 

Given the fact that materials as large as gold particles and ribo- 
somal subunits can penetrate nuclear pores, one might assume that 
these pores are merely open channels, but just the opposite is true. 
Nuclear pores contain a doughnut-shaped structure called the 
nuclear pore complex (NPC) that straddles the nuclear envelope, 
projecting into both the cytoplasm and nucleoplasm. The NPC is 
a huge, supramolecular complex—15 to 30 times the mass of a 
ribosome—that exhibits octagonal symmetry due to the eightfold 
repetition of a number of structures (Figure 12.10). Despite their 
considerable size and complexity, NPCs contain only about 30 dif- 
ferent proteins, called nucleoporins, which are largely conserved 
between yeast and vertebrates. Each nucleoporin is present in multi- 
ple copies—8, 16, or 32 in number—in keeping with the octagonal 
symmetry of the structure. 

The structure of the NPC is seen in the electron micrographs of 
FIGURE 12.9 and the models of FIGURE 12.10. At the heart of the NPC 
is a central channel, which is surrounded by a ring of nucleoporins 
whose rearrangements can change the diameter of the opening from 
about 20 to 40 nm. The NPC is not a static structure, as evidenced by 
the finding that many of its component proteins are replaced with new 
copies over a time period of seconds to minutes. Recent studies suggest 
that this dynamic exchange of nucleoporins may play a role in activat- 
ing the transcription of chromatin that is associated with the NPC. 

Among the nucleoporins is a subset of proteins that possess, within 
their amino acid sequence, a large number of phenylalanine-glycine 
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FIGURE 12.8 The movement of materials through the nuclear pore. 
(a) Electron micrograph of the nuclear—-cytoplasmic border of a frog oocyte 
taken minutes after injection with gold particles that had been coated with 
a protein normally found in the nucleus. These particles pass through the 
center of the nuclear pores (arrows) on their way from the cytoplasm to the 
nucleus. (b) At higher magnification, the gold particles are seen to be 
clustered in a linear array within each pore. (c) Electron micrograph of a 
section through the nuclear envelope of an insect cell showing the 
movement of granular material (presumed to be a ribosomal subunit) 
through a nuclear pore. 

Source: (a, 6) Courtesy of C. M. Feldherr; (c) From Barbara J. Stevens and 
Hewson Swift, J. Cell Biol. 31:72, 1966, Fig. 23. Reproduced with 
permission of the Rockefeller University Press. 


repeats (FG, by their single letter names). The FG repeats are clustered 
in a particular region of each molecule called the FG domain. 
Because of their unusual amino acid composition, the FG domains 
possess a disordered structure (page 55) that gives them an extended 
and flexible organization. The FG repeat-containing nucleoporins 
are thought to line the central channel of the NPC with their fila- 
mentous FG domains extending into the heart of the central chan- 
nel. The FG domains form a hydrophobic meshwork or sieve that 
blocks the free diffusion of larger macromolecules (greater than 
about 40,000 Daltons) between the nucleus and cytoplasm. 

In 1982, Robert Laskey and his co-workers at the Medical 
Research Council of England found that nucleoplasmin, one of the 
more abundant nuclear proteins of amphibian oocytes, contains a 
stretch of amino acids near its C-terminus that functions asa nuclear 
localization signal (NLS). This sequence enables a protein to pass 
through the nuclear pores and enter the nucleus. The best studied, or 
“classical,” NLSs consist of one or two short stretches of positively 
charged amino acids. The T antigen encoded by the virus SV 40, for 
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FIGURE 12.9 Scanning electron micrographs of the nuclear pore 
complex from isolated nuclear envelopes of an amphibian oocyte. 

(a) The cytoplasmic face of the nuclear envelope showing the peripheral 
cytoplasmic ring of the nuclear pore complex. (b) The nuclear face of the 
nuclear envelope showing the basket-like appearance of the inner portion 
of the complex. (c) The nuclear face of the envelope showing the 
distribution of the NPCs and places where intact patches of the nuclear 
lamina (NEL) are retained. In all of these micrographs, isolated nuclear 
envelopes were fixed, dehydrated, dried, and metal-coated. 

Source: (a—c) From M. W. Goldberg and T. D. Allen, J. Cell Biol. 119:1431, 
1992, Figures 1-3. Reproduced with permission of the Rockefeller 
University Press. 


example, contains an NLS identified as -Pro-Lys-Lys-Lys-Arg-Lys- 
Val-. If one of the basic amino acids in this sequence is replaced by a 
nonpolar amino acid, the protein fails to localize to the nucleus. 
Conversely, if this NLS is fused to a nonnuclear protein, such as 
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FIGURE 12.10 A model of a vertebrate nuclear pore complex (NPC). 
(a) Schematic representation of a vertebrate NPC as it is situated within the 
nuclear envelope. This elaborate structure consists of several parts, 
including a scaffold and transmembrane ring that anchors the complex to 
the nuclear envelope, a cytoplasmic and a nuclear ring, a nuclear basket, 
and eight cytoplasmic filaments. The FG-containing nucleoporins line a 
central channel with their disordered FG-containing domains extending 
into the opening and forming a hydrophobic meshwork. (6) Three- 
dimensional reconstruction of a portion of a nuclear pore complex showing 
the localization of individual nucleoporin molecules within the structure. 
The FG nucleoporins are shown in green. Several transmembrane 
nucleoporins span the pore membrane, forming an outer ring that 

anchors the NPC to the nuclear envelope. 

Source: (b) From Javier Fernandez-Martinez and Michael P. Rout, Curr. Opin. 
Cell Biol. 21: 604, 2009, with permission from Elsevier. 


serum albumin, and injected into the cytoplasm, the modified protein 
becomes concentrated in the nucleus. Thus, targeting of proteins to 
the nucleus is similar in principle to trafficking of other proteins that 
are destined for segregation within a particular organelle, such as a 
mitochondrion or a peroxisome (Section 8.21). In all of these cases, 
the proteins possess a specific “address” that is recognized by a spe- 
cific receptor that mediates its transport into the organelle. 

The study of nuclear transport has been a very active area of 
research, driven by the development of in vitro systems capable of 
selectively importing proteins and RNPs into the nucleus. Using 
these systems, researchers have identified a family of proteins that 
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function as mobile transport receptors, ferrying macromolecules | 
across the nuclear envelope. Within this family, importins move | 


macromolecules from the cytoplasm into the nucleus and exportins 
move macromolecules in the opposite direction. 

FIGURE 12.11a depicts some of the major steps that occur during 
the nuclear import of a protein, such as nucleoplasmin, that contains 
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a classical NLS. Import begins as the NLS-containing cargo protein 
binds to a heterodimeric, soluble NLS receptor, called importin a/B, 
that resides in the cytoplasm (step 1, Figure 12.11a). The transport 
receptor is thought to escort the protein cargo to the outer surface of 
the nucleus where it likely docks with the cytoplasmic filaments that 
extend from the outer ring of the NPC (step 2). Figure 12.11b shows 
a number of gold particles bound to these filaments; these particles 
were coated with an NLS-containing nuclear protein that was being 
transported through the nuclear pore complex. The receptor-cargo 
complex then moves through the nuclear pore (step 3, Figure 12.11a) 
by engaging in a series of successive interactions with the FG domains 
of the FG-containing nucleoporins, allowing passage of the receptor- 
cargo complex through the NPC. 

Once the bound cargo proceeds through the NPC, a GTP-binding 
protein called Ran drives release of the transported protein into the 
nuclear compartment. Like other GTP-binding proteins, such as Sarl 
(page 283) and EF-Tu (page 445) discussed in earlier chapters, Ran 
can exist in an active GTP-bound form or an inactive GDP-bound 
form. Ran’s role in regulating nucleocytoplasmic transport is based 
on a mechanism in which the cell maintains a high concentration of 
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FIGURE 12.11 Importing proteins from the cytoplasm into the 
nucleus. (a) Proposed steps in nuclear protein import. Proteins bearing a 
nuclear localization signal (NLS) bind to the heterodimeric receptor 
(importin «/B) (step 1) forming a complex that associates with a cytoplasmic 
filament (step 2). The receptor-cargo complex moves through the nuclear 
pore (step 3) and into the nucleoplasm where it interacts with Ran-GTP and 
dissociates (step 4). The importin B subunit, in association with Ran-GTP, is 
transported back to the cytoplasm, where the Ran-GTP is hydrolyzed 

(step 5). Ran-GDP is subsequently transported back to the nucleus, where it 
is converted to Ran-GTP. Conversely, importin « is transported back to the 
cytoplasm. (6) Nucleoplasmin is a protein present in high concentration in 
the nucleoplasm of Xenopus oocytes. When gold particles are coated with 
nucleoplasmin and injected into the cytoplasm of a Xenopus oocyte, they 
are seen to bind to the cytoplasmic filaments (CF) projecting from the outer 
ring of the nuclear pore complex. Several particles are also seen in transit 
through the pore (NP) into the nucleus. 

Source: (a) Based on a model by M. Ohno et al., Cell 92:327, 1998; Cell 

by Cell Press. Reproduced with permission of Cell Press in the format reuse 
in a book/textbook via Copyright Clearance Center; (b) From W. D. 
Richardson et al., Cell 52:662, 1988; with permission from Elsevier. 
Courtesy of A. D. Mills. 


Ran-GTP in the nucleus and a very low concentration of Ran-GTP in 
the cytoplasm. The steep gradient of Ran-GTP across the nuclear 
envelope depends on the compartmentalization of certain accessory 
proteins (see Figure 15.210 for further discussion). One of these acces- 
sory proteins (named RCC1) is sequestered in the nucleus where it 
promotes the conversion of Ran-GDP to Ran-GTP, thus maintaining 
the high nuclear level of Ran-GTP. Another accessory protein (named 
RanGAP1) resides in the cytoplasm where it promotes the hydrolysis 
of Ran-GTP to Ran-GDP, thus maintaining the low cytoplasmic level 
of Ran-GTP. Thus the energy released by GTP hydrolysis is used to 
maintain the Ran-GTP gradient across the nuclear envelope. As dis- 
cussed below, the Ran-GTP gradient drives nuclear transport by a 
process that depends only on receptor-mediated diffusion; no motor 
proteins or ATPases have been implicated. 

We can now return to our description of the classical NLS 
import pathway. When the importin-cargo complex arrives in the 
nucleus, it is met by a molecule of Ran-GTP, which binds to the com- 
plex and causes its disassembly as indicated in step 4, Figure 12.11a. 
This is the apparent function of the high level of Ran-GTP in the 
nucleus: It promotes the disassembly of complexes imported from 


the cytoplasm. The imported cargo is released into the nucleoplasm, 
and one portion of the NLS receptor (the importin B subunit) is 
shuttled back to the cytoplasm together with the bound Ran-GTP 
(step 5). Once in the cytoplasm, the GTP molecule bound to Ran is 
hydrolyzed, releasing Ran-GDP from the importin B subunit. Ran- 
GDP is returned to the nucleus, where it is converted back to the 
GTP-bound state for additional rounds of activity. Importin o is 
transported back to the cytoplasm by one of the exportins. 

Ran-GTP plays a key role in the escort of macromolecules out of 
the nucleus, just as it does in their import from the cytoplasm. Recall 
that Ran-GTP is essentially confined to the nucleus. Whereas Ran- 
GTP induces the disassembly of imported complexes, as shown in 
step 4 of Figure 12.11a, Ran-GTP promotes the assembly of export 
complexes. Proteins exported from the nucleus contain amino acid 
sequences (called nuclear export signals, or NESs) that are recog- 
nized by transport receptors that carry them through the nuclear 
envelope to the cytoplasm. 


RNA Transport 


Most of the traffic out of the nucleus consists of various types of RNA 
molecules—mRNAs, rRNAs, snoRNAs, miRNAs, and tRNAs—that 
are synthesized in the nucleus and function in the cytoplasm or are 
modified in the cytoplasm and return to function in the nucleus. 
These RNAs move through the NPC as ribonucleoproteins (RNPs). 

As with protein transport, RNA transport involves the associa- 
tion of transport receptors that ferry mRNP complexes through 
nuclear pores. Transport of an mRNP from the nucleus to cytoplasm 
is associated with extensive remodeling; certain proteins are stripped 
from the mRNA, while others are added to the complex. Numerous 
studies have demonstrated a functional link between pre-mRNA 
splicing and mRNA export; only mature (ie., fully processed) 
mRNAs are capable of nuclear export. If an mRNA still contains an 
unspliced intron, that RNA is retained in the nucleus. 


REVIEW | 


1. Describe the components that make up the nuclear 
envelope. What is the relationship between the 
nuclear membranes and the nuclear pore complex? 
How does the nuclear pore complex regulate 
bidirectional movement of materials between the 
nucleus and cytoplasm? 


e—____ 


12.3 Packaging the Eukaryotic 
Genome 


Chromosomes seem to appear out of nowhere at the beginning of 
mitosis and disappear once again when cell division has ended. The 
appearance and disappearance of chromosomes provided early 
cytologists with a challenging question: What is the nature of the 
chromosome in the nonmitotic cell? 

An average human cell contains about 6.4 billion base pairs of 
DNA divided among 46 chromosomes (the value for a diploid number 
of chromosomes). Each chromosome contains a single, continuous 
DNA molecule; the larger the chromosome, the more DNA it 
contains. The largest human chromosome contains ~0.3 billion base 


pairs. Given that each base pair is 0.34 nm in length, 0.3 billion base 
pairs would constitute a DNA molecule from just one chromosome 
of greater than 10 cm long. How is it possible to fit all 46 chromo- 
somes into a nucleus that is only 10 um (1 x 10° m) in diameter and, 
at the same time, maintain the DNA in a state that is accessible to 
enzymes and regulatory proteins? Just as important, how is the sin- 
gle DNA molecule of each chromosome organized so that it does not 
become hopelessly tangled with other chromosomes? The answers 
lie in the remarkable manner in which a DNA molecule is packaged 
in eukaryotic cells. 


Nucleosomes: The Lowest Level 
of Chromosome Organization 


Chromosomes are composed of DNA and associated proteins, 
together called chromatin. The orderly packaging of eukaryotic DNA 
depends on histones, a remarkable group of small proteins that 
possess an unusually high content of the basic amino acids arginine 
and lysine. There are five types of histones named H1, H2A, H2B, 
H3, and H4. The amino acid sequences of histones, particularly H3 
and H4, have undergone very little change over long periods of evo- 
lutionary time. The H4 histones of both peas and cows, for example, 
contain 102 amino acids, and their sequences differ at only two 
amino acid residues. Why are histones so highly conserved? One 
reason is the positively charged histones interact with the negatively 
charged backbone of the DNA molecule, which is identical in all 
organisms. In addition, nearly all of the amino acids in a histone 
molecule are engaged in an interaction with another molecule, either 
DNA or another histone. As a result, very few amino acids in a his- 
tone can be replaced with other amino acids without severely affect- 
ing the function of the protein. 

In the early 1970s, it was found that when chromatin was treated. 
with nonspecific nucleases, most of the DNA was converted to frag- 
ments of approximately 200 base pairs in length. In contrast, a simi- 
lar treatment of naked DNA (i.e., DNA devoid of proteins) produced 
a randomly sized population of fragments. This finding suggested 
that chromosomal DNA was protected from enzymatic attack, 
except at certain periodic sites along its length. It was presumed that 
the proteins associated with the DNA were providing the protection. 
In 1974, using data from nuclease digestion and other types of infor- 
mation, Roger Kornberg proposed an entirely new structure for 
chromatin. Kornberg proposed that DNA and histones are organ- 
ized into repeating subunits, called nucleosomes. We now know 
that each nucleosome contains a nucleosome core particle consisting 
of 146 base pairs of supercoiled DNA (page 381) wrapped almost 
twice around a disk-shaped complex of eight histone molecules 
(FIGURE 12.12a). The histone core of each nucleosome consists of 
two copies each of histones H2A, H2B, H3, and H4 assembled into 
an octamer (FIGURE 12.13a). The remaining histone—type H1— 
resides outside the nucleosome core particle. The H1 histone is 
referred to as a linker histone because it binds to part of the linker 
DNA that connects one nucleosome core particle to the next. 
Fluorescence studies indicate that H1 molecules continuously dis- 
sociate and reassociate with chromatin. Together the H1 protein and 
the histone octamer interact with about 168 base pairs of DNA. H1 
histone molecules can be selectively removed from the chromatin 
fibers by subjecting the preparation to solutions of low ionic strength. 
When H1-depleted chromatin is observed under the electron micro- 
scope, the nucleosome core particles and naked linker DNA can be 
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FIGURE 12.12 Nucleosomal organization of 
chromatin. (a) Schematic diagram showing the 
structure of a nucleosome core particle and an 
associated histone H1 molecule. The core particle 
itself consists of approximately 1.8 turns (146 
base pairs) of negatively supercoiled DNA 
wrapped around eight core histone molecules 
(two each of H2A, H2B, H3, and H4). The H1 
linker histone binds near the sites where DNA 
enters and exits the nucleosome. Two alternate 
positions of the H1 molecule are shown. 

(b) Electron micrograph of chromatin fibers 
released from the nucleus of a Drosophila cell. 
The nucleosome core particles are approximately 
10 nm in diameter and are connected by short 
strands of naked linker DNA, which are approxi- 
mately 2 nm in diameter. 


Source: (6) Courtesy of Oscar L. Miller, University 
of Virginia. 


FIGURE 12.13 The structure of a nucleosome. (a) Schematic representation of a nucleosome core particle with its histone octamer composed of four 
histone heterodimers (two H3/H4 dimers and two H2A/H2B dimers). (b) X-ray crystallographic structure of a nucleosome core particle viewed down the 


central axis of the DNA superhelix, showing the position of each of the eight histone molecules of the core octamer. The histones are organized into four 
dimeric complexes. Each histone dimer binds 27 to 28 base pairs of DNA, with contacts occurring where the minor groove of the DNA faces the histone 
core. (c) A simplified, schematic model of half of a nucleosome core particle, showing one turn (73 base pairs) of the DNA superhelix and four core histone 
molecules. The four different histones are shown in separate colors, as indicated by the key. Each core histone is seen to consist of (1) a globular region, 
called the “histone fold,” consisting of three « helices (represented by the cylinders) and (2) a flexible, extended N-terminal tail (indicated by the letter N) 
that projects out of the histone disk and out past the DNA double helix. The intermittent points of interaction between the histone molecules and the DNA 
are indicated by white hooks. The dashed lines indicate the outermost portion of the histone tails, which are sites of modification. These flexible tails lack a 
defined tertiary structure and therefore do not appear in the X-ray structure shown in part b. 


Source: (a) From C. David Allis, et al., Epigenetics, Figure 3.5, p. 30, copyright 2007. Reprinted with permission from Cold Spring Harbor Laboratory Press; 


(b) From Karolin Luger and Timothy J. Richmond, et al., Nature 389:252, 1997, Fig. 1. Reprinted by Permission of Macmillan Publishers Limited; 


(c) After drawing by D. Rhodes; © 1997, by Macmillan Publishers Limited. 


seen as separate elements, which together appear like “beads on a 
string” (Figure 12.120). 

Our understanding of DNA packaging has been greatly advanced 
by portraits of the nucleosome core particle obtained by X-ray crys- 
tallography. The eight histone molecules that comprise a nucleosome 
core particle are organized into four heterodimers: two H2A-H2B 
dimers and two H3-H4 dimers (Figure 12.13a,b). Dimerization of 
histone molecules is mediated by their C-terminal domains, which 


consist largely of « helices (represented by the cylinders in Figure 
12.13c) folded into a compact mass in the core of the nucleosome. In 
contrast, the N-terminal segments from each core histone (and also 
the C-terminal segment of H2A) take the form of a long, flexible tail 
(represented by the dashed lines of Figure 12.13c) that extends out 
through the DNA helix that is wrapped around the core particle. For 
many years, histones were thought of as inert, structural molecules, 
but the extending N-terminal segments are targets of key enzymes 


that play a role in making the chromatin accessible to proteins. In this 
way, chromatin is a dynamic cellular component in which histones, 
regulatory proteins, and a variety of enzymes move in and out of the 
nucleoprotein complex to facilitate the complex tasks of DNA tran- 
scription, compaction, replication, recombination, and repair. 
Histone modification is not the only mechanism that alters the 
histone character of nucleosomes. In addition to the four “conven- 
tional” core histones discussed above, several alternate versions of the 
H2A and H3 histones are also synthesized in most cells. The impor- 
tance of these histone variants, as they are called, remains largely 
unexplored, but they are thought to have specialized functions. The 
localization and apparent function of one of these variants, CENP-A, 
is discussed on page 477. Another variant, H2A.X, is distributed 
throughout the chromatin, where it replaces conventional H2A in a 
fraction of the nucleosomes. H2A.X becomes phosphorylated at sites 
of DNA-strand breakage and may play a role in recruiting the 
enzymes that repair the DNA. Two other core histone variants— 
H2A.Z and H3.3—have been implicated in numerous activities, 
including the activation of transcription, but their roles are debated. 
We began this section by wondering how a nucleus 10 pm in 
diameter can pack 200,000 times this length of DNA within its 
boundaries. The assembly of nucleosomes is the first important step 
in the compaction process. With a nucleotide-nucleotide spacing of 
0.34 nm, the 200 base pairs of a single 10-nm nucleosome would 
stretch nearly 70 nm if fully extended. Consequently, it is said that 
the packing ratio of the DNA of nucleosomes is approximately 7:1. 


(b) 


376-380 2014. 


Higher Levels of Chromatin Structure 


A DNA molecule wrapped around nucleosome core particles of 10-nm 
diameter is the lowest level of chromatin organization. Chromatin 
does not, however, exist within the cell in this relatively extended, 
“beads-on-a-string” state. When chromatin is released from nuclei 
and prepared at physiologic ionic strength, a fiber of approximately 
30-nm thickness is observed (FIGURE 12.14a). Despite more than two 
decades of investigation, the structure of the 30-nm fiber remains a 
subject of debate. Cryo-electron microscopy reconstructions suggest 
the structure shown in Figure 12.14b,c, in which nucleosomes line up 
end-to-end into two stacks of nucleosomes that form a double helical 
structure. Successive nucleosomes along the DNA are arranged in two 
different stacks and alternating nucleosomes interact across the helix 
via their linker DNA. Assembly of the 30-nm fiber increases the DNA- 
packing ratio an additional 6-fold, or about 40-fold altogether. 
Maintenance of the 30-nm fiber depends on interactions between 
histone molecules of neighboring nucleosomes. Linker histones and 
core histones have both been implicated in higher-order packaging of 
chromatin. If, for example, H1 linker histones are selectively extracted 
from compacted chromatin, the 30-nm fibers uncoil to form the 
thinner, more extended beaded filament shown in Figure 12.12). 
Adding back H1 histone leads to restoration of the higher-order 
structure. Core histones of adjacent nucleosomes may interact with 
one another by means of their long, flexible tails. Structural studies 
indicate, for example, that the N-terminal tail of an H4 histone from 


(c) 


FIGURE 12.14 The 30-nm fiber. (a) Electron micrograph of a 30-nm chromatin fiber released from a 
nucleus following lysis of the cell in a hypotonic salt solution. (6) Model for the structure of the 30-nm fiber 
based on fitting atomic structures of nucleosomes into cryo-electron microscopy data. Histones are shown in 
red and green. DNA is shown in blue, gold, and purple. Nucleosomes line up to form a double helix with pairs 
of successive nucleosomes in the chain (depicted here by the same color DNA) alternating between the two 
helices, with linker DNA spanning across the middle of the double helix. (c) Schematic of the fiber structure, 
showing 12 nucleosomes numbered N1-N12, which stack into pairs. These pairs then associate end to end to 
form the two chains of a double helix. Linker DNA is shown in green. 


Source: (a) Courtesy of Barbara Hamkalo and Jerome B. Rattner; (b, c) From Feng Song et al., Science 344: 
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one nucleosome core particle can reach out and make extensive con- 
tact with both the linker DNA between nucleosome particles and the 
H2A/H2B dimer of adjacent particles. These types of interactions are 
thought to mediate the folding of the nucleosomal filament into a 
thicker fiber. In fact, chromatin fibers prepared with H4 histones that 
lack their tails are unable to fold into higher-order fibers. 

The next stage in the hierarchy of DNA packaging is thought to 
occur as the 30-nm chromatin fiber is gathered into a series of large, 
supercoiled loops, or domains, that may be compacted into even 
thicker (80-100 nm) fibers. The DNA loops are apparently tethered 
at their bases to proteins that may be part ofa poorly defined nuclear 
scaffold. Normally, loops of chromatin fibers are spread out within 
the nucleus and cannot be visualized, but their presence can be 
revealed under certain circumstances. For instance, when isolated 
mitotic chromosomes are treated with solutions that extract his- 
tones, the histone-free DNA can be seen to extend outward as loops 
from a protein scaffold (FIGURE 12.15a). Among the proteins thought 
to play a key role in maintaining these DNA loops is cohesin, which 
is best known for its role in holding replicated DNA molecules 
together during mitosis (Section 14.7). Cohesin is a ring-shaped 
protein that may act as shown in Figure 12.15b. 

The mitotic chromosome represents the ultimate in chromatin 
compactness; 1 um of mitotic chromosome length typically contains 
approximately 1 cm of DNA, which represents a packing ratio of 
10,000:1. This compaction occurs by a poorly understood process 
that is discussed in Section 14.7. An overview of the various levels of 
chromatin organization, from the nucleosomal filament to a mitotic 
chromosome, is depicted in FIGURE 12.16. 


FIGURE 12.16 Levels of organization of chromatin. Naked DNA molecules are 
wrapped around histones to form nucleosomes, which represent the lowest level of 
chromatin organization. Nucleosomes are organized into 30-nm fibers, which in turn are 
organized into looped domains. When cells prepare for mitosis, the loops become 


further compacted into mitotic chromosomes (see Figure 14.13). 


Cohesin ring 


(b) 


FIGURE 12.15 Chromatin loops: a higher level of chromatin structure. 
(a) Electron micrograph of a mitotic chromosome that had been treated to 
remove histones. The histone-depleted chromosome displays loops of 
DNA that are attached at their bases to a residual protein scaffold. (b) A 
simplified model by which rings of cohesion could play a role in maintaining 
loops of interphase DNA. 

Source: (a) From James R. Paulson and U. K. Laemmli, Cell 12:823, 1977, 
with permission of Elsevier. 
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1. What is the relationship between the histones and 
DNA of a nucleosome core particle? How was the 
existence of nucleosomes first revealed? How are 
nucleosomes organized into higher levels of 
chromatin? 
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12.4 Heterochromatin 


After mitosis has been completed, most of the chromatin in highly 
compacted mitotic chromosomes returns to its diffuse interphase 
condition. Approximately 10 percent of the chromatin, however, 
generally remains in a condensed, compacted form throughout 
interphase. This compacted, densely stained chromatin is typically 
concentrated near the periphery of the nucleus, often in proximity 
with the nuclear lamina, as indicated in Figure 12.54. Chromatin 
that remains compacted during interphase is called heterochro- 
matin to distinguish it from euchromatin, which returns to a dis- 
persed, active state. When a radioactively labeled RNA precursor 
such as [*H]uridine is given to cells that are subsequently fixed, sec- 
tioned, and autoradiographed, the clumps of heterochromatin 
remain largely unlabeled, indicating that they have relatively little 
transcriptional activity. The peripheral regions of the nucleus are 
thought to contain factors that promote transcriptional repression, 
making it a favorable environment for the localization of hetero- 
chromatin. As discussed later, the state of a particular region of the 
genome, whether it is euchromatic or heterochromatic, is stably 
inherited from one cell generation to the next. 

Heterochromatin is divided into two classes. Constitutive 
heterochromatin remains in the compacted state in all cells at all 
times and, thus, represents DNA that is permanently silenced. In 


ae 
(a) (b) 


mammalian cells, the bulk of the constitutive heterochromatin is 469 


found in regions that flank the telomeres and centromere of 
each chromosome and in a few other sites, such as the distal arm of 
the Y chromosome in male mammals. The DNA of constitutive hetero- 
chromatin consists primarily of repeated sequences (page 384) and 
contains relatively few genes. In fact, when genes that are normally 
active move into a position adjacent to these regions as a result of 
transposition or translocation, they tend to become transcriptionally 
silenced, a phenomenon known as the position effect. It is thought 
that heterochromatin contains components whose influence can 
spread outward a certain distance, affecting nearby genes. The 
spread of heterochromatin along the chromosome is apparently 
blocked by specialized barrier sequences (boundary elements) in the 
genome. Constitutive heterochromatin also serves to inhibit genetic 
recombination between homologous repetitive sequences. This type 
of recombination can lead to DNA duplications and deletions (see 
Figure 10.23). 

Facultative heterochromatin is chromatin that has been 
specifically inactivated during certain stages of an organism’s life or 
in certain types of differentiated cells (as in Figure 17.9b). An exam- 
ple of facultative heterochromatin can be seen by comparing the sex 
chromosomes in the cells of a female mammal to those of a male. 
The cells of males have a tiny Y chromosome and a much larger X 
chromosome. Because the X and Y chromosomes have only a few 
genes in common, males have a single copy of most genes that are 
carried on the sex chromosomes. Although cells of females contain 
two X chromosomes, only one of them is transcriptionally active. 
The other X chromosome remains condensed as a heterochromatic 
clump (FIGURE 12.17a) called a Barr body after the researcher who 
discovered it in 1949. Formation of a Barr body ensures that the cells 
of both males and females have the same number of active X chro- 
mosomes and thus synthesize equivalent amounts of the products 
encoded by X-linked genes. 


FIGURE 12.17 Facultative heterochromatin: the inactive X chromosome. (a) The inactivated X chromosome appears as a darkly staining heterochro- 
matic structure, called a Barr body (arrows). (6) Random inactivation of either X chromosome in different cells during early embryonic development creates a 
mosaic of tissue patches and is responsible for the color patterns in calico cats. Each patch comprises the descendants of one cell that was present in the 
embryo at the time of inactivation. These patches are visually evident in calico cats, which are heterozygotes with an allele for black coat color residing on 
one X chromosome and an allele for orange coat color on the other X. This explains why male calico cats are virtually nonexistent because all cells in the 
male have either the black or orange coat color allele. (The white spots on this cat are due to a different, autosomal coat color gene.) (c) This kitten was 
cloned from the cat shown in b. The two animals are genetically identical but have different coat patterns, a reflection of the random nature of the 


X inactivation process (and likely other random developmental events). 


Source: (a) Courtesy of E. G. (Mike) Bertram; (b-c) Courtesy of College of Veterinary Medicine and Biomedical Sciences, Texas A&M University. 
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Based on her studies of the inheritance of coat color in mice, the 
British geneticist Mary Lyon proposed the following in 1961: 


1. Heterochromatinization of the X chromosome in female mam- 
mals occurs during early embryonic development and leads to 
the inactivation of the genes on that chromosome. 


2. Heterochromatinization in the embryo is a random process in 
the sense that the paternally derived X chromosome and the 
maternally derived X chromosome stand an equal chance of 
becoming inactivated in any given cell. Consequently, at the time 
of inactivation, the paternal X can be inactivated in one cell of the 
embryo, and the maternal X can be inactivated in a neighboring 
cell. Once an X chromosome has been inactivated, its hetero- 
chromatic state is transmitted through many cell divisions, so 
that the same X chromosome is inactive in all the descendants of 
that particular cell. The process of X-chromosome inactivation is 
best studied in vitro during differentiation of mouse embryonic 
stem (ES) cells. ES cells are derived from very early embryos 
(page 18) and thus have two active X chromosomes. 


3. Reactivation of the heterochromatinized X chromosome occurs 
in female germ cells prior to the onset of meiosis. Consequently, 
both X chromosomes are active during oogenesis, and all of the 
gametes receive a euchromatic X chromosome. 


The Lyon hypothesis was soon confirmed.” Because maternally 
and paternally derived X chromosomes may contain different alleles 
for the same trait, adult females are in a sense genetic mosaics, where 
different alleles function in different cells. X-chromosome mosai- 
cism is reflected in the patchwork coloration of the fur of some 
mammals, including calico cats (Figure 12.17b,c). Pigmentation 
genes in humans are not located on the X chromosome, hence the 
absence of “calico women?” Mosaicism due to X inactivation can be 
demonstrated in women, nonetheless. For example, if a narrow 
beam of red or green light is shone into the eyes of a woman who is 
a heterozygous carrier for red-green color blindness, patches of reti- 
nal cells with defective color vision can be found interspersed among 
patches with normal vision. 

The mechanism responsible for X inactivation has been a focus 
of attention since a 1992 report suggesting that inactivation is initi- 
ated by a noncoding RNA molecule—rather than a protein—that is 
transcribed from one of the genes (called XIST in humans) on the 
X chromosome that becomes inactivated (the X,). The XIST RNA is 
a large transcript (over 17 kb long), which distinguishes it from 
many other noncoding RNAs that tend to be quite small. Because of 
its size, XIST is described as a long noncoding RNA (or IncRNA) and 
represents only the first in a long and growing list of IncRNAs that 
have been discovered as regulatory factors in cellular activities. In 
fact, at least seven distinct IncRNAs are now thought to be involved 
in X-chromosome inactivation; we will restrict the present discus- 
sion to XIST. The XIST RNA selectively accumulates along the 
length of the X chromosome from which it is transcribed, where it 


?The random inactivation of X chromosomes discussed here, which occurs after the 
embryo implants in the uterus, is actually the second wave of X chromosome inactiva- 
tion to occur in the embryo. The first wave, which occurs very early in development, 
is not random but rather leads only to the inactivation of X chromosomes that had 
been donated by the father. This early inactivation of paternal X chromosomes is 
maintained in the cells that give rise to extraembryonic tissues (e.g., the placenta) and 
is not discussed in the text. Early paternal X inactivation is erased in cells that give rise 
to embryonic tissue and random X inactivation subsequently occurs. 


helps recruit certain protein complexes that inactivate the genes at 
selected sites. The XIST gene is required to initiate inactivation, but 
not to maintain it from one cell generation to the next. This conclu- 
sion is based on the discovery of tumor cells in certain women that 
contain an inactivated X chromosome whose XIST gene is deleted. 
X inactivation is thought to be maintained by DNA methylation 
(page 500) and repressive histone modifications, as discussed in the 
next section. 


The Histone Code and Formation 
of Heterochromatin 


Figure 12.13c shows a schematic model of the nucleosome core 
particle with its histone tails projecting outward. But this is only a 
general portrait that obscures important differences among nucle- 
osomes. Cells contain a remarkable array of enzymes that are able to 
add chemical groups to or remove them from specific amino acid 
residues on the histone tails. Those residues that are subject to modi- 
fication, most notably by methylation, acetylation, phosphorylation, 
or ubiquitination, are indicated by the colored bars in FIGURE 12.18. 
The past decade has seen the emergence of a hypothesis known as 
the histone code, which postulates that the state and activity of a 
particular region of chromatin depend on the specific modifications, 
or combinations of modifications, to the histone tails in that region. 
In other words, the pattern of modifications adorning the tails of the 
core histones contains encoded information governing the proper- 
ties of those nucleosomes. Studies suggest that histone tail modifica- 
tions act in two ways to influence chromatin structure and function. 


1. The modified residues serve as docking sites to recruit a specific 
array of nonhistone proteins, which then determine the proper- 
ties and activities of that segment of chromatin. A sampling of 
some of the specific proteins that bind selectively to modified 
histone residues is depicted in FIGURE 12.19. Each of the proteins 
bound to the histones in Figure 12.19 is capable of modulating 
some aspect of chromatin activity or structure. 


2. The modified residues alter the manner in which the histone tails 
of neighboring nucleosomes interact with one another or with 
the DNA to which the nucleosomes are bound. Changes in these 
types of interactions can lead to changes in the higher order 
structure of chromatin. Acetylation of the lysine residue at posi- 
tion 16 on histone H4, for example, interferes with its interaction 
with a histone H2A molecule of an adjacent nucleosome, which 
in turn inhibits formation of the compact 30-nm chromatin fiber. 


For the moment, we will restrict the discussion to the formation 
of heterochromatin as it occurs, for example, during X chromosome 
inactivation. For the sake of simplicity, we will focus on modification 
of a single residue—lysine 9 of H3—which will illustrate the general 
principles by which cells utilize the histone code. The actions of sev- 
eral other histone modifications are indicated in Figure 12.18 and 
discussed in the accompanying legend, and another example is 
described on page 496. As we will see throughout this chapter, tech- 
niques have been developed in recent years to analyze changes that 
affect genome transcription, such as histone modifications, on a 
genome-wide level, rather than simply looking at these changes one 
gene at a time. This has given us a much broader view of the general 


*Approximately 15 percent of genes on the chromosome escape inactivation by an 
unknown mechanism. The “escapees” include genes that are also present on the 
Y chromosome, which ensures that they are expressed equally in both sexes. 
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FIGURE 12.18 Histone modifications and the histone code. The amino terminal tails from histone proteins that extend out past the DNA in nucleo- 
somes can be enzymatically modified by the covalent addition of methyl, acetyl, and phosphate groups. Methyl groups are added to either lysine (K) or 
arginine (R) residues, acetyl groups to lysine residues, and phosphate groups to serine (S) residues. The small protein ubiquitin can be added to one of the 
lysine residues in the core (rather than the tail) of HZA and H2B. Each lysine residue can have one, two, or three added methyl groups, and each arginine 
residue can have one or two added methyl groups. These modifications affect the affinity of the histone for interacting proteins that control the transcrip- 
tional activity of chromatin, which has led to the concept of a histone code. It is widely found that acetylation of lysines leads to transcriptional activation. 
The effects of methylation depend strongly on which of these residues is modified. For example, methylation of lysine 9 of histone H3 (i-e., H3K9) is typically 
present in heterochromatin and associated with transcriptional repression, as discussed in the text. Methylation of H3K27 and H4K20 is also strongly associ- 
ated with transcriptional repression, whereas methylation of H3K4 and H3K36 is associated with transcriptional activation. Just as there are enzymes that 
catalyze each of these modifications, there are also enzymes (deacetylases, demethylases, phosphatases, and deubiquitinases) that specifically remove them. 


Source: From C. David Allis et al., Epigenetics, Figure 3.6, page 31, Copyright 2007. Reprinted with permission from Cold Spring Harbor Press. 


importance of each of these phenomena than was possible only a few 
years ago. 

Comparison of the nucleosomes present within heterochro- 
matic versus euchromatic chromatin domains revealed a striking 
difference. The lysine residue at the #9 position (Lys9 or K9) of 
histone H3 in heterochromatic domains is largely methylated, 
whereas this same residue in euchromatic domains is often acetylated. 
Removal of the acetyl groups from H3 and H4 histones is among the 
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initial steps in conversion of euchromatin into heterochromatin. The 
correlation between transcriptional repression and histone deacet- 
ylation can be seen by comparing the inactive, heterochromatic X 
chromosome of female cells, which contains deacetylated histones, 
to the active, euchromatic X chromosome, whose histones exhibit a 
normal level of acetylation (FIGURE 12.20). Histone deacetylation is 
accompanied by methylation of H3K9, which is catalyzed by an 
enzyme (a histone methyltransferase) that appears to be dedicated to 


FIGURE 12.19 Examples of proteins that bind selectively to modified H3 or H4 residues. Each of the bound proteins possesses an activity that alters 
the structure and/or function of the chromatin. There is an added complexity that is not shown in this drawing in that modifications at one histone residue 
can influence events at other residues, a phenomenon known as cross-talk. For example, the binding of the heterochromatin protein HP1 to H3K9 is blocked 
by phosphorylation of the adjacent |serine residue (H3S10), which typically occurs during mitosis. 


Source: From T. Kouzarides, Cell 128:696, 2007; Cell by Cell Press. Reproduced with permission of Cell Press in the format reuse in a book/textbook via 


Copyright Clearance Center. 
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FIGURE 12.20 Experimental demonstration of a correlation between 
transcriptional activity and histone acetylation. This metaphase chromo- 
some spread has been labeled with fluorescent antibodies to acetylated 
histone H4, which fluoresce green. All of the chromosomes except the 
inactivated X stain brightly with the antibody against the acetylated histone. 
Source: From P. Jeppesen and B. M. Turner, cover of Cell vol. 74, No. 2, 
1993, with permission from Elsevier. Courtesy Peter Jeppesen. 


this, and only this, particular function. This enzyme, called SUV39H1 
in humans, can be found localized within heterochromatin, where it 
may stabilize the heterochromatic nature of the region through its 
methylation activity. 

The formation of a methylated lysine at the #9 position endows 
the histone H3 tail with an important property: It becomes capable 
of binding with high affinity to proteins that contain a particular 
domain, called a chromodomain. The human genome contains at 
least 30 proteins with chromodomains, the best studied of which is 
heterochromatic protein 1 (or HP1). HP1 has been implicated in the 
formation and maintenance of heterochromatin. Once bound to an 
H3 tail, HP1 is thought to interact with other proteins, including 
(1) SUV39H1, the enzyme responsible for methylating the H3K9 


FIGURE 12.21 A model in which small RNAs govern the formation of 


heterochromatin. Recent studies suggest that noncoding RNAs play a role in 


heterochromatin formation. In this model, RNAs are transcribed from both 


strands of repetitive DNA sequences (step 1). The RNAs form double-stranded 


molecules (step 2) that are processed by the endonuclease Dicer and other 


components of the RNAi machinery (page 431) to form a single-stranded siRNA 
guide and an associated protein complex (step 3). In step 4, the siRNA-protein 
complex has bound to a complementary segment of a nascent RNA and has 


recruited the histone methyltransferase SUV39H1. This leads to addition of 


methyl groups to the K9 residue of histone H3, replacing acetyl groups that were 
previously linked to the H3K9 residues. (The acetyl groups, which are character- 
istic of transcribed regions of euchromatin, are removed enzymatically from the 


lysine residues by a histone deacetylase, which is not shown.) In step 5, the 


acetyl groups have all been replaced by methyl! groups, which serve as binding 
sites for the HP1 protein (step 6). The boundary element in the DNA prevents 
the spread of heterochromatinization into adjacent regions of chromatin. Once 
HP1 has bound to the histone tails, the chromatin can be packaged into higher- 
order, more compact structures by means of interaction between HP1 protein 
molecules (step 7). The enzyme SUV39H1 can also bind to methylated histone 
tails (not shown) so that additional nucleosomes can become methylated. In step 


8, a region of highly compacted heterochromatin has been formed. 


residue and (2) other HP1 molecules on nearby nucleosomes. These 
binding properties of the HP1 protein promote the formation of an 
interconnecting network of methylated nucleosomes, leading to a 
compact, higher-order chromatin domain (see FIGURE 12.21). Most 
importantly, this state is transmitted through cell divisions from one 
cell generation to the next (discussed on page 480). Histone modifi- 
cations present in the parent cell must somehow instruct the forma- 
tion of the same modifications on newly deposited nucleosomes in 
the daughter cells. The mechanism by which this occurs is poorly 
understood. 

Studies in a number of organisms indicate that small RNAs, 
similar in nature to those involved in RNA interference (page 431), 
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play an important role in targeting a particular region of the genome 
to undergo H3K9 methylation and subsequent heterochromatiniza- 
tion. A model showing the types of events thought to take place dur- 
ing heterochromatin assembly is depicted in Figure 12.21. RNAs 
derived from highly repetitive elements like centromeres or from 
regions harboring transposable elements are processed by the RNA 
interference pathway and function to recruit histone modification 
enzymes, including histone methyltransferases that modify lysine 9 
on histone H3. Modification in this manner recruits members of the 
HP1 chromodomain protein family, which leads to heterochromatin 
formation and gene silencing. This is particularly important for the 
silencing of mobile transposable elements and possibly viral DNA. 
If components of the RNAi machinery are deleted, methylation of 
H3K9 and heterochromatinization is impaired, allowing activation 
of mobile DNA elements. In addition to its role in the formation of 
heterochromatin, the RNAi machinery may also function in the 
maintenance of the heterochromatic state from one cell generation 
to the next. These findings point to yet another role, on a rapidly 
growing list of roles, performed by noncoding RNAs. It would not be 
surprising, given the fact that most of the genome is now thought to 
be transcribed (page 436), to discover that RNAs play a major role in 
guiding many of the changes in chromatin structure that are known 
to occur during embryonic development or in response to physio- 
logical stimuli. 


REVIEW | 


1. What is the difference in structure and function 
between heterochromatin and euchromatin? 
Between constitutive and facultative 
heterochromatin? Between an active and inactivated 
X chromosome in a female mammalian cell? How is 
the histone code a determinant of the state of a 
chromatin region? 


—— 


12.5 The Structure of a 
Mitotic Chromosome 


The relatively dispersed state of the chromatin of an interphase cell 
favors interphase activities, such as replication and transcription. In 
contrast, the chromatin of a mitotic cell exists in its most highly con- 
densed state, which facilitates the delivery of DNA to each daughter 
cell. When a chromosome undergoes compaction during mitotic 
prophase, it adopts a distinct and predictable shape determined pri- 
marily by the length of the DNA molecule in that chromosome and 
the position of the centromere (discussed later). The mitotic chro- 
mosomes of a dividing cell can be displayed by the technique 
depicted in FIGURE 12.22a. In this technique, a dividing cell is bro- 
Ken open, and the mitotic chromosomes from the cell’s nucleus settle 
and attach to the surface of the slide over a very small area (as in 
Figure 12.22b). The chromosomes displayed in Figure 12.22b have 
been prepared using a staining methodology in which chromosome 
preparations are incubated with multicolored, fluorescent DNA 
probes that bind specifically to particular chromosomes. By using 
various combinations of DNA probes and computer-aided visualiza- 
tion techniques, each chromosome can be “painted” with a different 
“virtual color,’ making it readily identifiable to the trained eye. In 


addition to providing a colorful image, this technique delivers 473 


superb resolution, which allows clinical geneticists to discover chro- 
mosomal aberrations that might otherwise be missed (see Figure 2 
of the Human Perspective in Section 12.6). 

If the individual chromosomes are cut out of a photograph such 
as that of Figure 12.22b, they can be matched up into homologous 
pairs (23 in humans) and ordered according to decreasing size, as 
shown in Figure 12.22c. A preparation of this type is called a karyo- 
type. The chromosomes shown in the karyotype of Figure 12.22c 
have been prepared using a staining procedure that gives the chro- 
mosomes a cross-banded appearance. The pattern of these bands is 
highly characteristic for each chromosome of a species and provides 
a basis to identify chromosomes and compare them from one spe- 
cies to the next (see Figure 3 in the Human Perspective). Karyotypes 
are routinely prepared from cultures of blood cells and used to 
screen individuals for chromosomal abnormalities. As discussed in 
the accompanying Human Perspective, extra, missing, or grossly 
altered chromosomes can be detected in this manner. 


Telomeres 


In contrast to most prokaryotic chromosomes that consist of a circu- 
lar molecule of DNA, each eukaryotic chromosome contains a sin- 
gle, continuous DNA molecule. The tips of each DNA molecule are 
composed of an unusual stretch of repeated sequences that, together 
with a group of specialized proteins, form a cap at each end of the 


chromosome called a telomere. Human telomeres contain the 
TTAGGG 
AATCCC 
(FIGURE 12.23a). Unlike most repeated sequences that vary consid- 


erably from species to species, the same telomere sequence is found 
throughout the vertebrates, and similar sequences are found in most 
other organisms. This similarity in sequence suggests that telomeres 
have a conserved function in diverse organisms. 

As discussed in Chapter 13, the DNA polymerases that replicate 
DNA cannot initiate synthesis of a strand of DNA but can only add 
nucleotides to the 3’ end of an existing strand. Replication is initi- 
ated at the 5’ end of each newly synthesized strand by synthesis of a 
short RNA primer (step 1, FIGURE 12.24a) that is subsequently 
removed (step 2). Because of this mechanism, and the additional 
processing steps shown in step 3, the 5’ end of each strand is missing 
a short segment of DNA. As a result, the strand with the 3’ end over- 
hangs the strand with the 5’ end. Rather than existing as an unpro- 
tected single-stranded terminus, the overhanging strand is “tucked 
back” into the double-stranded portion of the telomere to form a 
loop as illustrated in Figure 12.24b. This conformation is thought to 
protect the telomeric end of the DNA. A number of DNA-binding 
proteins have been identified that bind specifically to telomeres and 
are essential for telomere function. The protein that is bound to the 
chromosomes in Figure 12.23b plays a role in regulating telomere 
length in yeast. The telomeric DNA of animal cells binds a 6-subunit 
protein complex called shelterin. Among its functions, shelterin pre- 
vents a cell’s DNA repair machinery from mistaking the ends of the 
telomeric DNA as damaged or broken DNA strands, which would 
have serious consequences for the integrity of the genome. 

If cells were not able to replicate the ends of their DNA, the 
chromosomes would become shorter and shorter with each round of 
cell division (Figure 12.24a). This predicament has been called the 
“end-replication problem? The primary mechanism by which 
organisms solve the end-replication problem came to light in 1984 


sequence repeated from about 500 to 5000 times 
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FIGURE 12.22 Human mitotic chromosomes and karyotypes. 

(a) Procedure used to obtain preparations of mitotic chromosomes for 
microscopic observation from leukocytes in the peripheral blood. 

(b) Photograph of a cluster of mitotic chromosomes from a single dividing 
human cell. The DNA of each chromosome has hybridized to an assortment 
of DNA probes that are linked covalently to two or more fluorescent dyes. 
Different chromosomes bind different combinations of these dyes and, 
consequently, emit light of different wavelengths. Emission spectra from 
different chromosomes are converted into distinct and recognizable display 
colors by computer processing. Pairs of homologous chromosomes can be 
identified by searching for chromosomes of the same color and size. (c) The 
stained chromosomes of a human male arranged in a karyotype. Karyotypes 
showing paired homologues, arranged according to chromosome size, are 
prepared from a photograph of chromosomes released from a single nucleus. 
Source: (b) From E. Schréck et al., courtesy of Evelin Schrock and Thomas 
Ried, Science 273:495, 1996; © 1996, Reprinted with permission from 
AAAS; (c) CNRI/Science Photo Library/ Photo Researchers. 


(a) 


(b) 2um 


FIGURE 12.23 Telomeres. (a) In situ hybridization of a DNA probe containing the sequence TTAGGG, which localizes to the telomeres of human 
chromosomes. (b) Demonstration that certain proteins bind specifically to telomeric DNA. These chromosomes were prepared from a meiotic nucleus of a 
yeast cell and incubated with the protein RAP1, which was subsequently localized at the telomeres by a fluorescent anti-RAP1 antibody. Blue areas indicate 
DNA staining, yellow areas represent anti-RAP1 antibody labeling, and red shows RNA stained with propidium iodide. Humans have a homologous telomere 


protein (hRAP1 that is part of the shelterin complex). 


Source: (a) From J. Meyne, in R. P. Wagner, Chromosomes: A Synthesis, Wiley, 1993, © 1993. This material is used by permission of John Wiley & Sons, Inc.; 
(b) From Franz Klein et al., J. Cell Biol. 117:940, 1992, Fig. 4c. Courtesy of Susan M. Gasser, reproduced with permission of the Rockefeller University Press. 


when Elizabeth Blackburn and Carol Greider of the University of 
California, Berkeley, discovered a novel enzyme, called telomer- 
ase, that can add new repeat units to the 3’ end of the overhanging 
strand (Figure 12.24c). Once the 3’ end of the strand has been 
lengthened, a conventional DNA polymerase can use the newly syn- 
thesized 3’ segment as a template to return the 5’ end of the comple- 
mentary strand to its previous length. Telomerase is a reverse 
transcriptase that synthesizes DNA using an RNA template. Unlike 
most reverse transcriptases, the enzyme itself contains the RNA that 
serves as its template (Figure 12.24c). 

Telomeres are very important parts of a chromosome: They are 
required for the complete replication of the chromosome; they form 
caps that protect the chromosomes from nucleases and other desta- 
bilizing influences; and they prevent the ends of chromosomes from 
fusing with one another. FIGURE 12.25 shows mitotic chromosomes 
from a mouse that has been genetically engineered to lack telomer- 
ase. Many of the chromosomes have undergone end-to-end fusion, 
which leads to catastrophic consequences as the chromosomes are 
torn apart in subsequent cell divisions. Recent experiments suggest 
additional roles for telomeres, which have made them a focus of cur- 
rent research. 

Suppose a researcher were to take a small biopsy of your skin, 
isolate a population of fibroblasts from the dermis, and allow these 
cells to grow in an enriched culture medium. The fibroblasts would 
divide every day or so in culture and eventually cover the dish. If a 
fraction of these cells were removed from the first dish and replated 
on a second dish, they would once again proliferate and cover the 
second dish. You might think you could subculture these cells 
indefinitely—as was thought in the first half of the last century—but 


you would be wrong. As discovered in 1961 by Leonard Hayflick and 
Paul Moorhead of the Wistar Institute in Philadelphia, the cells stop 
dividing entirely after about 50 to 80 population doublings and enter 
a stage referred to as replicative senescence. If the average length of 
the telomeres in the fibroblasts at the beginning and end of the 
experiment are compared, one finds a dramatic decrease in telomere 
length over time in culture. Telomeres shrink because most cells lack 
telomerase and are unable to prevent the loss of their chromosome 
ends. With each cell division, the telomeres of the chromosomes 
become shorter and shorter. Telomere shortening continues to a 
critical point at which a physiological response is triggered within 
each cell that causes that cell to cease growth and division. Cells that 
are able to reactivate telomerase can resume growth and division, 
and become immortal. Such cells are also typically capable of caus- 
ing cancer (see page 710). 

Interestingly, telomerase is absent from most of the body’s cells, 
but there are notable exceptions. The germ cells of the gonads retain 
telomerase activity, and the telomeres of their chromosomes do not 
shrink as the result of cell division. Consequently, each offspring 
begins life as a zygote that contains telomeres of maximal length. 
Similarly, the stem cells located in the lining of the skin and intestine 
and the hematopoietic stem cells of the blood-forming tissues also 
express this enzyme, which allows these cells to continue to prolifer- 
ate and generate the huge numbers of differentiated cells required in 
these organs. The importance of telomerase is revealed by a rare 
condition in which individuals have greatly reduced telomerase 
levels, because they are heterozygous for the gene that encodes 
either the telomerase RNA or one of the protein subunits. These indi- 
viduals suffer from bone marrow failure due to the inability of this 
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FIGURE 12.24 The end-replication problem and the role of telomerase. (a) When the DNA of a chromosome is replicated (step 1), the 5’ ends of the 
newly synthesized strands (red) contain a short segment of RNA (green), which had functioned as a primer for synthesis of the adjoining DNA. Once this 
RNA is removed (step 2), the 5’ end of the DNA becomes shorter relative to that of the previous generation. The 5’ ends at each end of the chromosome 
are further processed by nuclease activities (step 3), which increases the lengths of the single-stranded overhangs. (b) The single-stranded overhang does 
not remain as a free extension, but invades the duplex as shown here, displacing one of the strands, which forms a loop. The loop is a binding site for a 
complex of specific telomere-capping proteins that protect the ends of the chromosomes and regulate telomere length. (c) The mechanism of action of 
telomerase. The enzyme contains an RNA molecule that is complementary to the end of the G-rich strand, which extends past the C-rich strand. The 
telomerase RNA binds to the protruding end of the G-rich strand (step 1) and then serves as a template for the addition of nucleotides onto the 3’ terminus 
of the strand (step 2). After a segment of DNA is synthesized, the telomerase RNA slides to the new end of the strand being elongated (step 3) and serves 
as the template for the incorporation of additional nucleotides (step 4). The gap in the complementary strand is filled by the replication enzymes polymerase 
a-primase (see Figure 13.21). 

Source: (c) C. W. Greider and E. H. Blackburn, reprinted with permission from Nature 337:336, 1989, copyright 1989. Nature by Nature Publishing Group. 
Reproduced with permission of Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 


FIGURE 12.25 Telomerase and chromosome integrity. The chromosomes in this micrograph are from the 
cell of a mouse that lacks a functional gene for the enzyme telomerase. The telomeres appear as yellow spots 
following in situ hybridization with a fluorescently labeled telomere probe. Some chromosomes lack telomeres 
entirely and a number of chromosomes have fused to one another at their ends. Chromosome fusion produces 
chromosomes with more than one centromere, which in turn leads to chromosome breakage during cell 
division. The genetic instability resulting from loss of a telomere may be a major cause of cells becoming 
cancerous. 


Source: From Maria A. Blasco et al., Courtesy of Carol W. Greider, Cell, vol. 91, cover #1, 1997, with permission 


from Elsevier. 


blood-forming tissue to produce a sufficient number of blood cells 
over a normal human lifetime. 

If telomeres are such an important factor in limiting the number 
of times that a cell can divide, one might expect telomere shortening 
to be a major factor in normal human aging. A large body of research 
supports this proposal. In fact, some biologists describe telomere 
length as a type of “longevity clock,” providing a measure of how 
rapidly a particular individual is aging biologically, rather than just 
chronologically. In addition to age, psychological stress may be an 
important regulator of telomere length. Multiple studies have found 
that individuals experiencing chronic psychological stress, as well as 
early childhood trauma, show a statistically significant reduction in 
telomere length. These studies are mostly based on self-reported lev- 
els of stress and in some but not all cases involve small numbers of 
samples, nevertheless the idea that human psychology could poten- 
tially affect molecular biology at the level of telomeres is tantalizing 
and clearly calls for further investigation. Several companies now 
provide a service in which they measure the lengths of telomeres in 
a sample of a person's blood cells, but the jury is still out on whether 
such tests will provide a useful measure of an individual’s risk for 
age- or stress-related diseases. 

Even if a person with exceptionally long telomeres were likely to 
age more slowly than the average person, there is another aspect of 
the issue to consider. A large body of evidence indicates that telo- 
mere shortening plays a key role in protecting humans from cancer 
by limiting the number of divisions of a potential tumor cell. 
Whether a person with longer telomeres is more susceptible to the 
development of a serious malignancy remains to be determined. 
Regardless, the relationship between cancer growth and telomeres is 
a major focus of study. Malignant cells, by definition, are cells that 
have escaped the body’s normal growth controls and continue to 
divide indefinitely. How is it that malignant tumor cells can divide 
repeatedly without running out of telomeres and bringing on their 
own death? Unlike normal cells that lack detectable telomerase 
activity, approximately 90 percent of human tumors consist of cells 
that contain an active telomerase enzyme.’ According to one sce- 
nario, the growth of tumors is accompanied by intense selection for 
cells in which the expression of telomerase has been reactivated. The 
vast majority of tumor cells fail to express telomerase and die out, 
whereas the rare cells that express the enzyme are “immortalized.” 
This does not mean that activation of telomerase, by itself, causes 
cells to become malignant. As discussed in Chapter 16, the develop- 
ment of cancer is a multistep process in which cells typically develop 
abnormal chromosomes, changes in cell adhesion, and the ability to 
invade normal tissues. Telomere maintenance and unlimited cell 
division is thus only one property of cancer cells. But it is still an 
important property of cancer and if telomerase could be inhibited in 
cancer cells, this might prove an effective way to treat a wide range of 
cancers. A telomerase inhibitor called Imetelstat is currently in phase 
II clinical trials for treatment of thrombocythemia, a blood disase 
characterized by overproduction of platelets, and myelofibrosis, a 
form of chronic leukemia. It is interesting to note that the initial dis- 
coveries of telomeric DNA sequences and telomerase were carried 
out on Tetrahymena, the same single-celled pond creature exploited 
in the discovery of ribozymes (page 425). Tetrahymena has a major 
advantage for studying telomeres and telomerase using biochemical 
methods, because its genome is organized into a large number of 


‘The other 10 percent or so have an alternate mechanism based on genetic recombina- 
tion that maintains telomere length in the absence of telomerase. 


tiny chromosomes, so that a single cell contains a much larger number 477 


of telomeres than a typical mammalian cell. This means that it is 
easier to isolate telomerase from such a cell, because there is more of 
it there in the first place. This point serves as another reminder that 
discoveries of great medical significance are often obtained using 
model organisms with specific or unusual features that make a par- 
ticular aspect of cell biology easier to study. 


Centromeres 


Each chromosome depicted in Figure 12.22 contains a site where the 
outer surfaces are markedly indented. The site of the constriction marks 
the centromere of the chromosome (FIGURE 12.26). In humans the 
centromere contains a tandemly repeated, 171-base-pair DNA 
sequence (called «-satellite DNA) that extends for at least 500 kilo- 
bases. This stretch of DNA associates with specific proteins that dis- 
tinguish it from other parts of the chromosome. For instance, 
centromeric chromatin contains a unique H3 histone variant, called 
CENP-A in mammals, which replaces conventional H3 in a certain 
fraction of the centromeric nucleosomes and gives these nucleosomes 
unique properties. During the formation of mitotic chromosomes, the 
CENP-A-containing nucleosomes become situated on the outer sur- 
face of the centromere where they serve as the platform for the assem- 
bly of the kinetochore. The kinetochore, in turn, serves as the 
attachment site for the microtubules that separate the chromosomes 
during cell division (see Figure 14.16). Chromosomes lacking CENP-A 
fail to assemble a kinetochore and are lost during cell division. 

It has been suggested in previous chapters that DNA sequences 
responsible for essential cellular functions tend to be conserved. It 
came as a surprise, therefore, to discover that centromeric DNA 
exhibited marked differences in nucleotide sequence, even among 
closely related species. This finding suggests that the DNA sequence 
itself is not an important determinant of centromere structure and 
function, a conclusion that is strongly supported by the following 
studies on humans. Approximately one in every 2000 humans is 
born with cells that have an excess piece of chromosomal DNA that 
forms an additional diminutive chromosome, called a marker 
chromosome. In some cases, marker chromosomes are devoid of 
a-satellite DNA, yet they still contain a primary constriction and a 
fully functional centromere that allows the duplicated marker chro- 
mosomes to be separated normally into daughter cells at each division. 
Clearly, some other DNA sequence in these marker chromosomes 
becomes “selected” as the site to contain CENP-A and other 


FIGURE 12.26 Each mitotic chromosome has a centromere whose site 
is marked by a distinct indentation. Scanning electron micrograph of a 
mitotic chromosome. The centromere (C) contains highly repetitive DNA 
sequences (satellite DNA) and a protein-containing structure called the 
kinetochore that serves as a site for the attachment of spindle microtubules 
during mitosis and meiosis (discussed in Chapter 14). 


Source: From Jerome B. Rattner, Bioess. 13:51, 1991, © 1991. This material 
is used by permission of John Wiley & Sons. 
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marker chromosome in all of the person’s cells, indicating that the 
property is transmitted to the daughter chromosomes during cell 
division. In one study, marker chromosomes were found to be trans- 
mitted stably through three generations of family members. 


REVIEW | 


1. What is the difference in structure and function 
between the centromeres and telomeres of a 
chromosome? 


——— 


12.6 THE HUMAN PERSPECTIVE 


Chromosomal Aberrations and Human Disorders 


In addition to mutations that alter the information content of a 
single gene, chromosomes may be subjected to more extensive 
alterations that occur most commonly during cell division. Pieces 
of a chromosome may be lost or segments may be exchanged 
between different chromosomes. Because these chromosomal 
aberrations follow chromosomal breakage, their incidence is 
increased by exposure to agents that damage DNA, such as 
viral infection, X-rays, or reactive chemicals. In addition, the 
chromosomes of some individuals contain “fragile” sites that are 
particularly susceptible to breakage. Persons with certain rare 
inherited conditions, such as Bloom syndrome, Fanconi anemia, 
and ataxia-telangiectasia, have unstable chromosomes that 
exhibit a greatly increased tendency toward breakage. 

The consequences of a chromosomal aberration depend 
on the genes that are affected and the type of cell in which it 
occurs. If the aberration occurs in a somatic (nonreproductive) 
cell, the consequences are generally minimal because only a 
few cells of the body are usually affected. On rare occasions, 
however, a somatic cell carrying an aberration may be trans- 
formed into a malignant cell, which can grow into a cancerous 
tumor. Chromosomal aberrations that occur during meiosis— 
particularly as a result of abnormal crossing over—can be trans- 
mitted to the next generation. When an aberrant chromosome is 
inherited through a gamete, all cells of the embryo will have the 
aberration, which generally proves lethal during development. 
There are several types of chromosomal aberrations, including 
the following: 


e Inversions. Sometimes a chromosome breaks in two places, 
and the segment between the breaks becomes resealed into 
the chromosome in reverse orientation. This aberration is 
called an inversion. More than 1 percent of humans carry an 
inversion that can be detected during chromosome karyotyp- 
ing (see Figure 10.30b). A chromosome bearing an inversion 
usually contains all the genes of a normal chromosome, and 
therefore, the individual is not adversely affected. However, if a 
cell with a chromosome inversion enters meiosis, the aberrant 
chromosome cannot pair properly with its normal homologue 
because of differences in the order of their genes. In such 
cases, chromosome pairing is usually accompanied by a loop 
(FIGURE 1). If crossing over occurs within the loop, as shown 
in the figure, the gametes generated by meiosis either pos- 
sess an additional copy of certain genes (a duplication) or are 
missing those genes (a deletion). When a gamete containing 
an altered chromosome fuses with a normal gamete at fertiliza- 
tion, the resulting zygote has a chromosome imbalance and is 
often nonviable. 


Centromere 


FIGURE 11 The effect of inversion. Crossing over between a normal 
chromosome (purple) and one containing an inversion (orange) is 
usually accompanied by formation of a loop. The chromosomes that 
result from the crossover contain duplications and deficiencies, which 
are shown in the chromosomes at first meiotic division in the lower 
part of the figure. 


Translocations. When all or a piece of one chromosome 
becomes attached to another chromosome, the aberration is 
called a translocation (FIGURE 2). Like inversions, a translocation 
that occurs in a somatic cell generally has little effect on the 
functions of that cell or its progeny. However, certain trans- 
locations increase the likelihood that the cell will become 
malignant. The best studied example is the Philadelphia 
chromosome, which is found in the malignant cells (but not 
the normal cells) of individuals with certain forms of leukemia. 
The Philadelphia chromosome, which is named for the city 
in which it was discovered in 1960, is a shortened version of 
human chromosome number 22. For years, it was thought that 
the missing segment represented a simple deletion, but with 


FIGURE 2 A translocation. Micrograph shows a set of human 
chromosomes in which chromosome 12 (bright blue) has exchanged 
pieces with chromosome 7 (red). The affected chromosomes have been 
made fluorescent by in situ hybridization with a large number of DNA 
fragments that are specific for each of the two chromosomes involved. 
The use of these “stains” makes it very evident when one chromosome 
has exchanged pieces with another chromosome. 


Source: Courtesy of Lawrence Livermore Laboratory. From a technique 
developed by Joe Gray and Dan Pinkel. 


improved techniques for observing chromosomes, the missing 
genetic piece was found translocated to another chromosome 
(number 9). Chromosome number 9 contains a gene (ABL) that 
encodes a protein kinase that plays a role in cell proliferation. 
As a result of translocation, one small end of this protein is 
replaced by about 600 extra amino acids encoded by a gene 
(BCR) carried on the translocated piece of chromosome num- 
ber 22. This novel “fusion protein” retains the catalytic activity 
of the original ABL but is no longer subject to the cell's normal 
regulatory mechanisms. As a result, the affected cell becomes 
malignant and causes chronic myelogenous leukemia (CML). It 
has generally been assumed that translocations occur following 
the random breakage of chromosomal DNA. Recent studies, 
however, suggest that such breaks in the DNA may occur at 
sites during the normal process of transcription, an activity that 
may make the DNA more susceptible for breakage. Prostate 
cancer, for example, is characterized by translocations affecting 
a number of genes that are transcribed in normal prostate cells 


in response to male hormones (androgens). 


Like inversions, translocations cause problems during 
meiosis. A chromosome altered by translocation has a different 
genetic content from its homologue. As a result, the gametes 
formed by meiosis will either contain extra copies of genes or 
be missing genes. Translocations have been shown to play an 
important role in evolution, generating large-scale changes 
that may be pivotal in the branching of separate evolutionary 
lines from a common ancestor. Such a genetic incident prob- 
ably happened during our own recent evolutionary history. A 
comparison of the 23 pairs of chromosomes in human cells 
with the 24 pairs of chromosomes in the cells of chimpanzees, 
gorillas, and orangutans reveals a striking similarity. Close exam- 
ination of the two ape chromosomes that have no counterpart 
in humans reveals that together they are equivalent, band for 
band, to human chromosome number 2 (FIGURE 3). At some 
point during the evolution of humans, an entire chromosome 
was apparently translocated to another, creating a single fused 
chromosome and reducing the haploid number from 24 to 23. 


» Deletions. A deletion occurs when a portion of a chromosome 


is missing. As noted above, zygotes containing a chromosomal 
deletion are produced when one of the gametes is the product 
of an abnormal meiosis. Forfeiting a portion of a chromosome 
often results in a loss of critical genes, producing severe con- 
sequences, even if the individual's homologous chromosome is 
normal. Most human embryos that carry a significant deletion 
do not develop to term, and those that do exhibit a variety 
of malformations. The first correlation between a human disor- 
der and a chromosomal deletion was made in 1963 by Jerome 
Lejeune, a French geneticist who had earlier discovered the 
chromosomal basis of Down syndrome. Lejeune discovered 
that a baby born with a variety of facial malformations was 
missing a portion of chromosome 5. A defect in the larynx 
(voice box) caused the infant's cry to resemble the sound of a 
suffering cat. Consequently, the scientists named the disorder 
cri-du-chat syndrome, meaning cry-of-the-cat syndrome. 


e Duplications. A duplication occurs when a portion of a 


chromosome is repeated. The role of duplications in the 
formation of multigene families was discussed on page 390. 
More substantial chromosome duplications create a condition 
in which a number of genes are present in three copies rather 
than the two copies normally present (a condition called partial 
trisomy). Cellular activities are very sensitive to the number of 
copies of genes, and thus extra copies of genes can have serious 
deleterious effects. 


Human chromosome #2 


Human 
Chimpanzee 
Ql las) > 
Gorilla 
CH Tiassa 
FIGURE 3 Translocation and evolution. If the only 
two ape chromosomes that have no counterpart in 
humans are hypothetically fused, they match human _—_ Orangutan 
chromosome number 2, band for band. ila) ID 
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to Inheritance than DNA 


As described in Section 12.5, a-satellite DNA is not required for the 
development of a centromere. In fact, dozens of unrelated DNA 
sequences have been found at the centromeres of marker chromo- 
somes. It is not the DNA that indelibly marks the site as a centromere 
but the CENP-A-containing chromatin that is localized there. These 
findings raise a larger issue. Not all inherited traits are dependent on 
DNA sequences. Inheritance that is not encoded in DNA is referred 
to as epigenetic as opposed to genetic. The inactivation of the 
X chromosome discussed on page 469 is another example of an epi- 
genetic phenomenon: The two X chromosomes can have identical 
DNA sequences, but one is inactivated and the other is not. 
Furthermore, the state of inactivation is transmitted from each cell 
to its daughters throughout the life of the person. However, unlike 
genetic inheritance, an epigenetic state can usually be reversed; 
X chromosomes, for example, are reactivated prior to formation of 
gametes. 

Biologists have struggled to understand (1) the mechanisms by 
which epigenetic information is stored and (2) the mechanisms by 
which an epigenetic state can be transmitted from cell to cell and 
from parent to offspring. In this discussion, we will focus primarily 
on one type of epigenetic phenomenon: the state of a cell’s gene 
activity. Consider a stem cell residing at the base of the epidermis 
(as in Figure 7.1). Certain genes in these cells are transcriptionally 
active and others are repressed, and it is important that this charac- 
teristic pattern of gene activity is transmitted from one cell to its 
daughters. However, not all of the daughter cells continue life as 
stem cells; some of them take on a new commitment and begin the 
process of differentiation into mature epidermal cells. This step 
requires a change in the transcriptional state of that cell. Recent 
attention has focused on the histone code (page 470) as a critical 
factor in both the determination of the transcriptional state of a 
particular region of chromatin and its transmission to subsequent 
cellular generations. 

When the DNA of a cell is replicated, histones associated with 
the DNA as part of the nucleosomes are distributed randomly to the 
daughter cells along with the DNA molecules. As a result, each 
daughter DNA strand receives roughly half of the core histones that 
were associated with the parental strand (see Figure 13.23). The 
other half of the core histones that become associated with the 
daughter DNA strands are recruited from a pool of newly synthe- 
sized histone molecules. The modifications present on the histone 
tails in the parental chromatin are thought to determine the modifi- 
cations that will be introduced in the newly synthesized histones in 
the daughter chromatin. We saw on page 472, for example, that het- 
erochromatic regions of chromatin have methylated lysine residues 
at position 9 of histone H3. The enzyme responsible for this methyla- 
tion reaction is present as one of the components of heterochroma- 
tin. As heterochromatin is replicated, this histone methyltransferase 
is thought to methylate the newly synthesized H3 molecules that 
become incorporated into the daughter nucleosomes. In this way, the 
H3 methylation pattern of the chromatin, and thus its condensed 
heterochromatic state, is transmitted from the parent cell to its off- 
spring. In contrast, euchromatic regions tend to contain acetylated 
H3 tails, and this modification is also transmitted from parental 
chromatin to progeny chromatin, which may serve as the epigenetic 


mechanism by which active euchromatic regions are perpetuated in 
daughter cells. Histone modifications represent one carrier of epige- 
netic information, covalent modifications to the DNA are another 
type. This latter subject is taken up on page 500. 

Inappropriate changes in epigenetic state are associated with 
numerous diseases. There is also evidence to suggest that differences 
in physical appearance, disease susceptibility, and longevity between 
genetically identical twins may be due, in part, to epigenetic differ- 
ences that appear between the twins as they age. Some of these dif- 
ferences in the epigenomes of identical twins are presumed to be 
caused by differences in environmental conditions that the individu- 
als have experienced, and others are simply a result of random dif- 
ferences that are introduced during the many cell divisions that 
occur during a human lifetime. 
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1. What is the difference between genetic and 
epigenetic variation? Name one important form of 


epigenetic variation based on a histone modification. 
oe 


12.8 The Nucleus as an 
Organized Organelle 


Examination of the cytoplasm of a eukaryotic cell under the electron 
microscope reveals the presence of a diverse array of membranous 
organelles and cytoskeletal elements. Examination of the nucleus, on 
the other hand, typically reveals little more than scattered clumps of 
chromatin and one or more irregular nucleoli. As a result, research- 
ers were left with the impression that the nucleus is largely a “sack” 
of randomly positioned components. With the development of new 
microscopic techniques, including fluorescence in situ hybridization 
(FISH, page 386) and imaging of live GFP-labeled cells (page 697), it 
became possible to localize specific gene loci within the interphase 
nucleus. It became evident from these studies that the nucleus main- 
tains considerable order. For example, the chromatin fibers of a 
given interphase chromosome are not strewn through the nucleus 
like a bowl of spaghetti, but are concentrated into a distinct territory 
that does not overlap extensively with the territories of other chro- 
mosomes (FIGURE 12.27). 

The localization within the nucleus of a given chromosome ter- 
ritory may not be entirely random. In the micrograph shown in 
FIGURE 12.28, the chromatin that comprises human chromosome 
number 18 (shown in green) occupies a territory near the periphery 
of the nucleus, whereas the chromatin of chromosome number 19 
(shown in red) is localized more centrally within the organelle. This 
difference in nuclear location may be related to the levels of activity 
of these two chromosomes: Chromosome number 18 is relatively 
devoid of genes, whereas chromosome number 19 is rich in protein- 
coding sequences, many of which are presumably transcribed in 
these cells. A similar disposition is observed in the nuclei of women, 
where the inactive X chromosome is located at one edge of the 
nucleus and the active X chromosome is situated internally 
(Figure 12.17a). 

Even though genes are typically transcribed while they reside 
within their territories, individual chromatin fibers can extend away 


(a) 


FIGURE 12.27 Chromosome territories. (a) Three and (b) all 23 pairs of 
human chromosomes were detected using fluorescence in situ hybridiza- 
tion analysis similar to that described for Figure 12.22b that allows each 
human chromosome to be distinguished from others and represented by 
an identifiable color. Each chromosome is found to occupy a distinct 
territory within the nucleus. 

Source: (a) From Michael R. Hubner and David L. Spector, Courtesy of Irina 
Solovei; (b) From Michael R. Hubner and David L. Spector, Courtesy of Irina 
Solovei and Andreas Bolzer, Annual Review of Biophysics, Volume 39; 471, 
Fig. 1, 2010. Reprinted with permission of Annual Reviews, Inc. 


from these territories for considerable distances. Furthermore, DNA 
sequences that participate in a common biological response but 
reside on different chromosomes can apparently come together 
within the nucleus where they can influence gene transcription. 
Interactions between different loci have been revealed by the inven- 
tion of a variety of techniques that involve “chromosome conforma- 
tion capture (3C). In this approach, cells are fixed by treatment with 
formaldehyde, which causes DNA sequences residing in close prox- 
imity within the nucleus to become covalently cross-linked to one 


FIGURE 12.28 Localizing specific chromosomes within an interphase 
nucleus. Micrograph of the nucleus of a human lymphocyte (stained blue) 
that was subjected to dual-label fluorescence in situ hybridization to 
visualize chromosomes numbers 18 and 19, which appear as green and red, 
respectively. Chromosome 19, which contains a greater density of protein- 
coding genes than chromosome 18, tends to be more centrally located 
within the cell's nucleus. 


Source: From Jenny A. Croft et al., courtesy of Wendy A. Bickmore, J. Cell 
Biol. 145:1119, 1999, Fig. 1. Reproduced with permission of the Rockefeller 
University Press. 


another (they are said to be “captured”). After the fixation process, 481 


the DNA is isolated and subjected to digestion by restriction enzymes 
(Section 18.20), and the digestion products are analyzed to determine 
which DNA sequences in the genome interact with a given DNA 
sequence (the “bait” sequence) at the time of fixation. For example, a 
researcher might want to know which DNA sequences throughout 
the genome interact with the B-globin locus during the differentia- 
tion of erythrocytes in the bone marrow. DNA sequences that are 
found to interact using this technique are typically sequences that 
are present on the same chromosome. This is what you would expect, 
for example, between an enhancer and a promoter for the same gene, 
which come into close proximity as depicted in Figure 12.48. But 
numerous examples have also been found where the interacting 
DNA sequences are present on different chromosomes. A good 
example of such interchromosomal interactions comes from a study 
in which human cultured breast cells (both normal and malignant 
versions) were treated with the hormone estrogen (FIGURE 12.29). 
Estrogen induces the transcription of a large number of target genes 
through its binding to an estrogen receptor (ERa). Two of estrogen’s 
target genes in humans are GREB1, located on chromosome 2, and 
TFF1, located on chromosome 21. Prior to treating the cells with 
estrogen, the chromosome 2 and 21 territories are at distant loca- 
tions from one another, and the GREBI and TFF1 gene loci are 
tucked away within the interior of their respective chromosome ter- 
ritories (Figure 12.29a.). However, within minutes after the cells are 
exposed to estrogen, these two chromosome territories are reposi- 
tioned into close physical proximity to one another, and the two 
gene loci become co-localized on the periphery of their territories 
(Figure 12.29b). These studies support the idea that genes are physi- 
cally moved to sites within the nucleus called transcription factories, 
where the transcription machinery is concentrated, and that genes 
involved in the same response tend to become co-localized in the 
same factory (Figure 12.29c). A number of studies suggest that the 
movement of chromosomal loci within the nucleus is driven by 
actin-myosin interactions. More elaborate versions of the chromo- 
some conformation capture technique, known as 5C and Hi-C, allow 
interactions to be analyzed for many different genomic regions at the 
same time and have demonstrated the presence of thousands of 
long-range looping interactions, most of which involve interactions 
of genes with promotors located approximately 100 kb upstream. 
Mathematical techniques have been devised to use such genome- 
wide interaction data, combined with physical models of polymer 
folding, to compute the most likely structural arrangement of the 
genome inside the nucleus (FIGURE 12.30). The main result thus far 
from such studies is that the genome is packed into a series of 
regions, hundreds of kb in size, such that DNA within such a region 
tends to interact much more strongly with other DNA in the same 
region than it does with other parts of the genome. These regions are 
known as topologically associated domains (TADs). While it is 
hypothesized that genes contained within a TAD might be coordi- 
nately regulated, the actual impact of these higher levels of genome 
organization on gene expression is still not well understood and is an 
important area of current research. 

Another example of the interrelationship between nuclear 
organization and gene expression is illustrated in FIGURE 12.31a. 
This micrograph shows a cell that has been stained with a fluorescent 
antibody against one of the protein factors involved in pre-mRNA 
splicing. Rather than being spread uniformly throughout the 
nucleus, the processing machinery is concentrated within 20 to 50 
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FIGURE 12.29 Interactions can occur between distantly located genes in response to physiological stimuli. (a) Two cells derived from a breast cancer 
line that have not been treated with hormone. The chromosome 2 and 21 territories, which have been fluorescently labeled in each cell, are positioned 
independently of one another within the nucleus. (b) Two of the same types of breast cancer cells visualized 60 minutes after estrogen treatment. 
Interactions between the chromosome 2 and 21 territories are now evident. Close examination shows the co-localization of the TFF1 locus (in green) 

on chromosome 21 and the GREB7 locus on chromosome 2 (in red). In this study, approximately half of the hormone-treated cells exhibited biallelic 
interactions (i.e., both TFF1 alleles interacting with GREB1 alleles) as depicted here. (c) A drawing illustrating how genes on different regions of the same 
chromosome (A and B), or on different chromosomes (C and D), can come together within the nucleus. In some cases, DNA sequences from distant loci 
may influence one another's transcription activity, and in other cases these distant genes may simply share a common pool of proteins involved in the 
transcription process. 

Source: (a, b) From Qidong Hu et al., Courtesy of Michael G. Rosenfeld, Proc. Nat’! Acad. Sci. U.S.A. 105:19200, 2008, Fig. 2b and 2c. © 2008, National 
Academy of Sciences, U.S.A. 


irregular domains, referred to as “speckles.” According to current 
opinion, these speckles function as dynamic storage depots that 
supply splicing factors for use at nearby sites of transcription. The 
green dot in the nucleus in Figure 12.314 is a viral gene that is being 
transcribed near one of the speckles. The micrographs of Figure 
12.31b show a trail of splicing factors extending from a speckle 
domain toward a nearby site where pre-mRNA synthesis has recently 
been activated. The various structures of the nucleus, including 
nucleoli and speckles, are dynamic, steady-state compartments with 
their component parts constantly moving in and out of the struc- 
tures. If that activity is blocked, the compartment simply disappears 
as its materials are dispersed into the nucleoplasm. 

In addition to nucleoli and speckles, several other types of 
nuclear bodies (e.g., Cajal bodies, GEMs, and PML bodies) are often 
seen under the microscope. Each of these nuclear bodies contains 
large numbers of proteins that move in and out of the structure in a 
dynamic manner. Because none of these nuclear bodies are enclosed 
by a membrane, no special transport mechanisms are required for 
these large-scale movements. Various functions have been attributed 
to these nuclear structures, but they remain poorly defined. 
Moreover, they appear not to be essential for cell viability and will 
not be discussed further. 


FIGURE 12.30 Global packing of the genome within the nucleus. 

By combining measurements of chromosome interactions from Hi-C with 
physical models of polymers, it has been possible to reconstruct models for 
how the human genome is packed within the nucleus. The image here 
shows a theoretical model known as a fractal globule that accurately 


predicts statistical features of the Hi-C results, suggesting such an REVIEW | 

arrangement may represent the organization of the genome at the 

megabase scale. 1. Describe some of the observations that suggest that 
Source: From Erez Lieberman-Aiden et al., Science. 326:289-293, 2009, the nucleus is an ordered compartment. 


Reprinted with permission from AAAS. : 
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FIGURE 12.31 Nuclear compartmentalization of the cell's mRNA processing machinery. (a) The nucleus of a cell stained with fluorescent antibodies 
against one of the factors involved in processing pre-mRNAs. The mRNA processing machinery is localized to approximately 30 to 50 discrete sites, or 
“speckles.” The cell shown in this micrograph had been infected with cytomegalovirus, whose genes (shown as a green dot) are being transcribed near one 
of these domains. (b) Cultured cells were transfected with a virus, and transcription was activated by addition of cyclic AMP. Images are seen at various times 
after transcriptional activation. The site of transcription of the viral genome in this cell is indicated by the arrows. This site was revealed at the end of the 
experiment by hybridizing the viral RNA to a fluorescently labeled probe (indicated by the white arrow in the fourth frame). Pre-mRNA splicing factors 


(orange) form a trail from existing speckles in the direction of the genes being transcribed. 


Source: (a) From Tom Misteli and David L. Spector, Curr. Opin. Cell Biol. 10:324, 1998, with permission from Elsevier; (6) From Tom Misteli, Javier F. Caceres, 
and David L. Spector, Nature 387:525, 1997, reprinted by permission from Macmillan Publishers Ltd. 


12.9 An Overview of Gene 
Regulation in Eukaryotes 


In addition to possessing a genome containing tens of thousands of 
genes, complex plants and animals are composed of many different 
types of cells. Vertebrates, for example, are composed of hundreds of 
different cell types, each far more complex than a bacterial cell and 
each requiring a distinct battery of proteins and RNAs that enable it 
to carry out specialized activities. Researchers struggled for many 
years to demonstrate that every cell in a complex multicellular 
organism contained all of the genes required to become any other 
cell in that organism. Convincing evidence for this widely held 
hypothesis was finally obtained in the late 1950s by two investiga- 
tors, one working on plants and the other on animals. One of these 
key experiments was carried out by Frederick Steward and his col- 
leagues at Cornell University, who demonstrated that a root cell iso- 
lated from a mature plant could be induced to grow into a fully 
developed plant that contained all the various cell types normally 
present. In the second experiment, John Gurdon of Oxford University 
demonstrated that a nucleus could be removed from a differentiated 
intestinal cell of a Xenopus tadpole, transplanted into an egg whose 
own nucleus had been destroyed, and that the recipient egg contain- 
ing the donor nucleus could develop into a normal adult amphibian. 
All of the nuclei present in this cloned animal were shown to be 
derived from the nucleus taken from the somatic cell. In another, 
more recent landmark experiment, Ian Wilmot’s group in Scotland 
was able to clone a sheep (Dolly) by transplanting nuclei derived 
from cultured mammary gland cells into unfertilized eggs whose 
chromosomes had been removed (FIGURE 12.32). These experi- 
ments established that the entire set of genetic instructions are pre- 
sent in adult cells and can support the development of an entire 
organism under the right conditions. Since the birth of Dolly, 


researchers have successfully cloned over a dozen other mammals, 
including mice, cattle, goats, pigs, rabbits, and cats. Thus, it is not the 
presence or absence of genes in a cell that determines the properties 
of that cell, but how those genes are utilized. Those cells that become 
liver cells, for example, do so because they express a specific set of 
“liver genes, while at the same time repressing those genes whose 
products are not involved in liver function. 

The success of the cloning experiments described above led to 
the conclusion that the transcriptional state of a differentiated cell, 
which in turn is dependent on the epigenetic state of its chromatin, 
is not irreversible. During a cloning experiment, the nucleus of a dif- 
ferentiated cell is able to stop expressing the genes of the adult tissue 
from which it is taken and begin to selectively express the genes that 
are appropriate for the activated egg in which it suddenly finds itself. 
We can conclude from these cloning experiments that a nucleus 
from a differentiated cell can be reprogrammed by factors that reside 
in the cytoplasm of its new environment. The subject of selective 
gene expression, which resides at the heart of molecular biology, will 
occupy us for the remainder of this chapter. 

The average bacterial cell contains enough DNA to encode 
approximately 3000 polypeptides, of which about one-third are 
typically expressed at any particular time. Compare this to a human 
cell that contains enough DNA (6 billion base pairs) to encode 
several million different polypeptides. Even though the vast majority 
of this DNA does not actually contain protein-coding information, it 
is estimated that a typical mammalian cell manufactures at least 
5000 different polypeptides at any given time. Many of these poly- 
peptides, such as the enzymes of glycolysis and the electron carriers 
of the respiratory chain, are synthesized by virtually all the cells 
of the body. At the same time, each cell type synthesizes proteins 
that are unique to that differentiated state. It is these proteins, more 
than any other components, that give each cell type its unique 
characteristics. 
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FIGURE 12.32 The cloning of animals demonstrates that nuclei retain 
a complete complement of genetic information. In this experiment, an 
enucleated egg from a sheep of one breed was fused with a mammary 
gland cell from a female of another breed. The activated egg developed 
into a healthy lamb. Because all of the genes in the newborn lamb had to 
have been derived from the mammary cell nucleus (which was demon- 
strated by use of genetic markers), this experiment confirms the widely held 
belief that differentiated cells retain all of the genetic information originally 
present in the zygote. The primary difficulty that is generally encountered in 
nuclear transplantation experiments occurs when the nucleus from an active 
somatic (nongerm) cell is suddenly plunged into the cytoplasm of a 
relatively inactive egg. To avoid damaging the donor nucleus, the cultured 
cells were forced into a quiescent state (called G,) by drastically reducing 
the content of serum in the culture medium. 


Consider the situation that faces a developing red blood cell 
inside the marrow of a human bone. Of all the hundreds of different 
types of cells in the human body, only those in the lineage leading to 
red blood cells produce the protein hemoglobin. Moreover, hemo- 
globin accounts for more than 95 percent of a red blood cell’s pro- 
tein, yet the genes that encode the two hemoglobin polypeptides 
represent less than one-millionth of the developing cell’s total DNA. 
Not only does the cell have to find this genetic needle in the chromo- 
somal haystack, it has to regulate its expression to such a high degree 
that production of these few polypeptides becomes the dominant 
synthetic activity of the cell. Because the chain of events leading to 
the synthesis of a particular protein includes a number of discrete 
steps, there are several levels at which control might be exercised. We 


will discuss regulation of gene expression in eukaryotic cells at four 
distinct levels, as illustrated in the overview of FIGURE 12.33: 


1. Transcriptional control mechanisms determine whether a 
particular gene can be transcribed and, if so, how often. 


2. Processing control mechanisms determine the path by which 
the primary mRNA transcript (pre-mRNA) is processed into a 
messenger RNA that can be translated into a polypeptide. 


3. Translational control mechanisms determine whether a par- 
ticular mRNA is actually translated and, if so, how often and for 
how long a period. 


4. Posttranslational control mechanisms regulate the activity 
and stability of proteins. 


In the following sections of this chapter, we will consider each of 
these regulatory strategies in turn. 
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1. What are the four levels at which gene expression 
may be regulated in eukaryotic cells? 
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FIGURE 12.33 Eukaryotic gene regulation. Transcriptional-level controls 
operate by determining which genes are transcribed and how often; 
processing-level controls operate by determining which parts of the 
primary transcripts become part of the pool of cellular mRNAs; transla- 
tional-level controls regulate whether a particular mRNA is translated and, if 
so, how often and for how long, and posttranslational-level controls 
determine the longevity of specific proteins. 


12.10 Profiling Gene Activity 


As in prokaryotic cells, differential gene transcription is a key mech- 
anism by which eukaryotic cells activate or repress gene expression. 
Different genes are expressed by cells at different stages of embry- 
onic development, by cells in different tissues, and by cells that are 
exposed to different types of stimuli. An example of tissue-specific 
gene expression is shown in FIGURE 12.34. In this case, a gene encod- 
ing a muscle-specific protein is being transcribed in the cells of a 
mouse embryo that will give rise to muscle tissue. 

Every so often a new technology is invented that fundamentally 
changes the way certain types of problems in biology are addressed. 
Two such technologies are DNA microarrays (or “DNA chips”) 
and RNA sequencing (RNA-Seq). These techniques allow simul- 
taneous profiling of the activity of all the genes in an organism or cell 
type, as opposed to earlier techniques that were mostly restricted to 
analyzing just one or a few genes at a time. In both approaches, RNA 
is first isolated from cells, tissues, or whole organisms; the two 
techniques then diverge according to the means by which gene- 
expression patterns are analyzed. 


DNA Microarrays 


First, we will examine the use of a microarray to identify genes that are 
expressed differently between yeast strains grown either (1) in the 
presence of glucose or (2) as the glucose is depleted and the yeast begin 
to utilize ethanol as a carbon source (FIGURE 12.35). Figure 12.35a of 


FIGURE 12.34 Experimental demonstration of tissue-specific gene 
expression. Transcription of the myogenin gene is activated specifically in 
those parts of this 11.5-day-old mouse embryo (the myotome of the 
somites) that will give rise to muscle tissue. The photograph shows a 
transgenic mouse embryo that contains the regulatory region of the 
myogenin gene placed upstream from a bacterial p-galactosidase gene, 
which acts as a reporter. The B-galactosidase gene is commonly employed 
to monitor the tissue-specific expression of a gene because the presence of 
the enzyme f-galactosidase is easily revealed by the blue color produced in 
a simple histochemical test. The activation of transcription, which results 
from transcription factors binding to the regulatory region of the myogenin 
gene, has occurred in the blue-stained cells. 

Source: From T. C. Cheng et al., Courtesy of Eric N. Olson, J. Cell Biol. 119: 
1652, 1992, reproduced with permission of the Rockefeller University Press. 


this figure provides an outline of the basic steps taken in this type of 485 


experiment; these steps can be described briefly as follows: 


1. DNA fragments representing individual genes to be studied are 
generated using techniques discussed in Chapter 18. In the case 
depicted in step a at the bottom of Figure 12.35a, each well on the 
plate holds a small volume containing a specific, cloned yeast gene. 
Different wells contain different genes. The cloned DNAs are spot- 
ted, one at a time, in an ordered array on a glass slide by an auto- 
mated instrument that delivers a few nanoliters of a concentrated 
DNA solution onto a specific spot on the slide (step b). A completed 
DNA microarray is shown in step c. Using this technique, DNA 
fragments from thousands of different genes or even entire genomes 
can be spotted at known locations on a glass surface. 


2. Meanwhile, the mRNAs present in the cells being studied are 
purified (step 1, Figure 12.35a) and converted to a population of 
fluorescently labeled, complementary DNAs (cDNAs) (step 2). 
(The method for preparation of cDNAs is described in Figure 
18.46). In the example illustrated in Figure 12.35, and discussed 
below, cDNAs are prepared from two different cell populations, 
one labeled with a fluorescent green dye and the other with a 
fluorescent red dye. The two preparations of labeled cDNAs are 
mixed (step 3) and then incubated with the slide containing the 
immobilized DNA (step 4). 

3. Those DNAs in the microarray that have hybridized to a labeled 
cDNA are identified by examination of the slide under a fluores- 
cence microscope (step 5). Any spot in the microarray that 
exhibits fluorescence represents a gene that has been transcribed 
in the cells being studied. 


Figure 12.35b shows the actual results of this experiment. Each 
spot on the microarray of Figure 12.35b contains an immobilized 
DNA fragment from a different gene in the yeast genome. Taken 
together, the spots contain DNA from all of the roughly 6200 protein- 
coding genes present in a yeast cell. As discussed above, this particu- 
lar DNA microarray was hybridized with a mixture of two different 
cDNA populations. One population of cDNAs, which was labeled 
with a green fluorescent dye, was prepared from the mRNAs of yeast 
cells grown in the presence of a high concentration of glucose. Unlike 
most cells, when baker's yeast cells are grown in the presence of glu- 
cose and oxygen, they obtain their energy by glycolysis, rapidly con- 
verting the glucose into ethanol (Section 3.10). The other population 
of cDNAs, which was labeled with a red fluorescent dye, was prepared 
from the mRNAs of yeast cells that were growing aerobically in a 
medium rich in ethanol but lacking glucose. Cells growing under 
these conditions obtain their energy by oxidative phosphorylation, 
which requires the enzymes of the TCA cycle (page 173). The two 
cDNA populations were mixed, hybridized to the DNA probes on the 
microarray, and the slide examined with a fluorescence microscope. 
Genes that are active in one or the other growth media appear as 
either green or red spots in the microarray (step 5, Figure 12.35a,b). 
Those spots that remain devoid of color in Figure 12.35 represent 
genes that are not transcribed in these cells in either growth medium, 
whereas those spots that have a yellow fluorescence represent genes 
that are transcribed in cells in both types of media. A close-up of a 
small portion of the microarray is shown in the inset. 

The experimental results shown in Figure 12.35b identify those 
genes transcribed in yeast cells under two different growth condi- 
tions. But just as importantly, they also provide information about the 
abundance of individual mRNAs in the cells, which is proportional to 
the intensity ofa spot’s fluorescence. A spot that displays a very strong 
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FIGURE 12.35 DNA microarrays and analysis of gene expression. 

(a) Steps in the construction of a DNA microarray. Preparation of the cDNAs 
(i.e., DNAs that represent the mRNAs present in a cell) used in the 
experiment are shown in steps 1-3. Preparation of the DNA microarray is 
shown in steps a-c. The cDNA mixture is incubated with the microarray in 
step 4 and a hypothetical result is shown in step 5. The intensity of the color 
of the spot is proportional to the number of cDNAs bound there. (6) Results 
of an experiment using a mixture of cDNAs representing mRNAs 
transcribed from yeast cells (1) in the presence of glucose (green-labeled 
cDNAs) and (2) in the presence of ethanol after the glucose has been 
depleted (red-labeled cDNAs). Those spots displaying yellow fluorescence 
correspond to genes that are expressed under both growth conditions. The 
inset at the lower right shows a close-up of a small portion of the 
microarray. The details of this experiment are discussed in the text. (c) Plot 
showing the changes in glucose and ethanol concentrations in the media 
and in cell density during the experiment. Initially, the yeast cells consume 
glucose, then the ethanol they had produced by fermentation, and finally 
they cease growth after exhausting both sources of chemical energy. 

(d) Changes in expression of genes encoding TCA-cycle enzymes during 
the course of the experiment. Each horizontal row depicts the level of 
expression of a particular gene at different time periods. The names of the 
genes are provided on the right (see Figure 5.7 for the enzymes). Bright red 
squares indicate the highest level of gene expression, bright green squares 
indicate the lowest level of expression. Expression of genes encoding TCA- 
cycle enzymes is induced as the cells begin metabolizing ethanol and 
repressed when the ethanol has been exhausted. 

Source: (b-c): Courtesy of Patrick O. Brown. See Joseph DeRisi et al., Science 
278:680, 1997 and Tracy L. Ferea and Patrick O. Brown, Curr. Opin. Gen. 
Develop. 9:715, 1999. Current Opinion in Genetics & Development by Elsevier 
Ltd. Reproduced with permission of Elsevier Ltd. in the format reuse in a book/ 
textbook via Copyright Clearance Center; (d) Courtesy of Patrick O. Brown. 


green fluorescence, for example, represents a gene whose transcripts 
are abundant in yeast cells grown in glucose but is repressed in yeast 
cells grown in ethanol. Concentrations of individual mRNAs can vary 
by more than a 100-fold in yeast cells. The technique is so sensitive 
that an mRNA can be detected at a level of less than one copy per cell. 
Figure 12.35c shows the changes in the concentration of glucose 
and ethanol over the course of an experiment. The glucose is rapidly 
metabolized by the yeast cells, leading to the disappearance of the 
sugar within a few hours. The ethanol produced by glucose fermen- 
tation is then gradually metabolized by the yeast cells over the next 
five days until it eventually disappears from the medium. Figure 
12.35d shows the changes in the level of expression of the genes 
encoding the enzymes of the TCA cycle during the course of this 
experiment. The levels of expression of each gene (labeled on the 
right) are shown in intervals of time (labeled at the top), using shades 
of red to represent increases in expression, and shades of green to 
represent decreases in expression. It is evident that transcription of 
genes encoding TCA enzymes is stimulated when the cells adapt to 
growth on a carbon source (ethanol) that is metabolized by aerobic 
respiration, and then repressed when the ethanol is exhausted. 
DNA microarrays are currently being employed to study 
changes in gene expression that occur during a wide variety of bio- 
logical events, such as cell division and the transformation of a nor- 
mal cell into a malignant cell. It is even possible to study the diversity 
of RNAs being produced by a single tumor cell, once the cDNAs are 
amplified by PCR. Now that a number of eukaryotic genomes have 
been sequenced, researchers have an unlimited variety of genes 
whose expression can be monitored under different conditions. 


*Differences in mRNA abundance likely reflect differences in mRNA stability (page 506) 
as well as differences in rates of transcription. Consequently, results obtained from DNA 
microarrays cannot be interpreted solely on the basis of transcriptional-level control. 


RNA Sequencing 


In recent years, microarrays are gradually being supplanted by a new 
technology called RNA sequencing (RNA-Seq) that depends on 
sequencing small fragments of cDNAs derived from RNA, providing 
the ability to globally analyze gene-expression patterns. Instead of 
hybridizing cDNA fragments to DNA chips, as shown in the micro- 
array experiment above, the cDNA fragments are directly sequenced 
and computational approaches are then used to map the fragments 
back to the genome. This approach allows researchers to identify 
each fragment, reassemble the fragments into complete transcripts, 
and quantify the exact levels of each of the RNA transcripts, provid- 
ing a profile of the gene-expression patterns of the cells being stud- 
ied. The latest sequencing technologies allow accurate analyses 
starting from just a single cell's worth of RNA. 

FIGURE 12.36 shows an experiment that combined both DNA 
microarrays and RNA sequencing to identify differentially expressed 
genes between breast cancer cell lines that either express the estrogen 


ER- 
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FIGURE 12.36 Transcription profiling to personalize breast cancer 
therapy. Microarrays and RNA-sequencing technologies were used to 
examine gene-expression patterns in 76 estrogen receptor positive and 53 
estrogen receptor negative tumors. Each column of tiny squares within the 
microarray shows the results from a single cancer patient. The data from 
ER+ patients are shown on the left and from ER— patients on the right. A 
total of 179 genes (listed in the column to the right of the microarray) were 
identified that are differentially expressed between the two groups. The 
ability to define a molecular portrait of tumors that will respond to hormone 
therapy can personalize breast cancer treatment. 


Source: From Zhifu Sun et al., Courtesy of E. Aubrey Thompson, PLoS ONE 
6: E17490, 2011. 
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protocols differ depending on whether a particular breast cancer 
expresses the ER. The experimental results shown in Figure 12.36 
examined 129 different breast cancer tumors for the expression of 
149 genes that segregate between ER— and ER+ tumors. The differ- 
ences are represented by changes in blue (low expression) or red 
(high expression) and demonstrate that such analyses can identify 
the genotype of a breast cancer biopsy, which provides physicians an 
opportunity to personalize therapy. These technological approaches 
have many potential uses in addition to providing a visual portrait of 
gene expression. For example, they can be used to determine the 
degree of genetic variation in human populations or to identify the 
alleles for particular genes that a person carries. It is hoped, one day, 
that this latter information may advise people of the diseases to 
which they may be susceptible during their life, giving them an 
opportunity to take early preventive measures. 


REVIEW | 


1. What steps are similar between microarray analysis 


and RNA-seq, and what steps are different? 
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12.11 The Role of Transcription 
Factors in Regulating Gene 
Expression 


Great progress has been made in elucidating how certain genes can 
be transcribed in a particular cell, while other genes remain inactive. 
Transcriptional control is orchestrated by a large number of pro- 
teins, called transcription factors. As discussed in Chapter 11, 
these proteins can be divided into two functional classes: general 
transcription factors that bind at core promoter sites in association 
with RNA polymerase (page 417) and sequence-specific transcrip- 
tion factors that bind to various regulatory sites of particular genes. 
This latter group of transcription factors can act either as transcrip- 
tional activators that stimulate transcription of the adjacent gene or 
as transcriptional repressors that inhibit transcription. We will focus 
in this section on transcriptional activators, whose job is to bind to 
specific regulatory sites within the DNA and initiate the recruitment 
of a large number of protein complexes that bring about the actual 
transcription of the gene itself. 

We know the detailed structure of many transcription factors 
and how they interact with their target DNA sequences. Single genes 
are usually controlled by many different DNA regulatory sites that 
bind a combination of different transcription factors (FIGURE 12.37). 
Conversely, a single transcription factor may bind to numerous sites 
around the genome, thereby controlling the expression of a host of 
different genes. Each type of cell has a characteristic pattern of gene 
transcription, which is determined by the particular complement of 
transcription factors contained in that cell. 

The control of gene transcription is complex, regulated by the 
presence of multiple binding sites for transcription factors and the 
binding affinity for these transcription factors to each sequence. 
Transcription factors have preferred, high-affinity binding sites. A 
region upstream of a given gene might contain one or more pre- 
ferred, high-affinity sites or might contain sites with a slightly differ- 
ent sequence that bind with lower affinity. The binding of multiple 
transcription factors is usually required to activate transcription. An 


FIGURE 12.37 Combinatorial control of transcription. Transcription of 
the Oct4 gene requires the action of multiple transcription factors that bind 
upstream of the start site of transcription. Oct4 is the single-most important 
(i.e., least dispensable) gene in maintaining the pluripotent state in 
embryonic stem cells. Note that the Oct4 transcription factor has a role in 
regulating Oct4 gene transcription. 

Source: From Ng Huck-Hui and M. A. Surani, Nature Cell Biology 13:490, 
2011. Nature Cell Biology by Nature Publishing Group. Reproduced with 
permission of Nature Publishing Group in the format reuse in a book/ 
textbook via Copyright Clearance Center. 


example of this type of cooperative interaction between neighboring 
transcription factors is illustrated in FIGURE 12.38 and discussed fur- 
ther on page 491. In a sense, the regulatory region of a gene can be 
thought of as a type of integration center for that gene’s expression. 
Cells exposed to different stimuli respond by synthesizing different 
transcription factors, which bind to different sites in the DNA. The 
extent to which a given gene is transcribed depends on the particular 
combination of transcription factors bound to upstream regulatory 
elements. Given that roughly 5 to 10 percent of genes encode tran- 
scription factors, it is apparent that a virtually unlimited number of 
possible combinations of interactions among these proteins is pos- 
sible. When this is coupled to the fact that the binding sites for these 
factors vary from gene to gene, the combination of the presence or 
absence of a given factor and variable binding affinity for each factor 


FIGURE 12.38 Interactions between transcription factors bound to 
different sites in the regulatory region of a gene. This image depicts the 
tertiary structure of two separate transcription factors, NFAT-1 (green) and 
AP-1 (whose two subunits are shown in red and blue) bound to the DNA 
upstream from a cytokine gene involved in the immune response. 
Cooperative interaction between these two proteins alters the expression 
of the cytokine gene. The gray bar traces the path of the DNA helix, which 
is bent as the result of this protein-protein interaction. 


Source: From Tom K. Kerppola, Structure 6:550, 1998, with permission from 
Elsevier. 


allows for precise variation in gene-expression patterns between 
cells of different type, different tissue, different stage of develop- 
ment, and different physiologic state. 

The phenotype of a differentiated cell, such as a fibroblast or an 
epithelial cell, is generally quite stable. However, such cells can be 
induced to adopt new phenotypes if they are forced to express certain 
genes that they would not normally express. It was shown in the late 
1980s, for example, that it was possible to induce connective tissue 
fibroblasts to become muscle cells by forcing the cells to express a sin- 
gle tissue-specific transcription factor, MyoD (shown in Figure 12.42a). 
As a result of this and other experiments, MyoD became known as a 
“master regulatory factor” that plays a key role in directing the differ- 
entiation of muscle tissue in developing embryos. Another example of 
the “power” of transcription factors in the determination of a cell’s phe- 
notype is shown in FIGURE 12.39, which shows the dramatic result of 
an experiment in which a fruit fly gene called eyeless has been expressed. 
in cells that would not normally express this gene—in this case, cells of 
the developing leg. Expression of eyeless activates a developmental 
pathway that leads to the formation ofan eye situated in a very unusual 
location of the body. Like MyoD in vertebrate muscle development, the 
transcription factor Eyeless can be considered to act as a master regula- 
tory factor in the development of eyes in these insects. 

The most impressive experiments to test the power of transcrip- 
tion factors in directing phenotypic transformations have focused on 
embryonic stem (ES) cells. As discussed in the Human Perspective in 
Chapter 1 (Section 1.6), embryonic stem (ES) cells appear very early 
in the development of a mammalian embryo and exhibit two key 
properties: They are (1) capable of indefinite self-renewal and 
(2) pluripotent; that is, capable of differentiating into all of the 
different types of cells in the body. It had been known for several years 
that these pluripotent stem cells had a particular set of transcription 
factors that played an important role in maintaining their self-renewing, 
pluripotent state. Just how important these transcription factors were 
in the biology of ES cells was demonstrated by Shinya Yamanaka and 
Kazutoshi Takahashi in 2006 when they introduced the genes for 
24 various transcription factors into adult mouse fibroblasts. They 


FIGURE 12.39 Phenotypic conversion induced by abnormal expression 
of a single transcription factor . The leg of this fruit fly bears a fully formed 
eye that has developed as the result of the forced expression of the eyeless 
gene within cells of the leg primordium during the development of this insect. 


Source: Courtesy of Walter Gehring, Universitat Basel. 
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only four specific transcription factors—Oct4, Sox2, Myc, and Klf4— 
was sufficient to reprogram the differentiated fibroblasts, converting 
them into undifferentiated cells that behaved like pluripotent ES cells. 
The cellular reprogramming process does not occur within a single 
cell generation but occurs gradually as the cells divide in culture. As 
the cells become reprogrammed, the four introduced genes are 
silenced as the cells’ endogenous genes governing pluripotency are 
activated. Reprogramming is accompanied by a reorganization of the 
chromatin in such a way that, for the most part, the epigenetic marks 
(histone modifications and DNA methylation patterns) of the differ- 
entiated cell are “erased” and the epigenetic marks characteristic of an 
ES cell are installed.° As discussed in Section 1.6, the induced pluripo- 
tent cells, or iPS cells as they are called, are capable of dividing indefi- 
nitely in culture and of differentiating into all of the various types of 
the body’s cells. Each pathway of differentiation, whether it leads to 
formation of a neuron, a fibroblast, or a muscle cell, is accompanied by 
changes in the epigenetic marks that characterize the chromatin of the 
cells along that pathway. These epigenetic changes have the effect of 
restricting the developmental potential of the cells along each path- 
way, making them less capable of developing into other types of cells. 


REVIEW | 


1. How do transcriptional activators and repressors 


differ in terms of their effect on gene expression? 
e 


12.12 The Structure of 
Transcription Factors 


The three-dimensional structure of numerous DNA-protein com- 
plexes has been determined by X-ray crystallography and NMR spec- 
troscopy, providing a basic portrait of the way that these two 
macromolecules interact with one another. Like most proteins, tran- 
scription factors contain different domains that mediate different 
aspects of the protein's function. Transcription factors typically con- 
tain at least two domains: a DNA-binding domain that binds to a spe- 
cific sequence of base pairs in the DNA and an activation domain that 
regulates transcription by interacting with other proteins. In addition, 
many transcription factors contain a surface that promotes the bind- 
ing of the protein with another protein of identical or similar structure 
to form a dimer (FIGURE 12.40). The formation of dimers has proved to 
be a common feature of many different types of transcription factors 
and plays an important role in regulating gene expression. 

The DNA-binding domains of most transcription factors can be 
grouped into several broad classes whose members possess related 
structures (motifs) that interact with DNA sequences. The existence 
of several families of DNA-binding proteins indicates that evolution 
has found a number of different solutions to the problem of con- 
structing polypeptides that can bind to the DNA double helix. As we 
will see shortly, most of these motifs contain a segment (often an 
a helix, as in Figure 12.40) that is inserted into the major groove of 
the DNA, where it recognizes the sequence of base pairs that line the 
groove. Binding of the protein to the DNA is accomplished by a 


This statement is qualified with the words “for the most part” because recent studies 
indicate that the reprogramming process is not complete. Instead, the iPS cells retain 
some of the epigenetic marks of the differentiated cells from which they are derived, 
which indicates that they are not truly equivalent to embryonic stem cells. 


$10}De4 UOI}DsUed) JO aINJONIIS SUL e ZL ‘ZL 


pS 
o 
j=) 


éL YALdVHD 


UOISSAIdXW SUSH JO |O1]UOD e 


O Interaction between a transcription factor and its DNA 
target sequence. A model of the interaction between the two DNA- 
binding domains of the dimeric glucocorticoid receptor (GR) and the target 
DNA. Two a helices, one from each subunit of the dimer, are symmetrically 
extended into two adjacent major grooves of the target DNA. Residues in 
the dimeric GR that are important for interactions between the two 
monomers are colored green. The four zinc ions (two per monomer) are 
shown as purple spheres. 

Source: From T. Hard et al., Courtesy of Robert Kaptein, Science 249:159, 
1990; © 1990, reprinted with permission from AAAS. 


specific combination of van der Waals forces, ionic bonds, and 
hydrogen bonds between amino acid residues and various parts of 
the DNA, including the backbone. 

Among the most common motifs that occur in eukaryotic 
DNA-binding proteins are the zinc finger, the helix-loop-helix, and 
the leucine zipper. Each provides a structurally stable framework on 
which the specific DNA-recognition surfaces of the protein can be 
properly positioned to interact with the double helix. 


The Zinc-Finger Motif 


The largest class of mammalian transcription factors contains a 
motif called the zinc finger. In most cases, a zinc ion of each finger is 
coordinated between two cysteines and two histidines. The two 
cysteine residues are part of a two-stranded f sheet on one side of the 
finger, and the two histidine residues are part of a short a-helix on 
the opposite side of the finger (inset, ). These proteins 
typically have a number of such fingers that act independently of one 


Zinc-finger transcription factors. (a) Crystal structure of 
the complex between a protein with five zinc fingers (called 2GLI) and 
DNA. The protein chain is shown in green; the DNA helix is orange and 
blue. The inset shows the structure of a single zinc finger. (b) A model of 
TFIIIA bound to the DNA of the 5S RNA gene. TFIIIA is required for the 
transcription of the 5S rRNA gene by RNA polymerase III. 

Source: (a) This work is licensed under the Creative Commons Attribution- 
ShareAlike 3.0 License. (b) From N.P. Pavietich and C.O. Pabo, Science 
261:1702, 1993, Reprinted with permission from AAAS. 


another and are spaced apart so as to project into successive major 
grooves in the target DNA, as illustrated in Figure 12.41a. The first 
zinc-finger protein to be discovered, TFIIA, has nine zinc fingers 
(Figure 12.41b). Other zinc-finger transcription factors include Egr, 
which is involved in activating genes required for cell division, and 
the GATA family of transcription factors, which are involved in mul- 
tiple developmental events, including cardiac muscle development. 
Comparison of a number of zinc-finger proteins indicates that the 
motif provides the structural framework for a wide variety of amino 
acid sequences that recognize a diverse set of DNA sequences. In 
fact, researchers are attempting to design new species of zinc-finger 
proteins that are capable of targeting DNA sequences that govern the 
expression of particular genes of interest. It is hoped that such pro- 
teins might have the potential to act as therapeutic drugs by turning 
on or off the expression of disease-related genes. 

The Helix-Loop—Helix (HLH) Motif 

As the name implies, this motif is characterized by two «-helical seg- 
ments separated by an intervening loop. The HLH domain is often pre- 
ceded by a stretch of highly basic amino acids whose positively charged 
side chains contact the DNA and determine the sequence specificity of 


(a) 


(b) 


FIGURE 12.42 Basic helix-loop-helix (bHLH) transcription factors. 

(a) MyoD, a dimeric transcription factor involved in triggering muscle cell 
differentiation, is a bHLH protein that binds to the DNA by an associated 
basic region. The basic region of each MyoD monomer is red, whereas the 
helix-loop-helix region of each MyoD monomer is shown in brown. The 
DNA bases bound by the transcription factor are indicated in yellow. 

(b) A sketch of the dimeric MyoD complex in the same orientation as in 
part a. The a helices are represented as cylinders. 

Source: (a) From Philip C.M. et al., Courtesy of Carl O. Pabo, Cell 
77:453,1994; Cell by Cell Press. Reproduced with permission of Cell Press 
in the format reuse in a book/textbook via Copyright Clearance Center. 


the transcription factor. Proteins with this basic- HLH (or bHLH) motif 
always occur as dimers, as illustrated in the example of the transcrip- 
tion factor MyoD in FIGURE 12.42. The two subunits of the dimer are 
usually encoded by different genes, making the protein a heterodimer. 

Heterodimerization greatly expands the diversity of regulatory 
factors that can be generated from a limited number of polypeptides 
(FIGURE 12.43). Suppose, for example, that a cell were to synthesize 


FIGURE 12.43 Modifying the DNA-binding specificities of 
transcription factors through dimerization. In this model of a 
bHLH protein, three different dimeric transcription factors that 
recognize different DNA-binding sites can be formed when the 
two subunits associate in different combinations. The human 
genome encodes approximately 118 different bHLH mono- 


five different bHLH-containing polypeptides that were capable of 491 


forming heterodimers with one another in any combination, then 32 
(2°) different transcription factors that recognize 32 different DNA 
sequences could conceivably be formed. In actuality, combinations 
between polypeptides are restricted, not unlike the formation of het- 
erodimeric integrin molecules (page 232). 

HLH-containing transcription factors play a key role in the dif- 
ferentiation of certain types of tissues, including skeletal muscle (as 
illustrated in Figure 12.34). HLH-containing transcription factors 
also participate in the control of cell proliferation and have been 
implicated in the formation of certain cancers. It was noted on page 
479 that chromosome translocations can generate abnormal genes 
whose expression causes the cell to become cancerous. Genes encod- 
ing at least four different bHLH proteins (MYC, SCL, LYL-1, and 
E2A) have been found in chromosome translocations that lead to 
the development of specific cancers. The most prevalent of these 
cancers is Burkitt's lymphoma in which the MYC gene on chromo- 
some 8 is translocated to a locus on chromosome 14 that contains a 
regulatory site for a gene encoding part of an antibody molecule. 
The overexpression of the MYC gene in its new location is thought to 
be a significant factor in development of this lymphoma. 


The Leucine Zipper Motif 


This motif gets its name from the fact that leucines occur every sev- 
enth amino acid along a stretch of o helix. Because an « helix repeats 
every 3.5 residues, all of the leucines along this stretch of polypep- 
tide face the same direction. Two « helices of this type are capable of 
zipping together to form a coiled coil. Thus, like most other tran- 
scription factors, proteins with a leucine zipper motif exist as dimers. 
The leucine zipper motif can bind DNA because it contains a stretch 
of basic amino acids on one side of the leucine-containing « helix. 
Together the basic segment and leucine zipper are referred to as a 
bZIP motif. Thus, like bHLH proteins, the a-helical portions of bZIP 
proteins are important in dimerization, while the stretch of basic 
amino acids allows the protein to recognize a specific nucleotide 
sequence in the DNA. AP-1, whose structure and interaction with 
DNA are shown in Figure 12.38, is an example of a bZIP transcrip- 
tion factor. AP-1 is a heterodimer whose two subunits (Fos and Jun, 
shown in red and blue in Figure 12.38, respectively) are encoded by 
the genes FOS and JUN. Both of these genes play an important role 
in cell proliferation and, when mutated, can contribute to a cell 
becoming malignant. Mutations in either of these genes that prevent 
the proteins from forming heterodimers also prevent the proteins 
from binding to DNA, indicating the importance of dimer forma- 
tion in regulating their activity as transcription factors. 
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1. What are some of the structural properties that tend 


to be found in several groups of transcription factors? 
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mers, which could potentially generate thousands of different < 
dimeric transcription factors. 


A-A homodimer 


A-B heterodimer 


B-B homodimer 
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in Regulating Transcription 


The complexity inherent in the control of gene transcription can be 
illustrated by examining the DNA in and around a single gene. In the 
following discussion, we will focus on the gene that encodes phospho- 
enolpyruvate carboxykinase (PEPCK). PEPCK is one of the key 
enzymes of gluconeogenesis, the metabolic pathway that converts 
pyruvate to glucose (see Figure 3.31). The enzyme is synthesized in 
the liver when glucose levels are low, as might occur, for example, 
when a considerable period of time has passed since one’s last meal. 
Conversely, synthesis of the enzyme drops sharply after ingestion of a 
carbohydrate-rich meal, such as pasta. The level of synthesis of PEPCK 
mRNA is controlled by a variety of different transcription factors 
(FIGURE 12.44), including a number of receptors for hormones that 
are involved in regulating carbohydrate metabolism. The keys to 
understanding the regulation of PEPCK gene expression lie in (1) 
unraveling the functions of the numerous DNA regulatory sequences 
that reside upstream from the gene itself, (2) identifying the transcrip- 
tion factors that bind these sequences, and (3) elucidating the signal- 
ing pathways that activate the machinery responsible for selective 
gene expression (discussed in Chapter 15). 

The closest (most proximal) regulatory sequence upstream of 
the PEPCK gene is the TATA box, which is a major element of the 
gene’s promoter (FIGURE 12.45). As discussed on page 408, a pro- 
moter is a region upstream from a gene that regulates the initiation 
of transcription. For our purposes, we will divide the eukaryotic pro- 
moter into separate regions, which are not that well delineated. The 
region that stretches from roughly the TATA box to the transcription 
start site is designated the core promoter. The core promoter is the 
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FIGURE 12.45 Identifying promoter sequences required for transcription. The top 
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FIGURE 12.44 Regulating transcription from the rat PEPCK gene. 
Transcription of this gene, like others, is controlled by a combination of 
transcription factors that interact with specific DNA sequences located in a 
regulatory region upstream from the gene's coding region. Included within 
this region is a glucocorticoid response element (GRE) that, when bound by 
a glucocorticoid receptor, stimulates transcription from the promoter. Also 
included within the regulatory region are binding sites for a thyroid hormone 
receptor (labeled TRE); a protein that binds cyclic AMP, which is produced in 
response to the hormone glucagon (labeled CRE-1); and the hormone 
insulin (labeled IRE). A number of other transcription factors are also seen to 
bind to regulatory sites in this region upstream from the PEPCK gene. 
Source: From S. E. Nizielski et al., J. Nutrition 126:2699, 1996; © American 
Society for Nutritional Sciences. 


site of assembly of a preinitiation complex consisting of RNA 
polymerase II and a number of general transcription factors (GTFs) 
that are required before a eukaryotic gene can be transcribed. The 
core promoter elements, and the GTFs that bind to them (shown in 
Figure 11.18), define the exact site at which transcription will start 
(the +1 site). 

The TATA box is not the only short sequence that is shared by 
large numbers of genes. Two other sequences, called the CAAT box 


Core promoter elements 
i 


-20 -10 
ii.) | 


— 


TATA Box 


ine shows the nucleotide sequence of one strand of the PEPCK 
five lines show the results of experiments in which particular regions of the 


promoter were deleted (indicated by black boxes). The observed level of transcription of the PEPCK gene in each of these cases is indicated at the right. 
Deletions that remove all or part of the three boxes cause a marked decrease in the level of transcription, whereas deletions that affect other regions have 
either a lesser or no effect. (As noted in Chapter 11, many mammalian promoters lack the TATA box. Promoters lacking the TATA box often have a con- 


served element at a position downstream of the start site, called the downstream promoter element, or DPE.) 


Source: Data taken from studies by Richard W. Hanson and Daryl K. Granner and their colleagues. 


and GC box, which are located farther upstream from the gene, are 
often required for RNA polymerase II to initiate transcription. The 
CAAT and GC boxes bind transcription factors (e.g., NF1 and SP1) 
that are found in many tissues and widely employed in gene expres- 
sion. Whereas the TATA box determines the site of initiation of 
transcription, the CAAT and GC boxes are two of many sequences 
that regulate the frequency with which RNA polymerase II initiates 
transcription. The TATA box is within the core promoter, usually 
within 25-30 bases from the start site of transcription. The CAAT 
and GC boxes, when present, are typically located within 100-150 
base pairs upstream from the transcription start site, a region 
referred to as the proximal promoter region, as indicated in the top 
line of Figure 12.45. 

The more genes that are examined, the more variability that is 
found in the nature and locations of the promoter elements that reg- 
ulate gene expression. In addition, a significant number of mamma- 
lian genes (possibly as many as 50 percent of them) have more than 
one promoter (i.e., alternative promoters) allowing initiation of tran- 
scription to occur at more than one site upstream from a gene. 
Alternative promoters are typically separated from one another by 
several hundred bases so that the primary transcripts produced by 
their differential usage will differ considerably at their 5’ ends. In 
some cases, alternative promoters are utilized in different tissues, but 
they lead to synthesis of the same polypeptide. In such cases, the 
mRNAs that encode the proteins are likely to have different 5’ UTRs, 
which makes them subject to distinct types of translational-level 
control. In other cases, alternative promoters promote the synthesis 
of related proteins that differ in their N-terminal amino acid 
sequence. In a number of cases, alternative promoters govern the 
transcription of mRNAs that are translated in different reading 
frames (page 442) to produce entirely different polypeptides. 

How do researchers learn which sites in the genome interact 
with a particular transcription factor? The following general strate- 
gies are often employed to make this determination. 


e Deletion mapping. In this procedure, DNA molecules are 
prepared that contain deletions of various parts of the gene’s 
promoter (Figure 12.45). The altered DNAs are then introduced 
into cells and the ability of the deletion mutants to initiate 
transcription is measured. In many cases, the deletion of a few 
nucleotides has little or no effect on the transcription of an 
adjacent gene. However, if the deletion falls within a region 
that prevents binding of transcription factors that activate 
transcription, the level of transcription will decrease (as in lines 
2, 4, and 5 of Figure 12.45). Deletion of other regions might 
increase transcription, which would indicate that a transcription 
factor binds to this region that inhibits transcription, in 
other words, a transcription repressor. Finally, deletion of 
other regions might have little to no effect (lines 3 and 6 of 
Figure 12.45). 


e DNA footprinting. When a transcription factor binds to a 
DNA sequence, the presence of the protein protects the DNA 
sequence from digestion by nucleases. Researchers take advan- 
tage of this property by isolating chromatin from cells and 
treating it with DNA-digesting enzymes, such as DNase I. 
Regions that are not protected by proteins are digested, whereas 
those with bound proteins are protected. Once the chromatin 
has been digested, the bound protein is removed, and the 
protected DNA sequences are identified. 


e Genome-wide location analysis. As the name implies, this 
strategy allows researchers to simultaneously monitor all of the 
sites within the genome that are bound by a given transcription 
factor under a given set of physiologic conditions. An outline 
of the approach is shown in FIGURE 12.46. To carry out this 
analysis, cells are cultured under the desired conditions or 
isolated from a particular tissue or developmental stage, and 
then treated with an agent, such as formaldehyde, that will kill 
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FIGURE 12.46 Use of chromatin immunoprecipitation (ChIP) to 
identify transcription-factor binding sites on a global scale. The steps 
are described in the text. 
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the cells and cross-link transcription factors to the DNA sites at 
which they were bound in the living cell (step 1, Figure 12.46). 
Following the cross-linking step, chromatin is isolated, 

sheared into small fragments, and incubated with an anti- 
body that binds to the transcription factor of interest (step 2). 
Binding of the antibody leads to the precipitation of chromatin 
fragments containing the bound transcription factor (step 3a), 
while leaving all of the unbound chromatin fragments in 
solution (step 3b). Once this process of chromatin immunopre- 
cipitation (or ChIP) has been carried out, the cross-links 
between the protein and DNA in the precipitate can be 
reversed and the segments of DNA can be purified (step 4). 
The next step is to identify where in the genome these transcrip- 
tion-factor binding sites are located. Two different approaches 
can be taken to make this determination: either the purified 
DNA is fluorescently labeled and used in microarrays (steps 5-7) 
in a technique called ChIP-chip, or the purified DNA can be 
directly sequenced using massively parallel sequencing 
approaches (step 5a) in a technique called ChIP-seq. Unlike 

the microarray shown in Figure 12.35, which contains DNA 
probes representing protein-coding genes, the microarrays 
used in these ChIP-chip experiments contain DNA probes 
from across the entire genome. Whether by microarray or by 
direct sequencing, the experiments enable identification of 
specific binding sites across the entire genome for any factor 

of interest. 


Interestingly, when these types of experiments are performed 
with mammalian transcription factors, a significant percentage of 
the DNA binding sites are located at considerable distances from 
known gene promoters. It is speculated that some of these sites are 
involved in regulating transcription of noncoding RNAs, such as the 
microRNAs discussed on page 434. It is also likely that many of the 
thousands of genetic loci identified in these genome-wide screens 
have little or no real importance in the regulation of gene expression. 
One way to assess the likely importance of the binding of a transcrip- 
tion factor is to consider the data in a larger context. For example, a 
number of transcription factors, such as Oct4, Sox2, and Nanog, are 
important in maintaining the pluripotent state of embryonic stem 
cells (page 18). When tested alone, each of these three transcription 
factors are bound to several thousand sites within the genome of 
embryonic stem cells. In contrast, only a small fraction of these col- 
lected genomic regions is bound by all three of these key transcrip- 
tion factors. The presence of all three transcription factors bound in 
close proximity to one another provides much greater confidence 
that a given region is functionally important in regulating gene 
expression in these cells. In addition to studying transcription fac- 
tors, the ChIP technique can be modified to locate the positions 
within the genome of any other type of DNA-bound protein, such as 
a particular type of modified histone or even the locations of bound 
RNA polymerase molecules. 
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1. What types of regulatory sequences are found in the 
regulatory regions of the DNA upstream from a gene, 
such as that which encodes for PEPCK? What is the 
role of these various sequences in controlling the 
expression of the nearby gene(s)? 
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12.14 An Example of 
Transcriptional Activation: 
The Glucocorticoid Receptor 


Transcription can be said to be controlled by a combinatorial code 
consisting of an array of transcription factors and changes in their 
preferred binding sites from gene to gene. The sites at which these 
transcription factors bind to the region flanking a gene are often 
called response elements. Here, we will focus on the glucocorti- 
coids, a group of steroid hormones (e.g., cortisol) synthesized by the 
adrenal gland. Analogues of these hormones, such as prednisolone, 
are prescribed as potent anti-inflammatory agents. 

Secretion of glucocorticoids is highest during periods of stress, as 
occurs during starvation or following a severe physical injury. For a 
cell to respond to glucocorticoids, it must possess a specific receptor 
capable of binding the hormone. The glucocorticoid receptor (GR) is a 
member of a superfamily of nuclear receptors (including receptors for 
thyroid hormone, retinoic acid, and estrogen) that are thought to have 
evolved from a common ancestral protein. Members of this superfamily 
are more than just hormone receptors, they are also DNA-binding 
transcription factors. When a glucocorticoid hormone enters a target 
cell, it binds to a glucocorticoid receptor protein in the cytosol, chang- 
ing the conformation of the protein. This change exposes a nuclear 
localization signal (page 462), which facilitates translocation of the 
receptor into the nucleus (FIGURE 12.47). The ligand-bound receptor 
binds to a specific DNA sequence, called a glucocorticoid response 
element (GRE). For the PEPCK gene, this site is located upstream from 
the core promoter (Figure 12.44), and binding activates transcription 
of the gene (Figure 12.47). Identical or similar GRE sequences are 
located upstream from a number of other genes on different chromo- 
somes. If the site is the exact preferred binding site, the gene will be 
highly responsive to elevated glucocorticoid levels, whereas genes 
with imperfect binding sites will respond in accordance with the 
change in binding affinity for the glucocorticoid-receptor complex. 
Consequently, a single stimulus (elevated glucocorticoid concentra- 
tion) can simultaneously activate a number of genes, each at its own 
precise level, allowing a comprehensive, finely tuned response that 
meets the needs of the cell. 

The preferred glucocorticoid response element consists of the 
following sequence 


5’-AGAACAnnnTGTTCT-3’ 
3’-TCTTGTnnnACAAGA-5’ 


where n can be any nucleotide. A symmetrical sequence of this type 
is called a palindrome because the two strands have the same 5’ to 3’ 
sequence. The GRE is seen to consist of two defined stretches of 
nucleotides separated by three undefined nucleotides. The twofold 
nature of a GRE is important because pairs of GR polypeptides bind 
to the DNA as dimers in which each subunit of the dimer binds to 
one-half of the DNA sequence indicated above (see Figure 12.40). 
The importance of the GRE in mediating a hormone response is 
most clearly demonstrated by introducing one of these sequences 
into the upstream region of a gene that normally does not respond to 
glucocorticoids. When cells containing DNA that has been engi- 
neered in this way are exposed to glucocorticoids, transcription of 
the gene downstream from the transplanted GRE is initiated. Visual 
evidence that gene transcription can be stimulated by foreign regula- 
tory regions is presented in Figure 18.48. 


Cortisol 
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FIGURE 12.47 Activation of a gene by a steroid hormone. The hormone cortisol, a glucocorticoid, enters the cell from the extracellular fluid (step 1), 
diffusing through the lipid bilayer (step 2) and into the cytoplasm, where it binds to a glucocorticoid receptor (step 3), changing its conformation and causing it to 
translocate into the nucleus, where it acts as a transcription factor and binds to a glucocorticoid response element (GRE) of the DNA (step 4). The glucocorticoid 
receptor binds to the GRE as a dimer, which activates transcription of the DNA (step 5), leading to the synthesis of specific proteins in the cytoplasm (step 6). 
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1. How are the functions of a bacterial lac repressor and 
a mammalian glucocorticoid receptor similar? How 
are they different? 
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12.15 Transcriptional Activation: 
The Role of Enhancers, Promoters, 
and Coactivators 


The GRE situated upstream from the PEPCK gene and the other 
response elements illustrated in Figure 12.44 are referred to as distal 
promoter elements to distinguish them from the proximal promoter 
elements situated closer to the gene or the core promoter that dic- 
tates the site of initiation (Figure 12.45). The expression of most 
genes is also regulated by even more distant DNA elements called 
enhancers. An enhancer typically contains multiple binding sites 
for sequence-specific transcriptional activators. Enhancers are often 
distinguished from promoter elements by a unique property: They 
can be situated either upstream or downstream of the start site, and 
they can even be inverted (rotated 180°), without affecting the ability 
of a bound transcription factor to stimulate transcription. Deletion 
of an enhancer can decrease the level of transcription by 100-fold or 
more. A typical mammalian gene may have a number of enhancers 
scattered within the DNA in the vicinity of the gene. Different 
enhancers typically bind different sets of transcription factors and 
respond independently to different stimuli. Some enhancers are 
located thousands or even tens of thousands of base pairs upstream 
or downstream from the gene whose transcription they stimulate.” 


’There is wide variation in the terminology used to describe the various types of regu- 
latory elements that control gene transcription. The terms employed here—core pro- 
moter, proximal promoter elements, distal promoter elements, and enhancers—are 
not universally adopted, but they describe elements that are usually present and often 
(but not always) capable of being distinguished. 


Even though enhancers and promoters may be separated by 
large numbers of nucleotides, enhancers are thought to stimulate 
transcription by influencing events that occur at the core promoter. 
Enhancers and core promoters can be brought into proximity because 
the intervening DNA can form a loop through the interactions of 
bound proteins (FIGURE 12.48). If enhancers can interact with pro- 
moters over such long distances, what is to prevent an enhancer from 
binding to an inappropriate promoter located even farther down- 
stream on the DNA molecule? A promoter and its enhancers are, in 
essence, “cordoned off” from other promoter/enhancer elements by 
specialized boundary sequences called insulators. 

One of the most active areas of molecular biology in the past 
decade or so has focused on the mechanism by which a transcriptional 
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FIGURE 12.48 The mechanisms by which transcriptional activators 
bound at distant sites can influence gene expression. Transcriptional 
activators bound at upstream enhancers influence gene expression through 
interaction with coactivators. Four different types of coactivators are 
depicted here, two of them labeled “Histone modification complex” and 
“Chromatin remodeling complex” act by altering the structure of chroma- 
tin. Two others, labeled “TAFs” and “Mediator” act on components of the 
basal transcription machinery that assembles at the core promoter. These 
various types of coactivators are discussed in the following sections. 
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transcription at the core promoter. Transcription factors accomplish 
this feat through the action of intermediaries known as coactiva- 
tors. Coactivators are large complexes that consist of numerous 
subunits. Coactivators can be broadly divided into two functional 
groups: (1) those that interact with components of the basal tran- 
scription machinery (the general transcription factors and RNA 
polymerase II) and (2) those that act on chromatin, converting it 
from a state that is relatively inaccessible to the transcription machin- 
ery to astate that is much more transcription “friendly” Figure 12.48 
shows a schematic portrait of four types of coactivators, two of each 
of the major groups. These various types of coactivators work 
together in an orderly manner to activate the transcription of par- 
ticular genes in response to specific intracellular signals. Given the 
large number of transcription factors encoded in the genome and the 
limited diversity of coactivators, each coactivator complex operates 
in conjunction with a wide variety of different transcription factors. 
The coactivator CBP, for example, which is discussed below, partici- 
pates in the activities of hundreds of different transcription factors. 


Coactivators That Interact with the Basal 
Transcription Machinery 


Transcription is accomplished through the recruitment and subse- 
quent collaboration of large protein complexes. TFIID, one of the 
GTFs required for initiation of transcription (page 417), consists of a 
dozen or more subunits. The key component is TATA binding protein 
(TBP), which binds the TATA box. Its binding is facilitated by a num- 
ber of associated factors denoted as TAFs (TBP-associated factors). 
Some transcription factors are thought to influence events at the core 
promoter by interacting with one or more of these TFIID subunits. 
Another coactivator that communicates directly between enhancer- 
bound transcription factors and the basal transcription machinery is 
called Mediator, which is a huge, multisubunit complex that interacts 
directly with RNA polymerase II. Mediator is required by a wide vari- 
ety of transcriptional activators and may be an essential element in 
the transcription of most, if not all, protein-coding genes. Despite 
considerable effort, the mechanism of action of Mediator remains 
unclear. 


Coactivators That Alter Chromatin 
Structure 


The discovery of nucleosomes in the 1970s raised an important 
question that still has not been completely answered: How are non- 
histone proteins (such as transcription factors and RNA polymer- 
ases) able to interact with DNA that is tightly wrapped around core 
histones? A large body of evidence suggests that incorporation of 
DNA into nucleosomes does, in fact, impede access to DNA and 
markedly inhibits both the initiation and elongation stages of tran- 
scription. How do cells overcome this inhibitory effect that results 
from chromatin structure? 

As discussed on page 466, each of the histone molecules of the 
nucleosome core has a flexible N-terminal tail that extends outside 
the core particle and past the DNA helix. As discussed earlier, cova- 
lent modifications of these tails have a profound impact on chroma- 
tin structure and function. We have already seen how the addition of 
methyl groups on H3K9 residues can promote chromatin compac- 
tion and transcriptional silencing (page 472). The addition of acetyl 


groups to specific lysine residues in core histones tends to have the 
opposite effect. On a larger scale, acetylation of histone residues is 
thought to prevent chromatin fibers from folding into compact 
structures, which helps to maintain active, euchromatic regions. On 
a finer scale, histone acetylation increases access of specific regions 
of the DNA template to interacting proteins, which promotes tran- 
scriptional activation. As we will see shortly, the enzymes that carry 
out these histone modifications are parts of large multiprotein com- 
plexes involved in regulating gene expression. 

Techniques have been developed in recent years to determine 
the nature of the modifications in the histones of nucleosomes on a 
genome-wide scale. These techniques involve a ChIP technique sim- 
ilar to that illustrated in Figure 12.46, but rather than determining 
the genome-wide location of particular transcription factors, the 
goal is to pinpoint the precise locations of particular histone 
modifications within the genome. Rather than using antibodies 
against transcription factors to precipitate a particular fraction of 
protein-bound DNA segments, researchers use antibodies that spe- 
cifically recognize particular histone modifications, such as acety- 
lated H3K9 or H4K16, or methylated H3K4 or H3K36. All four of 
these histone marks had been known to be associated with tran- 
scriptionally active genes (Figure 12.18), but it wasn’t known how 
these marks might be distributed within such genes. Are specific 
histone modifications spread uniformly throughout each gene, or 
are there differences from one end to the other? 

Results from analysis of active genes in the yeast genome are 
shown in FIGURE 12.49 and reveal marked differences within various 
parts of these genes. Acetylated H3 and H4 histones and methylated 
H3Ké4 residues are clustered in the promoter regions and largely 
absent from the main body of active genes, suggesting that these 
modifications are primarily important in the activation of a gene or 
the initiation of its transcription. In contrast, methylated H3K36 resi- 
dues are largely absent from the promoters and instead are concen- 
trated in the transcribed regions of active genes. Several studies 
provide a rationale for these differences in location and give an 
example of the complex relationships between histone modifications. 
As described on page 418, the initiation of transcription is correlated 
with the phosphorylation of Ser5 residues in the CTD of RNA 
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FIGURE 12.49 Histone modifications can act as signatures of tran- 
scribed chromatin regions. The figure depicts selective localization of 
histone modifications within the chromatin of transcribed genes based on 
genome-wide analyses in yeast. Histone acetylation and H3K4 methylation 
are localized primarily in the promoter region of active genes and decrease 
markedly in the transcribed portion of the gene. In contrast, H3K36 methyla- 
tion displays the opposite localization pattern. TSS, transcription start site. 


polymerase II. These phosphorylated Ser5 residues serve as a 
recognition platform for the histone methyltransferase Set1, which 
methylates H3K4 in the chromatin of the promoter. Methylated 
H3K4 residues, in turn, serve as binding sites for a number of protein 
complexes, including ones involved in chromatin remodeling (see 
Figure 12.54) and pre-mRNA splicing (see Figure 11.30). In contrast, 
methylation of H3K36 is catalyzed by the histone methyltransferase 
Set2, which travels with the elongating polymerase. Once this residue 
is methylated, it serves as a recruitment site for another enzyme com- 
plex (Rpd3S) that catalyzes the removal of acetyl groups from lysine 
residues of histones in the transcribed portion of the gene. Evidence 
suggests that removal of acetyl groups from nucleosomes in the wake 
of an elongating RNA polymerase prevents the inappropriate initia- 
tion of transcription within the internal coding region of a gene. 

Let’s look more closely at events that occur at the promoter 
regions of genes during the activation of transcription. Acetyl groups 
are added to specific lysine residues on the core histones by a family 
of enzymes called histone acetyltransferases (HATs). In the late 
1990s, it was discovered that a number of coactivators possessed 
HAT activity. If the HAT activity of these coactivators was elimi- 
nated by mutation, so too was their ability to stimulate transcription. 
The discovery that coactivators contain HAT activity provided a 
crucial link between histone acetylation, chromatin structure, and 
gene activation. Figure 12.50 shows an ordered series of reactions 
that has been proposed to occur following the binding of a transcrip- 
tional activator, such as the glucocorticoid receptor, to its response 
element on the DNA. Once bound to the DNA, the activator recruits 
a coactivator (e.g., CBP) to a region of the chromatin that is targeted 
for transcription. Once positioned at the target region, the coactiva- 
tor acetylates the core histones of nearby nucleosomes, which 
exposes a binding site for a chromatin remodeling complex (dis- 
cussed below). The combined actions of these various complexes 
increase the accessibility of the promoter to the components of the 
transcription machinery, which assembles at the site where tran- 
scription will be initiated. 

FIGURE 12.50 depicts the activity of two coactivators that affect 
the state of chromatin. We have already seen how the HATs act to 


FIGURE 12.50 A model describing the activation of transcription. 
Transcription factors, such as the glucocorticoid receptor (GR), bind to the 
DNA and recruit coactivators, which facilitate the assembly of the 
transcription preinitiation complex. Step 1 of this drawing depicts a region 
of a chromosome that is in a repressed state because of the association of 
its DNA with deacetylated histones. In step 2, the GR is bound to the GRE, 
and the coactivator CBP has been recruited. CBP contains a subunit that 
has histone acetyltransferase (HAT) activity. These enzymes transfer acetyl 
groups from an acetyl CoA donor to the amino groups of specific lysine 
residues on histone proteins. As a result, histones of the nucleosome core 
particles in the regions both upstream and downstream from the TATA box 
become acetylated. In step 3, the acetylated histones recruit SWI/SNF, 
which is a chromatin remodeling complex. Together, the two coactivators 
CBP and SWI/SNF change the structure of the chromatin to a more open, 
accessible state. In step 4, TFIID binds to the open region of the DNA. One 
of the subunits of TFIID (called TAF,250 or TAF1) also possesses histone 
acetyltransferase activity as indicated by the red arrow. Together, CBP and 
TAF, 250 modify additional nucleosomes to allow transcription initiation. 

In step 5, the remaining nucleosomes of the promoter have been 
acetylated, RNA polymerase II is bound to the promoter, and 

transcription is set to begin. 


modify the histone tails; let us look more closely at the other type of 497 


coactivator, the chromatin remodeling complexes. Chromatin 
remodeling complexes use the energy released by ATP hydrolysis to 
alter nucleosome structure and location along the DNA. This in turn 
may allow the binding of various proteins to regulatory sites in the 
DNA. The best studied chromatin remodeling machines are mem- 
bers of the SWI/SNF family. SWI/SNF complexes, which consist of 
nine to twelve subunits, do not bind to specific DNA sequences but 
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present on nucleosomal histones or other proteins already bound to 
the DNA. In Figure 12.50, the coactivator CBP has acetylated the core 
histones, providing a high-affinity binding site for the remodeling 
complex. Once recruited to a promoter, chromatin remodeling com- 
plexes are thought to disrupt histone-DNA interactions, which can 


1. Promote the mobility of the histone octamer so that it slides 
along the DNA to new positions (FIGURE 12.51, path 1). In the 
best studied case, the binding of transcriptional activators to an 
enhancer upstream from the IFN- gene leads to the sliding of a 
key nucleosome approximately 35 base pairs along the DNA, 
exposing the TATA box that had been previously covered by 
histones. Sliding occurs as the remodeling complex translocates 
along the DNA. 


2. Change the conformation of the nucleosome. In the example 
depicted in Figure 12.51, path 2, the DNA has formed a transient 
loop or bulge on the surface of the histone octamer, making that 
site more accessible for interaction with DNA-binding regulatory 
proteins. 


3. Facilitate the replacement within the histone octamer of a stand- 
ard core histone by a histone variant (page 467) that is correlated 
with active transcription. For example, the SWR1 complex 
exchanges H2A/H2B dimers with a variant histone (H2A.Z) that 
dimerizes with H2B. (Figure 12.51, path 3). 


4. Displace the histone octamer from the DNA entirely (Figure 
12.51, path 4). For example, nucleosomes are thought to be tem- 
porarily disassembled as an elongating RNA polymerase com- 
plex moves along the DNA within a gene. 


Evidence has accumulated for many years that nucleosomes are 
not positioned randomly along the DNA and, more importantly, 
that specific nucleosome positioning plays an important role in 
regulating transcription of specific regions of the genome. A number 
of attempts have been made to determine whether nucleosomes are 
preferentially present or absent from particular sites within the 
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FIGURE 12.51 Chromatin remodeling. In pathway 1, a key nucleosome 
slides along the DNA, thereby exposing the TATA binding site and allowing 
the preinitiation complex to assemble. In pathway 2, the histone octamer of 
a nucleosome has been reorganized. Although the TATA box is not 
completely free of histone association, it is now able to bind the proteins of 
the preinitiation complex. In pathway 3, the standard H2A/H2B dimers of a 
nucleosome have been exchanged with histone variants (e.g., H2A.Z/H2B 
dimers) that associate with active chromatin. In pathway 4, the histone 
octamer has been disassembled and is lost from the DNA entirely. 
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FIGURE 12.52 The nucleosomal landscape of yeast genes. The top portion of the illustration shows a typical region of DNA in the vicinity of a gene that 
is being actively transcribed by an RNA polymerase II complex. The consensus positions of nucleosomes are illustrated by the gray ovals. The lower portion 
of the illustration shows the distribution of nucleosomes along the DNA with the height of the line reflecting the likelihood that a nucleosome is found at 
that site. The 5’ region of the gene has the most highly defined chromatin architecture. There is a very high likelihood that the promoter region is bare of 
nucleosomes (a nucleosome-free region or NFR) blanked by two very well-positioned nucleosomes that lie on either side of the transcription start site (green 
light); these two nucleosomes are identified as +1 and —1 at the top of the figure. The subsequent nucleosomes near the 5’ end of the transcribed region 
also tend to be well positioned, as indicated by the distinct locations of these nucleosomes at the top of the drawing. A nucleosome is also seen to be 
displaced by RNA polymerase as it transcribes the gene. The green shading corresponds to a region of chromatin with high levels of the H2A.Z histone 
variant, high histone acetylation and H3K4 methylation, and a high likelihood of nucleosomal positioning. 


Source: From Cizhong Jiang and B. Franklin Pugh, Nature Revs. Gen. 10:164, 2009; © Copyright 2009, Macmillan Magazines Limited, adapted from T. N. 


Mavrich, et al., Genome Res. 18:1074, 2008. 


genome in a given type of cell. To carry out these studies, the 
chromatin is typically treated with a nuclease that digests those por- 
tions of the DNA that are not protected by their association with 
histone octamers. The DNA that has been protected from nuclease 
digestion is then dissociated from its associated histones and 
sequenced. These techniques have allowed researchers to prepare 
genome-wide maps of nucleosome positions along the DNA. The 
most complete analyses of nucleosome positioning have been car- 
ried out on yeast cells, and they provide a somewhat different picture 
from the traditional view that is based on years of biochemical stud- 
ies on the transcription of individual genes in mammalian cells that 
was presented in Figure 12.50. 

Studies on yeast cells suggest that the majority of genes share a 
common chromatin architecture, which is depicted in FIGURE 12.52. 
As indicated in this figure, nucleosomes are not randomly distrib- 
uted along the DNA but instead are surprisingly well positioned. Most 
notably, the promoter sequences tend to reside within nucleosome- 
free regions (NFRs of Figure 12.52), which are flanked on either side 
by two very well-positioned nucleosomes identified as +1 and —1 in 
Figure 12.52. The nucleosome-free region surrounding the pro- 
moter is likely an important consideration in allowing access by 
regulatory factors to these target sites in the DNA. Of all the nucle- 
osomes in a yeast gene, the —1 nucleosome undergoes the most 
extensive modifications upon transcriptional activation. This nucle- 
osome may be moved to a new site, evicted from the DNA, or 
undergo extensive histone modifications or histone exchange. The 
downstream nucleosomes (+1, +2, +3, etc.) are also subject to many 
of these same alterations during transcriptional activation, but the 
degree to which a nucleosome is affected diminishes with the dis- 
tance it is located from the transcriptional start site. It is not clear yet 
whether the promoter regions of most nontranscribed genes in 
higher eukaryotes tend to be relatively covered in nucleosomes as 
depicted in Figure 12.50 (referred to as “closed promoters”) or rela- 
tively free of nucleosomes as suggested in Figure 12.52 (referred to as 
“open promoters”). Even if the transcription start site is covered by a 
nucleosome, studies suggest that this nucleosome may be less stable 
than its neighbors, containing histone variants (H3.3 and H2A.Z) in 
place of the standard core histones. Regardless of the specific nucleo- 
some topology, the promoter regions of chromatin are the targets of 
a wide variety of histone modifying enzymes (e.g., histone acetyl- 
transferases, deacetylases, methyltransferases, and demethylases), 
chromatin remodeling complexes (e.g., SWI/ SNF), and gene-specific 
transcription factors. 


REVIEW | 


1. What are two different ways that co-activators can 
influence gene expression? 
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12.16 Transcriptional Activation 
from Paused Polymerases 


Throughout this discussion of transcriptional activation, we have 
described how transcription factors bind to specific DNA sequences 
and induce the recruitment of general transcription factors (GTFs), 
chromatin modification complexes, and RNA polymerase II, which 
leads to the initiation of transcription. It came as a surprise to learn 


that RNA polymerases are also bound to the promoters of many 499 


genes that show no evidence they are being transcribed. In some 
cases, an RNA polymerase situated at one of these “transcriptionally 
silent” genes initiates RNA synthesis, but the polymerase fails to 
transition to the elongation stage of transcription. In other cases, the 
polymerase goes on to synthesize an RNA of about 30 nucleotides 
and then becomes stalled. In either case, a full-length primary tran- 
script is never generated. According to one model, RNA polymerase 
molecules situated downstream of promoters are held in the paused 
state by bound inhibitory factors (e.g., DSIF and NELF). Inhibition 
is then relieved as (1) these factors are phosphorylated by stimula- 
tory kinases (e.g., P-TEFb) and (2) elongation factors (e.g., ELL) are 
recruited to the polymerase (as in Figure 11.18). These studies have 
suggested that some transcription factors (e.g., Myc) may stimulate 
the transcription of some genes by acting at the level of transcription 
elongation as well as that of transcription initiation. Because it 
doesn’t require the assembly of the transcriptional machinery at the 
promoter, the induced release of paused polymerases may facilitate 
the rapid activation of genes in response to developmental or envi- 
ronmental signals. 


REVIEW | 


1. What is one possible advantage of regulating a gene 
by controlling elongation by an already bound RNA 
polymerase? 


12.17 Transcriptional Repression 


As is evident from Figures 12.2 and 12.3, control of transcription in 
bacteria relies heavily on repressors that block transcription. 
Although research in eukaryotes has focused primarily on factors 
that activate or enhance transcription of specific genes, eukaryotic 
cells also possess negative regulatory mechanisms. 

We've seen that transcriptional activation is associated with 
changes in the state and/or position of nucleosomes in a particular 
region of chromatin. The state of acetylation of chromatin is a 
dynamic property; just as there are enzymes (HATs) to add acetyl 
groups, there are also enzymes to remove them. Removal of acetyl 
groups is accomplished by histone deacetylases (HDACs). 
Whereas HATs are associated with transcriptional activation, HDACs 
are associated with transcriptional repression. HDACs are present as 
subunits of larger complexes described as corepressors. Corepressors 
are similar to coactivators, except that they are recruited to specific 
genetic loci by transcriptional factors (repressors) that cause the tar- 
geted gene to be silenced rather than activated (FIGURE 12.53). The 
progression of several types of cancer may depend on the tumor cells 
being able to repress the activity of certain genes. A number of anti- 
cancer drugs (e.g., Zolinza) are currently being tested that act by 
inhibiting HDAC enzymes. 

Recent studies indicate that the removal of acetyl groups from 
histone tails is accompanied by another histone modification: meth- 
ylation of the lysine residue at position #9 of histone H3 molecules. 
You might recall that this modification, H3K9me, was discussed at 
some length on page 472 as a key event in the formation of hetero- 
chromatin. It now appears that this same modification is also 
involved in the more dynamic types of transcriptional repression 
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FIGURE 12.53 A model for transcriptional repression. Histone tails in 
the promoter regions of active chromatin are usually heavily acetylated. 
When a transcriptional repressor binds to its DNA binding site, it recruits a 
corepressor complex (e.g., SMRT/N-CoR or CoREST) and an associated 
HDAC activity. The HDAC removes acetyl groups from the histone tails. 

A separate protein containing histone methyltransferase activity adds 
methyl groups to the K9 residue of H3 histone tails. Together, the loss of 
acetyl groups and addition of methyl groups lead to chromatin inactivation 
and gene silencing. 


that occur within euchromatic regions of the genome. Figure 12.53 
suggests one of numerous possible models of transcriptional repres- 
sion that incorporates several aspects of chromatin modification. 


DNA Methylation 


One of the key factors in silencing a region of the genome involves 
a phenomenon known as DNA methylation. Examination of the 
DNA of mammals and other vertebrates indicates that as many as 
one out of 100 nucleotides bears an added methyl group, which is 
always attached to carbon 5 of a cytosine. Methyl groups are added 
to the DNA by a family of enzymes called DNA methyltransferases 
encoded in humans by DNMT genes. This simple chemical modifi- 
cation is thought to serve as an epigenetic mark or “tag” that allows 
certain regions of the DNA to be identified and utilized differently 
from other regions. In mammals, the methylcytosine residues are 


part of a 5’-CpG-3’ dinucleotide within a symmetrical sequence, 
such as 
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in which the red dots indicate the positions of the methyl groups.’ As 
a true epigenetic mark, the pattern of DNA methylation must be 
maintained through repeated cell divisions. This is accomplished by 
an enzyme, Dnmt1, that travels with the replication fork and methyl- 
ates the daughter DNA strands by copying the methylation pattern 
of the parental strands. 

The majority of methylcytosine residues in mammalian DNA 
are located within noncoding, repeated sequences, primarily trans- 
posable elements (page 392). Methylation is thought to maintain 
these elements in an inactive state. Organisms that harbor mutant 
DNMTSs tend to exhibit a marked increase in transposition activity, 
which can be detrimental to the health of the organism. As discussed 
on page 435, piRNAs serve as mediators of transposable element 
suppression in germ cells. During the formation of male gametes 
(ie., during spermatogenesis), piRNAs act by guiding the DNA- 
methylation machinery to sites where transposable elements reside 
within the genome. 

In addition to the general suppression of transposable elements, 
DNA methylation has long been implicated in the repression of tran- 
scription of specific genes. In recent years, techniques have been 
developed to determine the specific cytosine residues that are methyl- 
ated in any given population of cells across the entire genome. These 
studies confirm that (1) the promoter regions of inactive genes tend 
to be more heavily methylated than the promoter regions of active 
genes and (2) DNA methylation patterns vary from one cell type to 
another, reflecting the differential activity of genes among various tis- 
sues. Most evidence suggests that methylation of a gene’s promoter 
serves more to maintain that gene in an inactive state than as a mech- 
anism for initial inactivation. As an example, inactivation of the genes 
on one of the X chromosomes of female mammals (page 470) occurs 
prior to a wave of methylation of gene promoters that is thought to 
convert the DNA into a more permanently repressed condition. 

How does a given cell determine which promoters should 
become methylated, and how does methylation of these promoters 
enforce the transcriptionally repressed state of that gene? DNA 
methylation has been closely linked with another repressive epige- 
netic mark—histone methylation. It may be that gene inactivation 
begins with the establishment of a transcriptionally repressive pat- 
tern of histone modifications in the core histones of promoter 
regions and that these modified histone tails then recruit the DNA 
methylation machinery to those nucleosomes. Once the DNA in 
these regions is methylated, the methylated cytosine residues can 


An exception is found in embryonic stem cells, where approximately one-fourth of 
methylations occur in a non-CpG context, including CpA and CpT. At some sites, 
methylcytosine can be enzymatically converted to hydroxy-methylcytosine, which 
may represent either an intermediate in the DNA demethylation process or an alter- 
nate epigenetic mark. 
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FIGURE 12.54 Changes in DNA methylation levels during mammalian 
development. The DNA of the fertilized egg (zygote) is substantially 
methylated. During cleavage, the genome undergoes global demethylation. 
Interestingly, DNA inherited from one’s father undergoes demethylation at 
an earlier stage and by a different mechanism than DNA inherited from 
one’s mother. After implantation, the DNA is subjected to new (de novo) 
methylation, which is maintained in the somatic cells at a high level 
throughout the remainder of development and adulthood. In contrast, the 
DNA of primordial germ cells, which give rise to the gametes in the adult, 
is subsequently demethylated. The DNA of the germ cells then becomes 
remethylated at later stages of gamete formation. 

Source: Based on a figure from R. Jaenisch, Trends Genetics 13-325, 1997; 
copyright 1997. Trends in Genetics by Elsevier Ltd. Reproduced with 
permission of Elsevier Ltd. in the format reuse in a book/textbook via 
Copyright Clearance Center. 


serve as binding sites to recruit additional histone modifying enzymes 
that further repress and compact the chromatin of that promoter (as 
in Figure 12.53). 

Although DNA methylation is a relatively stable epigenetic 
mark, the transmission of these marks from a parental cell to its 
daughters is subject to regulation. The remarkable shifts in DNA- 
methylation levels that occur during the life of a mammal are 
depicted in FIGURE 12.54. The first major change in the level of 
methylation occurs between fertilization and the first few divisions 
of the zygote, when the DNA loses the methylation “tags” that were 
inherited from the previous generation. Then, at about the time the 
embryo implants into the uterus, a wave of new (or de novo) 
methylation spreads through the cells, establishing a new pattern of 
methylation throughout the DNA. We don't know the signals that 
determine whether a given gene in a particular cell is targeted for 
methylation or spared at this time. It is evident, however, that abnor- 
mal DNA methylation patterns are often associated with disease. For 
example, the development of tumors often depends on aberrant 
methylation and subsequent silencing of genes whose expression 
would normally suppress tumor growth. 

DNA methylation is not a universal mechanism for inactivating 
eukaryotic genes. DNA methylation has not, for example, been 
found in yeast or nematodes. Plant DNA, in contrast, is often heavily 
methylated, and studies on cultured plant cells indicate that, as in 
animals, DNA methylation is associated with gene inactivation. In 
one experiment, plants treated with compounds that interfere with 
DNA methylation produced a greatly increased number of leaves 
and flower stalks. Moreover, flowers that developed on these stalks 
had a markedly altered morphology. 

One of the most dramatic examples of the role of DNA methylation 
in silencing gene expression occurs as part of an epigenetic phenome- 
non known as genomic imprinting, which is unique to mammals. 


Genomic Imprinting 


It had been assumed until the mid-1980s that the set of chromo- 
somes inherited from a male parent was functionally equivalent to 
the corresponding set of chromosomes inherited from the female 
parent. But, as with many other long-standing assumptions, this 
proved not to be the case. Instead, certain genes are either active or 
inactive during early mammalian development depending solely on 
whether they were brought into the zygote by the sperm or the egg. 
For example, the gene that encodes a fetal growth factor called IGF2 
is only active on the chromosome transmitted from the male parent. 
In contrast, the gene that encodes a specific potassium channel 
(KVLQT1) is only active on the chromosome transmitted from the 
female parent. Genes of this type are said to be imprinted accord- 
ing to their parental origin. Imprinting can be considered an epige- 
netic phenomenon (page 480), because the differences between 
alleles are inherited from one’s parents but are not based on differ- 
ences in DNA sequence. It is estimated, based largely on the study of 
mutant mice, that the mammalian genome contains at least 80 
imprinted genes located primarily in several distinct chromosomal 
clusters. 

Genes are thought to become imprinted as the result of selective 
DNA methylation of certain regions that control the expression of 
either the male or female alleles. As a result, the maternal and pater- 
nal versions of imprinted genes differ consistently in their degree of 
methylation. Furthermore, mice that lack a key DNA methyltrans- 
ferase (Dnmtl1) are unable to maintain the imprinted state of the 
genes they inherit. The methylation state of imprinted genes is not 
affected by the waves of demethylation and remethylation that sweep 
through the early embryo (Figure 12.54). Consequently, the same 
alleles that are inactive due to imprinting in the fertilized egg will be 
inactive in the cells of the fetus and most adult tissues. The major 
exception occurs in the germ cells, where the imprints inherited 
from the parents are erased during early development and then rees- 
tablished when that individual begins to produce his or her own 
gametes. Some mechanism must exist whereby specific genes (e.g., 
KVLQT1) are selected for inactivation during sperm formation, 
whereas other genes (e.g., IGF2) are selected for inactivation during 
formation of the egg. Each of the imprinted gene clusters produces at 
least one noncoding RNA. These RNAs play a key role in directing 
the silencing of nearby genes in several of the clusters. 

Disturbances in imprinting patterns have been implicated in a 
number of rare human genetic disorders, particularly those invol- 
ving a cluster of imprinted genes residing on chromosome 15. 
Prader-Willi syndrome (PWS) is an inherited neurological disorder 
characterized by mental retardation, obesity, and underdevelopment 
of the gonads. The disorder often occurs when chromosome 15 
inherited from the father carries a deletion in a small region contain- 
ing the imprinted genes. Because the paternal chromosome carries a 
deletion of one or more genes and the maternal chromosome carries 
the inactive, imprinted version of the homologous region, the indi- 
vidual lacks a functional copy of the gene(s). Although genes are 
usually imprinted for the life of an individual, cases are known where 
imprinting can be lost. In fact, loss of imprinting of the IGF2 gene 
occurs in about 10 percent of the population, leading to increased 
production of the encoded growth factor. Individuals with this epi- 
genetic alteration are at increased risk of developing colorectal can- 
cer. One case has been uncovered of a woman who, because of a 
presumed deficiency in a DNA methylating enzyme, produced 
oocytes totally lacking imprinted genes. When fertilized, these oocytes 
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nature of this epigenetic contribution. 

What possible role could genomic imprinting play in the devel- 
opment of an embryo? Although there are several different thoughts 
on this question, there is no definitive answer. According to one 
researcher, genomic imprinting is a “phenomenon in search of a rea- 
son,’ which is where we will leave the matter. 


Long Noncoding RNAs (IncRNAs) 
as Transcriptional Repressors 


As discussed on page 436, a large fraction of the mammalian genome 
is transcribed into RNAs, including thousands of species that are 
large enough to be described as long noncoding RNAs, or IncRNAs. 
The functions of a handful of IncRNAs have been well studied, 
including several species involved in genomic imprinting or in X- 
chromosome inactivation. These studies suggest that IncRNAs serve 
as sequence-specific molecules that can guide protein complexes to 
specific sites in the chromatin. Like XIST (page 470), most IncRNAs 
appear to play a role in orchestrating transcriptional repression, 
although some IncRNAs may instead function in transcriptional 
activation at some loci. 

An example of IncRNA-mediated gene repression is seen in the 
expression of human HOX genes, whose encoded proteins play a 
key role in determining the anterior-posterior axis of the early 
embryo. The 5’ end of the IncRNA HOTAIR (which is transcribed 
from the HOXC locus in the human genome) binds the PRC2 com- 
plex, while the 3’ end of this ncRNA binds the CoREST complex. 
PRC2 contains a histone methyltransferase that is specific for the 
K27 residue on the tails of H3 core histones. Methylation of H3K27 
is a repressive modification that maintains the transcriptionally 
inactive state of the chromatin of those genes to which it has been 
added. CoREST is a corepressor complex that was shown in Figure 
12.53. In that situation, CoREST acted as a corepressor through its 
association with a histone deacetylase. In the present case shown in 
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FIGURE 12.55, CoREST is associated with a histone demethylase 
that removes methyl groups from K4 residues of histone H3. As 
indicated in Figure 12.49, methylated H3K4 is a mark of transcrip- 
tionally active genes, and, consequently, removal of the added 
methyl groups leads to repression of transcription. HOTAIR guides 
these two repressive protein complexes to another locus (HOXD) 
situated on a different chromosome. While they are tethered to the 
HOXD target locus, PRC2 and CoREST modify the adjacent chro- 
matin and inhibit its transcription. This is not an isolated instance: 
Genome-wide ChIP-chip analysis (page 493) indicates that more 
than 700 genes in human fibroblasts are occupied by both PRC2 
and CoREST, with HOTAIR providing the link between the two 
complexes. 

It is proposed that IncRNAs can act as scaffolds to hold protein 
complexes in close association to specific target sites in the genome, 
where they can carry out their chromatin-modifying functions. 
When the expression of large numbers of IncRNAs is selectively 
blocked, the gene-expression patterns of the affected cells can be 
dramatically altered, suggesting that these RNA molecules can playa 
widespread role in the regulation of gene expression. These gene 
regulators can have important biological roles to play. For example, 
when transcription of certain IncRNAs is blocked in embryonic 
stem cells, the cells lose their pluripotent state (page 18) and express 
genes characteristic of specific lineages that would normally be 
repressed. IncRNAs are emerging as important players in diseases 
including cancer and neurodegenerative diseases. For example, 
HOTAIR overexpression is seen in many types of tumors, and high 
levels of HOTAIR expression correlate with poor prognosis. It 
remains unclear whether these changes are simply correlations or 
whether they show an actual role for the ncRNA in disease progres- 
sion, but experiments with cancer cells grown in culture have shown 
that reducing expression of cancer-linked IncRNA can reduce cell 
proliferation and increase cell death. Heritable genetic diseases can 
also result from changes in IncRNA expression. For example, a 
genetic defect in placenta formation is caused by changes in an 
IncRNA known as HELLPAR. 
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FIGURE 12.55 An IncRNA acting as a mediator of transcriptional repression. In step 1, the In>cRNA HOTAIR is being transcribed from a portion of 
the HOXC locus located on human chromosome 12. In step 2, the HOTAIR RNA has adopted a secondary structure that allows it to interact with specific 
protein complexes that will act on the HOXD locus located on human chromosome 2. The 3’ end of the IncRNA binds specifically to the CoREST 
corepressor complex, which is associated with the enzyme LSD1, which removes methyl groups from H3K4 residues of the nucleosomes, resulting in the 
removal of a transcriptionally active epigenetic mark. Meanwhile, the 5’ end of HOTAIR is bound specifically to the PRC2 complex (a Polycomb Group 
protein complex) that has an enzyme subunit that adds methyl groups to H3K27 residues, which is a transcriptionally repressive epigenetic mark. Because 
of the demethylation of H3K4 and the methylation of H3K27, the HOXD locus is transcriptionally repressed and has adopted a compact conformation 


(step 3). 


REVIEW | 


1. What is meant by the term epigenetic? How is it that 
such diverse phenomena as histone methylation, 
DNA methylation, and centromere determination can 
all be described as epigenetic? 


2. How does methylation of DNA affect gene 
expression? How is it related to histone acetylation or 
histone methylation? What is meant by genomic 
imprinting? 

3. What is the difference between an HDAC and 
an HAT? 
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12.18 RNA Processing Control 


Once an RNA is transcribed, it typically must undergo a number of 
processing events before it can function. This is particularly true for 
mRNAs, which must have a 5’ cap structure added (see Figure 
11.19), be correctly spliced, and have an intact poly(A) tail at the 3’ 
end. Each of these events must be properly regulated to enable an 
mRNA to exit the nucleus and serve as a template for protein assem- 
bly. The genes of complex plants and animals contain numerous 
introns and exons, and the introns must be precisely removed to 
allow transport through the nuclear pore. However, the pattern of 
intron removal can be subject to dramatic regulation, allowing for 
multiple protein products from the same gene. The particular splic- 
ing pathway that is followed may depend on the particular stage of 
development, or the particular cell type or tissue being considered. 
In the simplest case, a specific exon can either be retained or spliced 
out of the transcript. An example of this type of alternative splicing 
occurs during synthesis of fibronectin, a protein found in both blood 
plasma and the extracellular matrix (FIGURE 12.56). Fibronectin 
produced by fibroblasts and retained in the matrix is encoded by an 
mRNA that contains two extra exons compared to the version of the 
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FIGURE 12.56 Alternative splicing of the fibronectin gene. The 
fibronectin gene consists of a number of exons shown in the top drawing 
(the introns shown in black are not drawn to scale). Two of these exons 
encode portions of the polypeptide called EIIIA and EIIIB, which are 
included in the protein produced by fibroblasts, but which are excluded 
from the protein produced in the liver. The difference is due to alternative 
splicing; those portions of the pre-mRNA that encode these two exons are 
excised from the transcript in liver cells. The sites of the missing exons are 
indicated by the arrows in the liver mRNA. 


protein produced by liver cells and secreted into the blood. The extra 
peptides are encoded by portions of the pre-mRNA that are retained 
during processing in the fibroblast but are removed during process- 
ing in the liver cell. 

A more complicated pattern is observed in the Dscam gene, 
which encodes a family of cell-adhesion molecules that are expressed 
on the cell surface and function in axon guidance during early neu- 
ronal development in fruit flies. A homologue has been implicated 
in features of Down syndrome in humans. In fruit flies, this gene 
consists of 24 exons but includes multiple alternative versions of 
exons 4, 6, 9, and 17 (FIGURE 12.57). Remarkably, as a result of the 
combinatorial patterns that can be generated by selecting one of the 
variable exons, a single Dscam gene can encode up to 38,106 differ- 
ent cell-adhesion molecules. The fruit fly genome only contains 
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FIGURE 12.57 A more complex example of alternative splicing. Most eukaryotic genes are subject to alternative splicing. The Drosophila Dscam gene 
illustrates the diversity of proteins that can be generated by alternative splicing of transcripts from a single gene.The top line shows the organization of the 
pre-mRNA and genomic DNA. The gene contains 24 exons but a given primary transcript only contains one of the multiple possibilities from each of the 
exon 4, 6, 9, and 17 clusters. The middle line shows the mature in mRNA with the selected exon from each of these four clusters depicted in a different 
color. The bottom line shows the domain structure of the encoded protein. Exons 4 and 6 encode portions of two of the lg domains of the protein, exon 9 
encodes a different Ig domain, and exon 17 encodes the protein's transmembrane domain. Combining all the possibilities, the Dscam gene can encode 
38,106 possible mRNAs. For comparison, the entire fruit fly genome only contains about 14,000 genes. 


Source: D. L. Black, Cell 103:368; 2000, Fig. 1; Cell by Cell Press. Reproduced with permission of Cell Press in the format reuse in a book/textbook via 
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about 14,000 genes, illustrating how alternative splicing can enable 
a relatively small genome to encode a much larger proteome. 

The regulation of alternative splicing that allows one neuron to 
select a single Dscam isoform and not another is complex and an 
important goal of ongoing research for all alternative splicing events. 
The mechanism by which a particular exon is included or excluded 
depends primarily on whether specific 3’ and 5’ splice sites are 
selected by the splicing machinery as sites to be cleaved (page 425). 
Many factors can influence splice site selection. Some splice sites are 
described as “weak, which indicates that they can be bypassed by 
the splicing machinery under certain conditions. The recognition 
and use of weak splice sites are governed by sequences in the RNA, 
including exonic splicing enhancers (page 425) that are located 
within the exons whose inclusion is regulated. Exonic splicing 
enhancers serve as binding sites for specific regulatory proteins. If a 
particular regulatory protein is present and active in a given cell, that 
protein can bind to the splicing enhancer and recruit the necessary 
splicing factors to a nearby weak 3’ or 5’ splice site. Use of these 
splice sites results in the inclusion of the exon into the mRNA. A 
model of how this may work is depicted in FIGURE 12.58. If the regu- 
latory protein is not present and active in the cell, the neighboring 
splice sites are not recognized and the exon is excised along with the 
flanking introns. 

Another way in which gene expression can be regulated at the 
posttranscriptional level is by RNA editing, in which specific 
nucleotides are converted to other nucleotides after the RNA has 
been transcribed. RNA editing can create new splice sites, generate 
stop codons, or lead to amino acid substitutions. Although not nearly 
as widespread as alternative splicing, RNA editing is particularly 


FIGURE 12.58 Mechanisms of alternative splicing. (a) Changes in the 
sequence of a 5’ splice site can affect pairing with U1 snRNA. This can 
affect the kinetics of splicing by allowing those sites with better matches to 
U1 to recruit the spliceosome to one 5’ splice site over another. In this 
figure, two potential 5’ splice sites are present, as indicated by the two GU 
dinucleotides. The black line indicates the segments of the transcript that 
will be ligated after excision of the intervening section. In the top drawing, 
the splicing machinery has recognized the second of the two potential 5’ 
splice sites. In the middle drawing, a change in sequence (indicated by the 
black rectangle) has occurred in the region of the second potential 5’ splice 
site, and the splicing machinery now recognizes the first splice site. In the 
bottom drawing, a second change of sequence has been introduced in the 
region of the first potential 5’ splice site, causing the splicing machinery to 
ignore this site and use the other site as the 5’ splice site. (b) Different- 
strength splice sites can also be repressed or activated, depending on 
nearby binding of proteins that tend to either activate splice sites (SR 
proteins) or repress splice sites (hnRNP proteins). The SR proteins bind to 
specific sites within either exons or introns called exon and intron splicing 
enhancers (ESEs and ISEs) shown in light blue. The hnRNP proteins bind to 
other sites in exons or introns called exon and intron splicing silencers (ESSs 
and ISSs) shown in light red. Binding of these proteins can regulate splice 
site selection by determining whether or not splicing components such as 
U2AF, U1 snRNP, and U2 snRNP bind to a particular site on the pre-mRNA. 
Source: (a—b) J. Valcarcel, et al., Curr. Opin. Cell Biol. 21:377-386, 2009, 
Fig. 2. Current Opinion in Cell Biology by Elsevier Ltd. Reproduced with 
permission of Elsevier Ltd. in the format reuse in a book/textbook via 
Copyright Clearance Center. 


important in the nervous system, where a significant number of 
messages appear to have one or more adenines (A) converted to 
inosines (I). This modification involves the enzymatic removal of an 
amino group from the nucleotide. I is subsequently read as a G by 
the translational machinery. The glutamate receptor, which mediates 
excitatory synaptic transmission in the brain (page 164), is a product 
of RNA editing. In this case, an A-to-I modification generates a glu- 
tamate receptor whose internal channel is impermeable to Ca”* ions. 
Genetically engineered mice that are unable to carry out this specific 
RNA-editing step develop severe epileptic seizures and die within 
weeks after birth. Another important example of RNA editing affects 
the cholesterol-carrying protein apolipoprotein B. The LDL com- 
plexes discussed on page 302 are produced in the liver and contain 
the protein apolipoprotein B-100, which is translated from a full- 
length mRNA approximately 14,000 nucleotides long. In the intes- 
tine, the cytidine at nucleotide residue 6666 in the RNA is converted. 
enzymatically to a uridine, which generates a stop codon (UAA) that 
terminates translation. The shortened version of the protein, apoli- 
poprotein B-48, is produced only in cells of the small intestine where 
it plays an essential role in the absorption of fats. 


REVIEW | 


1. How is it that alternative splicing can effectively 
increase the number of genes in the genome? 


2. Describe an example of alternative splicing. What is 
the value to a cell of this type of control? How might 
a cell regulate the sites on the pre-mRNA that are 
chosen for splicing? 


3. What is RNA editing, and how can it increase the 
number of proteins that can be formed from a single 
pre-mRNA transcript? 


ee 


12.19 Translational Control 


We recall from the Central Dogma that DNA is used to make RNA 
and RNA is used to make protein. We have already seen in this chapter 
that gene expression is controlled extensively at the level of transcrip- 
tion, but the subsequent translation of the mRNA into protein pro- 
vides a second stage at which to control the final expression of genes 
into functional proteins. Translational control encompasses a wide 
variety of regulatory mechanisms affecting the translation of mRNAs 
previously transported from the nucleus to the cytoplasm. Subjects 
considered under this general regulatory umbrella include (1) locali- 
zation of mRNAs to certain sites within a cell; (2) whether an mRNA 
is translated and, if so, how often; and (3) the half-life of the mRNA, a 
property that determines how long the message is translated. 

Translational control mechanisms generally operate through 
interactions between specific mRNAs and various proteins and 
microRNAs present within the cytoplasm. It was noted on page 419 
that mRNAs contain noncoding regions, called untranslated 
regions (UTRs), at both their 5’ and 3’ ends. The 5’ UTR extends 
from the 5’ cap to the AUG initiation codon, whereas the 3’ UTR 
extends from the termination codon to the end of the RNA transcript 
(see Figure 11.19). For many years the untranslated regions of the 
message were largely ignored, but it has become evident that the 
UTRs contain nucleotide sequences used by the cell to mediate trans- 
lational control. In eukaryotic cells, a number of important biological 
processes depend on translation of stored mRNAs that are not trans- 
lated immediately upon entry to the cytoplasm. As an example, many 
mRNAs are stored in unfertilized eggs that must remain inactive until 
fertilization and subsequent development. The initiation of transla- 
tion of these mRNAs during early development involves at least two 
distinct events: removal of bound inhibitory proteins and increase in 
the length of the poly(A) tails by action of an enzyme residing in the 
egg cytoplasm. These events are illustrated in the model of transla- 
tional activation in Xenopus embryos in FIGURE 12.59 and serve to 
emphasize the fact that the 5’ and 3’ ends of mRNAs _ often commu- 
nicate by protein-protein interaction to regulate translation. 


Initiation of Translation 


Prokaryotic cells have polycistronic mRNAs that encode numerous 
polypeptides, whereas eukaryotic mRNAs are predominately 
monocistronic, encoding only a single polypeptide. In both cases, 


Coding 


region 5' UTR 


elF4E 


capyh 


translation starts at an AUG codon, but how the ribosome finds this 505 


start codon is entirely different and provides a means to differen- 
tially regulate overall protein production. We will start with eukary- 
otic translation initiation since it is fairly simple—the first AUG 
downstream of the 7-methylguanosine cap is almost always the start 
codon. The small subunit of the ribosome assembles at the cap struc- 
ture and scans down until the first AUG is found, after which the 
large subunit of the ribosome joins and translation begins (see Figure 
11.47). There are a few examples where an internal ribosome entry 
site (IRES) allows the ribosome to start at an AUG other than the 
first one downstream of the cap, but the overwhelming majority of 
mRNAs are structured such that translation begins at the first AUG 
downstream of the cap. 

A number of mechanisms have been discovered that regulate 
the rate of translation of mRNAs in response to changing cellular 
requirements. Some of these mechanisms can be considered to act 
globally because they affect translation of all messages. When a 
human cell is subjected to certain stressful stimuli, a protein kinase 
is activated that phosphorylates the initiation factor elF2, which 
blocks further protein synthesis. As discussed on page 443, eIF2- 
GTP delivers the initiator tRNA to the small ribosomal subunit, after 
which it is converted to eIF2-GDP and released. The phosphorylated 
version of e[F2 cannot exchange its GDP for GTP, which is required 
for elF2 to become engaged in another round of initiation of transla- 
tion. It is interesting to note that four different protein kinases have 
been identified that phosphorylate the same Ser residue of the eIF2a 
subunit to trigger translational inhibition. Each of these kinases 
becomes activated after a different type of cellular stress, including 
heat shock, viral infection, the presence of unfolded proteins, or 
amino acid starvation. Thus at least four different stress pathways 
converge to induce the same response. 

Other mechanisms influence the rate of translation of specific 
mRNAs through the action of proteins that recognize specific ele- 
ments in the UTRs of those mRNAs. One of the best studied exam- 
ples involves the mRNA that encodes the protein ferritin. Ferritin 
sequesters iron atoms in the cytoplasm of cells, thereby protecting 
the cells from the toxic effects of excess free metal. The translation of 
ferritin mRNAs is regulated by a specific repressor, called the iron 
regulatory protein (IRP), whose activity depends on the concentra- 
tion of unbound iron in the cell. At low iron concentrations, IRP 
binds to a specific sequence in the 5’ UTR of the mRNA called the 
iron-response element (IRE) (FIGURE 12.60). The bound IRP physi- 


FIGURE 12.59 A model for the mechanism of translational activation of mRNAs following fertilization of a Xenopus egg. Maternally contributed 
messenger RNAs in the egg are maintained in the cytoplasm in an inactive state by a combination of their short poly(A) tails and a bound inhibitory protein 
called Maskin. Maskin is tethered at one surface to CPEB, a protein that binds to sequences in the 3’ UTR of specific mRNAs, and at another surface to the 
cap-binding protein elF4E. Following fertilization, CPEB is phosphorylated, which displaces Maskin. The phosphorylated version of CPEB recruits another 
protein CPSF, which recruits poly(A) polymerase (PAP), an enzyme that adds adenosine residues to the poly(A) tail. The elongated poly(A) tail serves as a binding 
site for PABP molecules, which help to recruit elF4G, an initiation factor required for translation. As a result of these changes, the mRNA is actively translated. 


Source: From R. D. Mendez and J. D. Richter, Nature Reviews Mol. Cell. Biol. 2:514, 2001, copyright 2001, Nature Reviews Molecular Cell Biology by Nature 
Publishing Group. Reproduced with permission of Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 
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FIGURE 12.60 5’ UTR control of ferritin mRNA translation. When iron 
concentrations are low, an iron-binding repressor protein, called the iron 
regulatory protein (IRP), binds to a specific sequence in the 5’ UTR of the 
ferritin mRNA, called the iron-response element (IRE), which is folded into a 
hairpin loop. When iron becomes available, it binds to the IRP, changing its 
conformation and causing it to dissociate from the IRE, allowing translation 
of the mRNA to form ferritin. 


cally interferes with the binding of a ribosome to the 5’ end of the 
mRNA, thereby inhibiting the initiation of translation. At high iron 
concentrations, the IRP is modified such that it loses its affinity for 
the IRE. Dissociation of the IRP from the ferritin mRNA gives the 
translational machinery access to the mRNA, and the encoded pro- 
tein is synthesized. 

For prokaryotes, the process of translation initiation is more 
complex because there are multiple start codons in a polycistronic 
message and ribosomes must find the correct start sites. From a reg- 
ulation standpoint, one would predict that all the open reading 
frames in a polycistronic mRNA should be translated at equal levels. 
However, this is not the case. The mechanism used to identify bona 
fide start codons is the presence of a Shine-Dalgarno sequence just 
upstream of start codons. As discussed on page 442, this sequence is 
complementary to a region in 16S rRNA within the small subunit of 
the ribosome. Base pairing between the Shine-Dalgarno sequence 
and 16S rRNA places the small subunit just upstream ofa start codon 
allowing entry of the large subunit and initiation of translation. For 
some mRNAs, the Shine-Dalgarno sequence is perfectly comple- 
mentary to 16S rRNA, and therefore initiation is highly efficient. For 
other mRNAs, the Shine-Dalgarno sequence may not be exactly 
complementary to 16S rRNA, so initiation of translation will be less 
efficient. In this way, the amounts of protein produced from each 
open reading frame within a polycistronic mRNA can be regulated. 
For example, perhaps enzyme A in the trp operon is needed at much 
higher levels than the other four. A perfect Shine-Dalgarno sequence 
upstream of the start codon for enzyme A and imperfect pairing for 
the other four would allow more efficient initiation for this open 
reading frame compared to the others. 


Cytoplasmic Localization of mRNAs 


In eukaryotic cells, the localization of specific mRNAs to specific 
cytoplasmic regions is a widely utilized mechanism by which 
cells establish functionally distinct cytoplasmic domains. Recent 
innovations in live-cell imaging techniques are allowing researchers 
to follow the movements of specific mRNAs out of the nucleus and 
through the cytoplasm. We will briefly consider the fruit fly, whose 
egg, larval, and adult stages are illustrated in Figure 12.61a. The 
development of the anterior-posterior (head-abdomen) axis of a 
larval fly and subsequent adult is foreshadowed by localization of 
specific mRNAs along this same axis in the oocyte. For example, 
mRNAs transcribed during oogenesis from the bicoid gene become 
preferentially localized at the anterior end of the oocyte, while 
mRNAs transcribed from the oskar gene become localized at the 
opposite end (FIGURE 12.61b,c). The mRNAs are subsequently trans- 
lated at the site of localization where the newly synthesized protein 
accumulates. The protein encoded by bicoid mRNA plays a critical 
role in the development of the head and thorax, whereas the protein 
encoded by oskar mRNA is required for the formation of germ cells, 
which develop at the posterior end of the larva. 

The information that governs the cytoplasmic localization of 
these mRNAs resides in the 3’ UTR. This can be demonstrated using 
fruit flies that carry a foreign gene whose coding region is fused to a 
DNA sequence containing the 3’ UTR of either the bicoid or oskar 
mRNA. When the foreign gene is transcribed during oogenesis, the 
mRNA becomes localized in the site determined by the 3’ UTR. 
Localization of mRNAs is mediated by RNA-binding proteins that 
recognize localization sequences (called zip codes) in this region of 
the mRNA. 

Microtubules, and the motor proteins that use them as tracks, 
play a key role in transporting mRNA-containing particles to specific 
locations. The localization of oskar mRNAs in a fruit fly oocyte, for 
example, is disrupted by agents such as colchicine that depolymerize 
microtubules and by mutations that alter the activity of the kinesin I 
motor protein. Microfilaments, on the other hand, are thought to 
anchor mRNAs after they have arrived at their destination. During 
the process of localization, translation of the mRNAs is specifically 
inhibited by associated proteins. 

The localization of mRNAs is not restricted to eggs and oocytes 
but occurs in all types of polarized cells. For instance, actin mRNAs 
are localized near the leading edge of a migrating fibroblast, which is 
the site where actin molecules are needed for locomotion (Figure 
12.61d). Studies of large numbers of genes suggest that approxi- 
mately 70 percent of all mRNA are localized to specific regions 
of the cell, indicating that mRNA localization may play a far more 
general role in regulating gene function than had previously been 
suspected. 


The Control of mRNA Stability 


The longer an mRNA is present in a cell, the more times it can serve 
as a template for synthesis of a polypeptide. Ifa cell is to control gene 
expression, it is just as important to regulate the survival of an 
mRNA as it is to regulate the synthesis of that mRNA in the first 
place. Unlike prokaryotic mRNAs, which begin to be degraded at 
their 5’ end even before their 3’ end has been completed, most 
eukaryotic mRNAs are relatively long-lived. Even so, the half-life of 
eukaryotic mRNAs is quite variable. The FOS mRNA, for example, 
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FIGURE 12.61 Cytoplasmic localization of mRNAs. (a) Schematic 
drawings showing three stages in the life of a fruit fly: the egg, larva, and 
adult. Segments of the thorax and abdomen are indicated. (b) Localization 
of bicoid mRNA at the anterior pole of an early cleavage stage of a fly 
embryo by in situ hybridization. (c) Localization of oskar mRNA at the 
posterior pole of a comparable stage to that shown in b. Both of these 
localized RNAs play an important role in the development of the anterior— 
posterior axis of the fruit fly. (d) Localization of B-actin mRNA (red) near the 
leading edge of a migrating fibroblast. This is the region of the cell where 
actin is utilized during locomotion (see Figure 9.61). 

Source: (6) Courtesy of Daniel St Johnston; (c) Courtesy of Antoine Guichet 
and Anne Ephrussi, EMBL, Heidelberg, Germany; (d) From V. M. Latham 
et al., Courtesy Robert H. Singer, Curr. Biol. 11:1010, 2001, Fig 4d, with 
permission from Elsevier. 
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(half-life of 10 to 30 minutes). In contrast, mRNAs that encode the 
production of the dominant proteins of a particular cell, such as 
hemoglobin in an erythrocyte precursor or ovalbumin in a cell of a 
hen’s oviduct, typically have half-lives of more than 24 hours. Thus, as 
with mRNA localization or the rate of initiation of mRNA translation, 
specific mRNAs can be recognized by the cell's regulatory machin- 
ery and given differential treatment. 

Unless protected by mechanisms such as those used in unferti- 
lized eggs (page 505), mRNAs with short or absent poly(A) tails are 
rapidly degraded. This suggests that the longevity of an mRNA is 
related to the length of its poly(A) tail. When a typical mRNA leaves 
the nucleus, it contains a tail of approximately 200 adenosine 
residues (FIGURE 12.62a, step 1). As an MRNA remains in the cyto- 
plasm, its poly(A) tail tends to be gradually reduced in length as it is 
nibbled away by a type of exonuclease known as a deadenylase. No 
dramatic effect on mRNA stability is observed until the tail becomes 
shortened to approximately 30 residues (step 2). Once the tail is 
shortened to this length, the mRNA is usually degraded rapidly 
by either of two pathways. In one of these pathways (shown in 
Figure 12.62a), degradation of the mRNA begins at its 5’ end 
following removal of the remaining poly(A) at the 3’ end of the mes- 
sage. The fact that the poly(A) tail at the 3’ end of the message pro- 
tects the cap at the 5’ end of the molecule suggests that the two ends 
of the mRNA are held in close proximity (see Figure 11.45). Once 
the 3’ tail is removed (step 3, Figure 12.62a), the message is decapped 
(step 4) and degraded from the 5’ end toward the 3’ end (step 5). 
Deadenylation, decapping, and 5’ — 3’ degradation occur within 
small transient cytoplasmic granules called P-bodies (Figure 
12.62c). In addition to destroying “unwanted” mRNAs, P-bodies can 
also act as sites where mRNAs that are no longer being translated are 
stored temporarily. In the alternate mRNA degradation pathway 
shown in Figure 12.62b, removal of the poly(A) tail (step 3a) is fol- 
lowed by continued digestion of the mRNA from its 3’ end (step 4a). 
The digestion of mRNAs in the 3’ > 5’ direction is carried out by an 
exonuclease that is part of a complex of 3’ > 5’ exonucleases called 
the exosome. 

There must be more to mRNA longevity than simply the length 
of the poly(A) tail, as mRNAs having very different half-lives begin 
with a similar-sized tail. Once again, differences in the nucleotide 
sequence of the 3’ UTR have been shown to play a role in the rate at 
which the poly(A) tail becomes shortened. The 3’ UTR of a globin 
mRNA, for example, contains a number of CCUCC repeats that 
serve as binding sites for specific proteins that stabilize the mRNA. If 
these sequences are mutated, the mRNA is destabilized. In contrast, 
short-lived mRNAs often contain AU-rich elements (e.g., AUUUA 
repeats) in their 3’ UTR that destabilize the message. If one of these 
destabilizing sequences is introduced into the 3’ UTR of a globin 
gene, the stability of the mRNA transcribed from the modified gene 
is reduced from a half-life of 10 hours to a half-life of 90 minutes. 
The importance of these destabilizing sequences (and the overall 
mRNA instability they produce) can be appreciated by considering 
the normally short-lived FOS mRNA mentioned above. If the desta- 
bilizing sequence from the FOS gene is lost through a deletion, the 
half-life of FOS mRNA increases, and the cells often become malig- 
nant. Destabilizing sequences in the 3’ UTR are thought to serve as 
binding sites both for proteins (e.g., AUF1) and microRNAs that 
bring about the deadenylation and subsequent destruction of the 
mRNA, as discussed below. 
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FIGURE 12.62 mRNA degradation in mammalian cells. (a, b) The 
steps depicted in these drawings are described in the text. (c) Fluorescence 
micrograph showing P-bodies (yellow) in the cytoplasm of HeLa cells. The 
P-bodies are revealed as the site of localization of GFP-DCP1, a protein 
involved in mRNA decapping. Nuclei are red. 

Source: (c) From F. Tritschler et. al., Nature Revs. Mol. Cell Biol. 11:380, 
2010; box 1. Reprinted by permission of Macmillan Publishers Ltd. Courtesy 
of D. Lazaretti, Max Planck Institute, Tuebigen. 


REVIEW | 


1. Describe three different ways in which gene expression 
might be controlled at the translational level. Cite an 
example of each of these control mechanisms. 


2. What is the role of poly(A) in mRNA stability? How 
might the cell regulate the stability of different mRNAs? 


3. Describe the different levels at which gene 
expression is regulated to allow a B-globin gene with 
the following structure to direct the formation of a 
protein that accounts for over 95 percent of the 
protein of the cell. 


exon 3 
e- 


intron—exon 2—intron 


exon 1 


12.20 The Role of MicroRNAs 
in Translational Control 


Proteins are not the only molecules that can act as regulators of 
mRNA translation and stability. The formation and mechanism of 
action of microRNAs was discussed in Chapter 11 (page 434). Like 
most of the proteins we have been discussing that regulate mRNA 
translation and stability, miRNAs act primarily by binding to sites in 
the 3’ UTR of their target mRNAs. It is becoming increasingly appar- 
ent that miRNAs are important regulators of virtually every biologi- 
cal process. Even the earliest stages of embryonic development 
require the involvement of miRNAs, as evidenced by the fact that 
animals lacking the miRNA-producing enzyme Dicer fail to develop 
beyond gastrulation. Similarly, when Dicer is absent only from a 
particular tissue, the development of the cells of that tissue displays 
obvious abnormalities. It is also becoming apparent that abnormali- 
ties in miRNA levels play a major role in the development of many 
common diseases. 

MicroRNAs are thought to exert their regulatory activity by 
multiple mechanisms as depicted in FIGURE 12.63. 


1. Most recent work supports a model that was first discovered in 
zebrafish embryos that showed that miR-430 functioned to rid 
embryos of maternal mRNAs by inducing deadenylation and 
decay. In this model, miRNA pairing recruits exonucleases to the 
3’ end of an mRNA target, resulting in shortening of the poly(A) 
tail followed by degradation. As discussed on page 505, many 
maternal mRNAs are stored for use as embryos develop, but as 
those embryos begin to transcribe their own genomes, maternal 
mRNAs are actively destroyed. miR-430 is extremely abundant 
during this transition in zebrafish embryos, and this one miRNA 
binds to hundreds of mRNAs, accelerating their decay. This 
mechanism of miRNA-mediated mRNA degradation has since 
been demonstrated in numerous other organisms, including 
mice and humans, and is especially prevalent in plant cells. The 
widespread occurrence of this pathway of mRNA degradation is 
best revealed by comparing the numbers of particular mRNAs in 
cells under different conditions. It is generally found that mRNA 
levels are inversely correlated with the levels of complementary 
miRNAs. In other words, when a particular miRNA is intro- 
duced into a population of cells, the mRNAs that possess binding 


Deadenylation 
(followed by decapping and degradation) 


ORF 


o= 


‘Ss 


ORF 


a 


Initiation block 
(repressed cap recognition or 60S joining) 


Ue aN 


ORF AAAAA 


Proteolysis 
(degradation of nascent peptide) 


By ge B.. 


ae 


x So 


miRNP binding 


Elongation block 
(slowed elongation or ribosome ‘drop-off’) 


” ie 


FIGURE 12.63 miRNA mediated gene silencing. miRNAs, as part of an miRNP protein complex as illustrated in Figure 11.35, pair with sequence 
elements in the 3’ UTR of target mRNAs. They suppress gene expression posttranscriptionally by either promoting mRNA deadenylation and degradation 
(upper left), inhibiting the initiation of translation (lower left), inhibiting translation elongation (lower right), or possibly activating degradation of nascent 
peptides (upper right). Some mRNA: miRNA pairs may be stored in cytoplasmic P-bodies and, upon depression by removal of miRNAs, exit P-bodies and 
resume translation. ORF (open reading frame) corresponds to the amino acid coding segment of the mRNA. 


Source: W. Filipowicz, et al., Nat. Revs. Gen. 9, 109, 2008, Fig. 3. Nature Reviews Genetics by Nature Publishing Group. Reproduced with permission of 
Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 


sites for that miRNA drop in number. Conversely, when the 
numbers of a particular miRNA are experimentally depleted, the 
abundance of mRNAs with binding sites for that miRNA concur- 
rently increases. 


2. A body of evidence also suggests that miRNAs can act by inhibit- 
ing the translation of mRNAs. Most evidence points to miRNAs 
acting at the step where translation is initiated, but they may also 
act to block translation elongation. 


3. Several early studies suggest that miRNAs may recruit proteases 
that degrade nascent proteins during translation. 


In addition to playing an important role in overall gene regulation, 
miRNAs also appear to be important mediators of stress responses. 
One way in which miRNAs might facilitate a rapid response to stress 
is by blocking translation of specific mRNAs and holding those 
mRNAs in place until some external cue or stress releases the inhibi- 
tion and allows translation to resume. In this model, repressed mRNAs 
are held in cytoplasmic P-bodies and under specific conditions leave 
the P-body and resume translation (Figure 12.63). 


REVIEW | 


1. Describe some of the ways in which miRNAs might 
regulate gene expression. 


———— 


12.21 Posttranslational Control: 
Determining Protein Stability 


We have seen how cells possess elaborate mechanisms to control the 
rates at which proteins are synthesized. It is not unexpected that cells 
would also possess mechanisms to control the length of time that 
specific proteins survive once they are fully functional. Although the 
subject of protein stability does not fall technically under the head- 
ing of control of gene expression, it is a logical extension of that 
topic. Pioneering studies in the area of selective protein degradation 
were carried out by Avram Hershko and Aaron Ciechanover in Israel 
and Irwin Rose and Alexander Varshavsky in the United States. 

Degradation of cellular proteins is carried out within hollow, 
cylindrical, protein-degrading machines called proteasomes that 
are found in both the nucleus and cytosol of cells. Proteasomes con- 
sist of four rings of polypeptide subunits stacked one on top of the 
other with a cap attached at either end of the stack (FIGURE 12.64a, b). 
The two central rings consist of polypeptides (f subunits) that func- 
tion as proteolytic enzymes. The active sites of these subunits face 
the enclosed central chamber, where proteolytic digestion can occur 
in a protected environment. 

Proteasomes digest proteins that have been specifically selected 
and marked for destruction as described below. Some proteins are 
selected because they are recognized as abnormal—either misfolded 
or incorrectly associated with other proteins. Included in this group 
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FIGURE 12.64 Proteasome structure and function. (a) High-resolution electron micrograph of an isolated Drosophila proteasome. (b) Model of a 
proteasome based on high-resolution electron microscopy and X-ray crystallography. Each proteasome consists of two multisubunit caps (or regulatory 
particles) on either end of a tunnel-shaped cylinder (or core particle) that is formed by a stack of four rings. Each ring consists of seven subunits that are 
divided into two classes, a-type and f-type. The two inner rings are composed of B subunits, which surround a central chamber. The subunits are drawn in 
different shades of color because they are similar, but not identical, polypeptides. Three of the seven f subunits in each ring possess proteolytic activity; the 
other four are inactive in eukaryotic cells. The two outer rings are composed of enzymatically inactive « subunits that form a narrow opening (approximately 
13 A) through which unfolded polypeptide substrates are threaded to reach the central chamber, where they are degraded. (c) Steps in the degradation of 
proteins by a proteasome. In step 1, the protein to be degraded is linked covalently to a string of ubiquitin molecules. Ubiquitin attachment requires the 
participation of three distinct enzymes (E1, E2, and E3) in a process that is not discussed in the text. In step 2, the polyubiquitinated target protein binds to 


a subunit 
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the cap of the proteasome. The ubiquitin chain is then removed, and the unfolded polypeptide is threaded into the central chamber of the proteasome 
(step 3), where it is degraded by the catalytic activity of the B subunits (steps 4 and 5). 


Source: (a) Courtesy H. Hélzl and Wolfgang Baumeister, J. Cell Biol. 150:126, 2000. Reproduced with permission of the Rockefeller University Press. 


are abnormal proteins that had been produced on membrane-bound 
ribosomes of the rough ER (page 265). The selection of “normal” 
proteins for proteasome destruction is based on the protein’s biologi- 
cal stability. Every protein is thought to have a characteristic longe- 
vity or half-life. Some protein molecules, such as the enzymes of 
glycolysis or the globin molecules of an erythrocyte, are present for 
days to weeks. Other proteins that are required for a specific, fleeting 
activity, such as regulatory proteins that initiate DNA replication or 
trigger cell division, may survive only a few minutes. The destruc- 
tion of such key regulatory proteins by proteasomes plays a crucial 
role in the progression of cellular processes (as illustrated in Figure 
14.26). Velcade, a drug that specifically inhibits proteasomal diges- 
tion, has been approved for treatment of some forms of cancer. 

The factors that control a protein's lifetime are not well under- 
stood. One of the determinants is the specific amino acid that resides 
at the N-terminus of a polypeptide chain. Polypeptides that termi- 
nate in arginine or lysine, for example, are typically short-lived. A 
number of proteins that act at specific times within the cell cycle are 
marked for destruction when certain residues become phosphoryl- 
ated. Still other proteins carry a specific internal sequence of amino 
acids called a degron that ensures they will not survive long within 
the cell. 

Ubiquitin is a small, highly conserved protein with a number 
of functions in diverse cellular processes. We saw on page 300, for 
example, that membrane proteins bearing a single attached ubiqui- 
tin molecule are selectively incorporated into endocytic vesicles. 
A single attached ubiquitin functions primarily as a sorting signal. In 


contrast, proteins are targeted for destruction by attachment of a 
polyubiquitin chain consisting of multiple ubiquitin molecules 
linked covalently to one another (step 1, Figure 12.64c). In the first 
stage of this process, ubiquitin is transferred enzymatically from a 
carrier protein to a lysine residue on the condemned protein. 
Enzymes that transfer ubiquitin to target proteins comprise a huge 
family of ubiquitin ligases in which different members recognize 
proteins bearing different degradation signals. These enzymes play a 
crucial role in determining the life or death of key proteins and are a 
focus of current research. 

Once it is polyubiquitinated, a protein is recognized by the 
cap of the proteasome (step 2, Figure 12.64c), which removes the 
polyubiquitin chain and unfolds the target protein using energy pro- 
vided by ATP hydrolysis. The unfolded, linear polypeptide is then 
threaded through the narrow opening in the ring of « subunits and 
passed into the central chamber of the proteasome (step 3), where it 
is digested into small peptides (steps 4 and 5). The peptide products 
are released back into the cytosol where they are degraded into their 
component amino acids. 


REVIEW | 


1. What are the different roles of a single ubiquitin 
versus multiple ubiquitins when attached to a 
target protein? 


eo ___ 


Analytic Questions RA NN — 


. Methylation of K9 of histone H3 (by an enzyme SUV39H1) is asso- 


10. 


11. 


12. 


ciated with heterochromatinization and gene silencing. It has 
been reported that methylation of H3 by other enzymes can lead 
to transcriptional activation. How can methylation lead to oppo- 
site effects? 


. How many copies of each type of core histone would it take to 


wrap the entire human genome into nucleosomes? How has 
evolution solved the problem of producing such a large number 
of proteins in a relatively short period of time? 


. Suppose you discovered a temperature-sensitive mutant whose 


nucleus failed to accumulate certain nuclear proteins at an ele- 
vated (restrictive) temperature but continued to accumulate 
other nuclear proteins. What conclusions might you draw about 
nuclear localization and the nature of this mutation? 


. Humans born with three X chromosomes but no Y chromosomes 


often develop into females of normal appearance. How many 
Barr bodies would you expect the cells of these women to 
have? Why? 


. Suppose that X-inactivation were not a random process, but 


always led to the inactivation of the X chromosome derived from 
the father. What effect would you expect this to have on the phe- 
notype of females? 


. The chromosomes shown in Figure 12.22b were labeled by 


incubating the preparation with DNA fragments known to be 
specific for each of the chromosomes. Suppose one of the 
chromosomes in the field contained regions having two different 
colors. What might you conclude about this chromosome? 


. What advantage might be gained by having transcripts synthe- 


sized and processed in certain regions of the nucleus rather than 
randomly throughout the nucleoplasm? (See Quantitative Tutorial 
Video). 


. Compare and contrast the effect of a deletion in the operator of 


the lactose operon with one in the operator of the tryptophan 
operon. 


. If you were to find a mutant of E. coli that produced continuous 


polypeptide chains containing both B-galactosidase and galac- 
toside permease (encoded by the y gene), how might you 
explain how this happened? 


You suspect that a new hormone you are testing functions 
to stimulate myosin synthesis by acting at the transcriptional 
level. What type of experimental evidence would support this 
contention? 


Suppose you had conducted a series of experiments in which 
you had transplanted nuclei from several different adult tissues 
into an activated, enucleated mouse egg and found that the egg 
did not develop past the blastocyst stage. Could you conclude 
that the transplanted nucleus had lost genes that were required 
for postblastula development? Why or why not? What does this 
type of experiment tell you more generally about interpreting 
negative results? 


It was noted on page 493 that DNA footprinting allows isolation 
of DNA sequences that bind specific transcription factors. 


13. 


14. 


15: 


16. 


17. 


18. 


19. 


20. 


Describe an experimental protocol to identify transcription factors 
that bind to an isolated DNA sequence. (You might consider the 
techniques discussed in Section 18.11.) 


How do you explain why enhancers can be moved around within 
the DNA without affecting their activity, whereas the TATA box 
can only operate at one specific site? 


Suppose that you are working with a cell that exhibits a very low 
level of protein synthesis, and you suspect the cells are subject 
to a global translational-control inhibitor. What experiment 
might you perform to determine whether this is the case? 


The signal sequences that direct the translocation of proteins 
into the endoplasmic reticulum are cleaved by a signal pepti- 
dase, whereas the NLSs and NESs required for movement of a 
protein into or out of the nucleus remain as part of that protein. 
Consider a protein such as hnRNPA1, which is involved in the 
export of mRNA to the cytoplasm. Why is it important that 
the transport signal sequences for this protein remain as part 
of the protein, whereas the signal sequence for ER proteins can 
be cleaved? 


When methylated DNA is introduced into cultured mammalian 
cells, it is generally transcribed for a period before it becomes 
repressed. Why would you expect this type of delay before inhi- 
bition of transcription would occur? 


Suppose you had isolated a new transcription factor and wanted 
to know which genes this protein might regulate. Is there any 
way that you could use a cDNA microarray of the type shown in 
Figure 12.35 to approach this question? (Note: The microarray 
of Figure 12.35 contains the DNA of protein-coding regions, 
unlike that of Figure 12.46). 


Although several different mammalian species have been 
cloned, the efficiency of this process is extremely low. Often tens 
or even hundreds of oocytes must be implanted with donor 
nuclei to obtain one healthy live birth. Many researchers believe 
the difficulties with cloning reside in the epigenetic modifica- 
tions, such as DNA and histone methylation, that occur within 
various cells during an individual's life. How do you suspect such 
modifications might affect the success of an experiment such as 
that depicted in Figure 12.32? 


A study in a British medical journal found there was a correlation in 
telomere length between fathers and their daughters and between 
mothers and both their sons and daughters but not between 
fathers and their sons. How can you explain this finding? 


Some scientific reports are best described as correlations, in that 
they report on two events or conditions that tend to accompany 
one another. Correlations are often interpreted as evidence of a 
causal relationship between the two events or conditions. Other 
scientific reports involve experimental intervention and gener- 
ally make a stronger case for a causal relationship. Pick one sci- 
entific conclusion that is put forth in this chapter that is based on 
both types of evidence. Which type of report do you find more 
convincing? 


SUOIISENH IAjeuYy 


512 


CHAPTER 


DNA Replication and Repair 


BRCA1 TESTING AND BREAST 
CANCER RISK 


— 


ae time our cells divide, they need to make an exact 
copy of our genome. Mistakes in the genome sequence can 
start cells down the road to becoming cancer by mutating 
genes that control cell division. It is thus critical to copy our 
genomes accurately when cells divide and to fix any DNA 
damage that might change the sequence of our genome. 
DNA damage can take place in response to sunlight or 
radiation, but it can also happen randomly without any 
outside cause. No matter the cause, our cells need to repair Source: (left-hand) Eamonn M. McCormack/Getty Images, Inc.; 

the damage, and if they can’t, then our genomes accumu- (right-hand) Evan Oto/Science Source Images 

late mutations or chromosome rearrangements that can 

lead straight to cancer. Women with particular alleles of the BRCA71 gene involved in DNA repair have a three to seven times 
higher risk of developing breast cancer (the abbreviation “BRCA” stands for “Breast Cancer”). This discovery led to simple 
genetic testing for BRCA1 abnormalities, providing an early warning system for high breast cancer risk. But this test creates a 
situation in which many individuals find out that they have an elevated risk for breast cancer even though at the time they do not 
actually have cancer. What can someone do if she finds out she is at risk? One extreme approach is to have both breasts removed 
surgically, a method known as preventive mastectomy. This surgical approach does indeed reduce the risk of breast cancer by 
90 percent for at-risk patients. A less invasive option is for patients with BRCA1 genetic abnormality to undergo more frequent 
screening to catch any cancers that might develop at an earlier, more treatable stage. As a society, we are still learning how 
best to make use of genetic diagnosis of risk. It is a complex decision that each individual must make for herself, but it helps 
for the public to understand the issues and risks. In the case of BRCA1 genetic testing, the issue was put dramatically before 
the public by actress Angelina Jolie, who elected to have preventive mastectomy after finding out that she had a BRCA1 
genetic abnormality. You can read about her decision in an article that she wrote for the New York Times: http://www.nytimes 
.com/2013/05/14/opinion/my-medical-choice.html?_r=2&. In this chapter we will see how DNA is replicated with a high level of 
precision to avoid mistakes and how mistakes or damage to our DNA are repaired. 


CHAPTER OUTLINE 


13.1. DNA Replication 13.4 The Structure and Functions of 13.8 DNA Repair 

13.2 DNA Replication in Bacterial DNA Polymerases 13.9 Between Replication and Repair 
Cells 13.5. Replication in Viruses 13.10 THE HUMAN PERSPECTIVE: 

13.3 The Machinery Operating at the 13.6 DNA Replication in Eukaryotic Cells Consequences of DNA Repair 
Replication Fork 13.7 Chromatin Structure and Replication Deficiencies 


13.1 DNA Replication 


Reproduction is a fundamental property of all living systems. The 
process of reproduction can be observed at several levels: Organisms 
duplicate by asexual or sexual reproduction, cells duplicate by cellu- 
lar division, and the genetic material duplicates by DNA replica- 
tion. The machinery that replicates DNA is also called into action in 
another capacity: to repair the genetic material after it has sustained 
damage. These two processes—DNA replication and DNA repair— 
are the subjects addressed in this chapter. 

The capacity for self-duplication is presumed to have been one 
of the first critical properties to have appeared in the evolution of the 
earliest primitive life forms. Without the ability to propagate, any 
primitive assemblage of biological molecules would be destined for 
oblivion. The early carriers of genetic information were probably 
RNA molecules that were able to self-replicate. As evolution pro- 
gressed and RNA molecules were replaced by DNA molecules as the 
genetic material, the process of replication became more complex, 
requiring a large number of auxiliary components. Thus, although a 
DNA molecule contains the information for its own duplication, it 
lacks the ability to perform the activity itself. As Richard Lewontin 
expressed it, “the common image of DNA as a self-replicating mol- 
ecule is about as true as describing a letter as a self-replicating docu- 
ment. The letter needs a photocopier; the DNA needs a cell.” Let us 
see then how the cell carries out this activity. 

The proposal for the structure of DNA by Watson and Crick in 
1953 was accompanied by a suggested mechanism for its “self- 
duplication.” The two strands of the double helix are held together 
by hydrogen bonds between the bases. Individually, these hydrogen 
bonds are weak and readily broken. Watson and Crick envisioned 
that replication occurred by gradual separation of the strands of the 
double helix (FIGURE 13.1), much like the separation of two halves 
of a zipper. Because the two strands are complementary to each 
other, each strand contains the information required for construc- 
tion of the other strand. Thus once the strands are separated, each 
can act as a template to direct the synthesis of the complementary 
strand and restore the double-stranded state. 

The Watson-Crick proposal shown in Figure 13.1 made certain 
predictions concerning the behavior of DNA during replication. 
According to the proposal, each of the daughter duplexes should 
consist of one complete strand inherited from the parental duplex 
and one complete strand that has been newly synthesized. Replication 
of this type (FIGURE 13.2, scheme 1) is said to be semiconservative 
because each daughter duplex contains one strand from the parent 
structure. In the absence of information on the mechanism respon- 
sible for replication, two other types of replication had to be consid- 
ered. In conservative replication (Figure 13.2, scheme 2), the two 
original strands would remain together (after serving as templates), 
as would the two newly synthesized strands. As a result, one of the 
daughter duplexes would contain only parental DNA, while the 
other daughter duplex would contain only newly synthesized DNA. 
In dispersive replication (Figure 13.2, scheme 3), the parental strands 
would be broken into fragments, and the new strands would be syn- 
thesized in short segments. Then the old fragments and new segments 
would be joined together to form a complete strand. As a result, the 
daughter duplexes would contain strands that were composites of 
old and new DNA. At first glance, dispersive replication might seem 
like an unlikely solution, but it appeared to Max Delbriick at the time 
as the only way to avoid the seemingly impossible task of unwinding 


Old Old 


Old New 


New Old 


FIGURE 13.1 The original Watson-Crick proposal for the replication of 
a double-helical molecule of DNA. During replication, the double helix 
unwinds, and each of the parental strands serves as a template for the 
synthesis of anew complementary strand. As discussed in this chapter, 
these basic tenets have been borne out. 


two intertwined strands of a DNA duplex as it replicated (discussed 
on page 517). 

To decide among these three possibilities, it was necessary to 
distinguish newly synthesized DNA strands from the original DNA 
strands that served as templates. This was accomplished in studies 
on bacteria in 1957 by Matthew Meselson and Franklin Stahl of the 
California Institute of Technology who used heavy ('°N) and light 
('4N) isotopes of nitrogen to distinguish between parental and newly 
synthesized DNA strands (FIGURE 13.3). These researchers grew 
bacteria in medium containing '"N-ammonium chloride as the sole 
nitrogen source. Consequently, the nitrogen-containing bases of the 
DNA of these cells contained only the heavy nitrogen isotope. 
Cultures of “heavy” bacteria were washed free of the old medium 
and incubated in fresh medium with light, 4N-containing com- 
pounds, and samples were removed at increasing intervals over a 
period of several generations. DNA was extracted from the samples 
of bacteria and subjected to equilibrium density-gradient centrifu- 
gation (see Figure 18.38). In this procedure, the DNA is mixed with 
a concentrated solution of cesium chloride and centrifuged until the 
double-stranded DNA molecules reach equilibrium according to 
their density. 

In the Meselson-Stahl experiment, the density of a DNA mole- 
cule is directly proportional to the percentage of '°N or ‘4N atoms it 
contains. If replication is semiconservative, one would expect that 
the density of DNA molecules would decrease during culture in the 
“*N-containing medium in the manner shown in the upper set of 
centrifuge tubes of Figure 13.3a. After one generation, all DNA mol- 
ecules would be '°"N-“N hybrids, and their buoyant density would 
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DNA molecules from 
1st generation progeny 


DNA molecules from 
2nd generation progeny 


Semiconservative replication 


2) 


Parental DNA 
molecules 


DNA molecules from 
1st generation progeny 


DNA molecules from 
2nd generation progeny 


Conservative replication 


Dispersive replication 


3] 


Parental DNA 
molecules 


DNA molecules from 
1st generation progeny 


DNA molecules from 
2nd generation progeny 


FIGURE 13.2 Three alternate schemes of replication. Semiconservative 
replication is depicted in scheme 1, conservative replication in scheme 2, 
and dispersive replication in scheme 3. A description of the three alternate 
modes of replication is given in the text. 


be halfway between that expected for totally heavy and totally light 
DNA (Figure 13.3a). As replication continued beyond the first gen- 
eration, the newly synthesized strands would continue to contain 
only light isotopes, and two types of duplexes would appear in the 


gradients: those containing '°N-'“N hybrids and those containing 
only '“N. As the time of growth in the light medium continued, a 
greater and greater percentage of the DNA molecules present would 
be light. However, as long as replication continued semiconserva- 
tively, the original heavy parental strands would remain intact and 
present in hybrid DNA molecules that occupied a smaller and 
smaller percentage of the total DNA (Figure 13.3a). The results of 
the density-gradient experiments obtained by Meselson and Stahl 
are shown in Figure 13.3b, and they demonstrate unequivocally that 
replication occurs semiconservatively. The results that would have 
been obtained if replication occurred by conservative or dispersive 
mechanisms are indicated in the two lower sets of centrifuge tubes of 
Figure 13.3a.' 

Interestingly, the Meselson-Stahl experiment succeeded because 
of a fortuitous experimental artifact. The cell cultures used in the 
experiment were asynchronous, meaning that different cells start 
DNA replication at different times, so that when the shift was made 
from heavy to light nitrogen medium, some of the cells had already 
started replicating their DNA. The genome of bacterial cells is a sin- 
gle large DNA molecule. Starting with DNA containing heavy nitro- 
gen, the replicating genomic DNA will incorporate more and more 
light nitrogen during the experiment. Only if the genome is fully 
replicated in the presence of light nitrogen would all of the DNA be 
at exactly the hybrid density. In cells that had already started repli- 
cating when the medium was switched, the parts of the genome that 
had already replicated would remain heavy/heavy, while only the 
parts of the genome that replicated after the switch in medium would 
achieve the hybrid light/heavy density. This means that if the density 
of the whole genome was measured after giving enough time to rep- 
licate the genome once, only those cells that happened to just start 
replication when the medium was switched would have completely 
light/heavy genomes. The rest would have intermediate densities 
because part of their genome was light/heavy and part heavy/heavy. 
This would also be true during the next round of replication—the 
genome would gradually increase in its density from light/heavy to 
light/light as DNA replication continues. So if Meselson and Stahl 
had been able to measure the density of the intact genomic DNA of 
the cell, what they would have seen was a smear of densities that 
started out heavy/heavy and then continuously shifted to lighter and 
lighter densities until eventually reaching light/light. This would 
have looked exactly like the predicted outcome for dispersive repli- 
cation (Figure 13.3). Fortunately for Meselson and Stahl, they were 
inadvertently breaking their DNA into small pieces when loading it 
into the centrifuge chamber. Even if the whole genome is only grad- 
ually replicating, a given small piece of the genome is either repli- 
cated or it is not, and thus would either be heavy/heavy or heavy/ 
light, thus giving the clean step-wise increase in density that was 
observed. This shows us that even the most brilliant experimental 
designs can also benefit from a bit of good luck. 

During the same period of time, replication was demonstrated 
to occur semiconservatively in eukaryotes as well, using an entirely 
different experimental approach. The original experiments were 
carried out by J. Herbert Taylor of Columbia University, and in fact 
were published more than a year before the studies of Meselson and 


‘Anyone looking to explore the circumstances leading up to this heralded research 
effort and examine the experimental twists and turns as they unfolded might want to 
read the book Meselson, Stahl, and the Replication of DNA by Frederick Lawrence 
Holmes, 2001. A discussion of the experiment can also be found in PNAS 101:17889, 
2004, which is on the Web. 
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FIGURE 13.3 Experiment demonstrating that DNA replication in bacteria is semiconservative. DNA was extracted from bacteria at different stages in 
the experiment, mixed with a concentrated solution of the salt cesium chloride (CsCl), placed into a centrifuge tube, and centrifuged to equilibrium at high 
speed in an ultracentrifuge. Cesium ions have sufficient atomic mass to be affected by the centrifugal force, and they form a density gradient during the 
centrifugation period with the lowest concentration (lowest density) of Cs at the top of the tube and the greatest concentration (highest density) at the 
bottom of the tube. During centrifugation, DNA fragments within the tube become localized at a position having a density equal to their own density, which 
in turn depends on the ratio of '°N/"4N that is present in their nucleotides. The greater the '4N content, the higher in the tube the DNA fragment is found at 
equilibrium. (a) The results expected in this type of experiment for each of the three possible schemes of replication. The single tube on the left indicates 
the position of the parental DNA and the positions at which totally light or hybrid DNA fragments would band. (b) Experimental results obtained by 
Meselson and Stahl. The appearance of a hybrid band and the disappearance of the heavy band after one generation eliminates conservative replication. 
The subsequent appearance of two bands, one light and one hybrid, eliminates the dispersive scheme. 


Source: (b) From M. Meselson and F. Stahl, Proc. Nat'l. Acad. Sci. U.S.A. 44:671, 1958. Courtesy of Matthew Meselson. 


Stahl. The drawing and photograph of FIGURE 13.4 show the results 
of a more recent experiment in which cultured mammalian cells 
were allowed to undergo replication in bromodeoxyuridine (BrdU), 
a compound that is incorporated into DNA in place of thymidine. 
Following replication, a chromosome is made up of two chromatids. 
After one round of replication in BrdU, both chromatids of each 
chromosome contained BrdU (Figure 13.4a). After two rounds of 
replication in BrdU, one chromatid of each chromosome was com- 
posed of two BrdU-containing strands, whereas the other chromatid 
was a hybrid consisting of a BrdU-containing strand and a thymi- 
dine-containing strand (Figure 13.4a,b). The thymidine-containing 
strand had been part of the original parental DNA molecule prior to 
addition of BrdU to the culture. It is interesting to consider why 
Meselson and Stahl, who briefly cite Taylor in their landmark paper, 
are so widely viewed as being the first to demonstrate semiconserva- 
tive replication despite the fact that Taylor had already established 
this result a year earlier. The reason is likely that Meselson and Stahl’s 


approach probed the molecular nature of the replicated DNA more 
directly than Taylor’s microscopic approach, and their paper did a 
better job of putting the results into the context of the double helical 
structure of DNA. 


REVIEW | 


1. The original Watson-Crick proposal for DNA 
replication envisioned the continuous synthesis of 
DNA strands. How and why has this concept been 
modified over the intervening years? 


2. What does it mean that replication is 
semiconservative? How was this feature of replication 
demonstrated in bacterial cells? in eukaryotic cells? 


3. Why are there no heavy bands in the top three 
centrifuge tubes of Figure 13.3a? 


— — 
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FIGURE 13.4 Experimental demonstration that DNA replication occurs semiconservatively in eukaryotic cells. (a) Schematic diagram of the results of 
an experiment in which cells were transferred from a medium containing thymidine to one containing bromodeoxyuridine (BrdU) and allowed to complete 
two successive rounds of replication. DNA strands containing BrdU are shown in red. (b) The results of an experiment similar to that shown in a. In this 
experiment, cultured mammalian cells were grown in BrdU for two rounds of replication before mitotic chromosomes were prepared and stained by a 
procedure using fluorescent dyes and Giemsa stain. Using this procedure, chromatids containing thymidine within one or both strands stain darkly, whereas 
chromatids containing only BrdU stain lightly. The photograph indicates that, after two rounds of replication in BrdU, one chromatid of each duplicated 
chromosome contains only BrdU, while the other chromatid contains a strand of thymidine-labeled DNA. (Some of the chromosomes are seen to have exchanged 
homologous portions between sister chromatids. This process of sister chromatid exchange is common during mitosis but is not discussed in the text.) 


Source: (b) Courtesy of Sheldon Wolff. 


13.2 DNA Replication 
in Bacterial Cells 


We will focus in this section of the chapter on replication in bacterial 
cells, which is better understood than the corresponding process in 
eukaryotes. The early progress in bacterial research was driven by 
genetic and biochemical approaches including: 


e The availability of mutants that cannot synthesize one or another 
protein required for the replication process. The isolation of 
mutants unable to replicate their chromosome may seem 
paradoxical: How can cells with a defect in this vital process 
be cultured? This paradox was solved by the isolation of 
temperature-sensitive (ts) mutants, in which the deficiency 
only reveals itself at an elevated temperature, termed the 
nonpermissive (or restrictive) temperature. When grown at the 
lower (permissive) temperature, the mutant protein can function 
sufficiently well to carry out its required activity, and the cells 
can continue to grow and divide. Temperature-sensitive mutants 


have been isolated that affect virtually every type of physiologic 
activity (see also page 261), and they have been particularly 
important in the study of DNA synthesis as it occurs in 
replication, DNA repair, and genetic recombination. 


e The development of in vitro systems in which replication can be 


studied using purified cellular components. In some studies, the 
DNA molecule to be replicated is incubated with cellular extracts 
from which specific proteins suspected of being essential have 
been removed. In other studies, the DNA is incubated with a 
variety of purified proteins whose activity is to be tested. 


Taken together, these approaches have revealed the activity of 
more than 30 different proteins that are required to replicate the 
chromosome of E. coli. In the following sections, we will discuss the 
activities of several of these proteins whose functions have been 
clearly defined. Replication in bacteria and eukaryotes occurs by 
very similar mechanisms, and thus most of the information pre- 
sented in the discussion of bacterial replication applies to eukaryotic 
cells as well. 


Replication Forks and 
Bidirectional Replication 


Replication begins at a specific site on the bacterial chromosome 
called the origin. The origin of replication on the E. coli chromo- 
some is a specific sequence called oriC where a number of proteins 
bind to initiate the process of replication.* Once initiated, replica- 
tion proceeds outward from the origin in both directions, that is, 
bidirectionally (FIGURE 13.5). The sites in Figure 13.5 where the pair 
of replicated segments come together and join the nonreplicated 
DNA are termed replication forks. Each replication fork corre- 
sponds to a site where (1) the parental double helix is undergoing 
strand separation, and (2) nucleotides are being incorporated into 
the newly synthesized complementary strands. The two replication 
forks move in opposite directions until they meet at a point across 
the circle from the origin, where replication is terminated. The two 


The subject of initiation of replication is discussed in detail on page 526 as it occurs 
in eukaryotes. 


Replication fork 


Replication 
fork 


FIGURE 13.5 Model of a circular bacterial chromosome undergoing 
bidirectional, semiconservative replication. Two replication forks move in 
opposite directions from a single origin. When the replication forks meet at 
the opposite point on the circle, replication is terminated, and the two 
replicated duplexes detach from one another. New DNA strands are shown 
in red. 


newly replicated duplexes detach from one another and are ulti- 
mately directed into two different cells. 


Unwinding the Duplex and 
Separating the Strands 


Separation of the strands of a circular, helical DNA duplex poses 
major topological problems. To visualize the difficulties, we can 
briefly consider an analogy between a DNA duplex and a two- 
stranded helical rope. Consider what would happen if you placed a 
linear piece of this rope on the ground, took hold of the two strands 
at one end, and began to pull the strands apart just as DNA is pulled 
apart during replication. It is apparent that separation of the strands 
of a double helix is also a process of unwinding the structure. In the 
case of a rope, which is free to rotate around its axis, separation of 
the strands at one end would be accompanied by rotation of the 
entire fiber as it resisted the development of tension. Now, consider 
what would happen if the other end of the rope were attached to a 
hook on a wall (FIGURE 13.6a). Under these circumstances, separa- 
tion of the two strands at the free end would generate increasing 
torsional stress in the rope and cause the unseparated portion to 
become more tightly wound. Separation of the two strands of a cir- 
cular DNA molecule (or a linear molecule that is not free to rotate, 
as is the case in a large eukaryotic chromosome) is analogous to hav- 
ing one end of a linear molecule attached to a wall; in all of these 
cases, tension that develops in the molecule cannot be relieved by 
rotation of the entire molecule. Unlike a rope, which can become 
tightly overwound (as in Figure 13.6a), an overwound DNA molecule 
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Point of attachment 
of DNA 
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Replication machinery 
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FIGURE 13.6 The unwinding problem. (a) The effect of unwinding a 
two-stranded rope that has one end attached to a hook. The unseparated 
portion becomes more tightly wound. (b) When a circular or attached DNA 
molecule is replicated, the DNA ahead of the replication machinery becomes 
overwound and accumulates positive supercoils. Cells possess topoisomer- 
ases, such as the E. coli DNA gyrase, that remove positive supercoils. 


Source: (b) From J. C. Wang, Nature Reviews Mol. Cell Biol. 3:434, 2002, 
copyright 2002. Nature Reviews Molecular Cell Biology by Nature 
Publishing Group. Reproduced with permission of Nature Publishing Group 
in the format reuse in a book/textbook via Copyright Clearance Center. 
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ment of the replication fork generates positive supercoils in the 
unreplicated portion of the DNA ahead of the fork (Figure 13.6b). 
When one considers that a complete circular chromosome of E. coli 
contains approximately 400,000 turns and is replicated by two forks 
within 40 minutes, the magnitude of the problem becomes apparent. 

It was noted on page 381 that cells contain enzymes, called 
topoisomerases, that can change the state of supercoiling in a DNA 
molecule. One enzyme of this type, called DNA gyrase, a type II 
topoisomerase, relieves the mechanical strain that builds up during 
replication in E. coli. DNA gyrase molecules travel along the DNA 
ahead of the replication fork, removing positive supercoils. DNA gyrase 
accomplishes this feat by cleaving both strands of the DNA duplex, 
passing a segment of DNA through the double-stranded break to the 
other side, and then sealing the cuts, a process that is driven by the 
energy released during ATP hydrolysis (shown in detail in Figure 
10.14b). Eukaryotic cells possess similar enzymes that carry out this 
required function. 


The Properties of DNA Polymerases 


We begin our discussion of the mechanism of DNA replication by 
describing some of the properties of DNA polymerases, the 
enzymes that synthesize new DNA strands. Study of these enzymes 
was begun in the 1950s by Arthur Kornberg at Washington University. 
In their initial experiments, Kornberg and his colleagues purified an 
enzyme from bacterial extracts that incorporated radioactively 
labeled DNA precursors into an acid-insoluble polymer identified as 
DNA. The enzyme was named DNA polymerase (and later, after the 
discovery of additional DNA-polymerizing enzymes, it was named. 
DNA polymerase I). For the reaction to proceed, the enzyme required 
the presence of DNA and all four deoxyribonucleoside triphosphates 
(dTTP, dATP, dCTP, and dGTP). The newly synthesized, radioac- 
tively labeled DNA had the same base composition as the original 
unlabeled DNA, which strongly suggested that the original DNA 
strands had served as templates for the polymerization reaction. 

As additional properties of the DNA polymerase were uncov- 
ered, it became apparent that replication was more complex than 
previously thought. When various types of template DNAs were 
tested, it was found that the template DNA had to meet certain 
structural requirements if it was to promote the incorporation of 
labeled precursors (FIGURE 13.7). An intact, double-stranded DNA 
molecule, for example, did not stimulate incorporation. This was not 
surprising considering the requirement that the strands of the helix 
must be separated for replication to occur. It was less obvious why a 
single-stranded, circular molecule was also devoid of activity; one 
might expect this structure to be an ideal template to direct the man- 
ufacture of a complementary strand. In contrast, addition of a par- 
tially double-stranded DNA molecule to the reaction mixture 
produced an immediate incorporation of nucleotides. 

It was soon discovered that a single-stranded DNA circle cannot 
serve as a template for DNA polymerase because the enzyme cannot 
initiate the formation of a DNA strand. Rather, it can only add nucle- 
otides to the 3’ hydroxyl terminus of an existing strand. The strand 
that provides the necessary 3’ OH terminus is called a primer. All 
DNA polymerases—both prokaryotic and eukaryotic—have these 
same two basic requirements (Figure 13.8a): a template DNA strand 
to copy and a primer strand to which nucleotides can be added. 
These requirements explain why certain DNA structures fail to pro- 
mote DNA synthesis (Figure 13.7a). An intact, linear double helix 
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FIGURE 13.7 Templates and nontemplates for DNA polymerase 
activity. (a) Examples of DNA structures that do not stimulate the synthesis 
of DNA in vitro by DNA polymerase isolated from E. coli. (b) Examples of 
DNA structures that stimulate the synthesis of DNA in vitro. In all cases, the 
molecules in b contain a template strand to copy and a primer strand with a 
3’ OH on which to add nucleotides. 


provides the 3’ hydroxyl terminus but lacks a template. A circular 
single strand, on the other hand, provides a template but lacks a 
primer. The partially double-stranded molecule (Figure 13.7b) satis- 
fies both requirements and thus promotes nucleotide incorporation. 
The finding that DNA polymerase cannot initiate the synthesis of a 
DNA strand raises a critical question: How is the synthesis of a new 
strand initiated in the cell? We will return to this question shortly. 
The DNA polymerase purified by Kornberg had another property 
that was difficult to understand in terms of its presumed role as a 
replicating enzyme: It only synthesized DNA in a 5’-to-3’ (written 
5’ — 3’) direction. As Watson and Crick first discovered, the two 
strands of a DNA helix have an antiparallel orientation. The diagram 
of DNA replication first presented by Watson and Crick (see Figure 13.1) 
depicted events as they would be expected to occur at the replication 
fork. The diagram suggested that one of the newly synthesized 
strands is polymerized in a 5’ > 3’ direction, while the other strand 
is polymerized in a 3’—5’ direction. Is there some other enzyme 
responsible for the construction of the 3’ 5’ strand? Does the 
enzyme work differently in the cell than under in vitro conditions? 
We will return to this question as well. 

During the 1960s, there were hints that the “Kornberg enzyme” 
was not the only DNA polymerase in a bacterial cell. Then in 1969, a 
mutant strain of E. coli was isolated that had less than 1 percent of 
the normal activity of the enzyme, yet was able to multiply at the 
normal rate. Further studies revealed that the Kornberg enzyme, or 
DNA polymerase I, was only one of several distinct DNA polymer- 
ases present in bacterial cells and in fact was not the polymerase that 
replicates most of the DNA. Instead, the major enzyme responsible 
for DNA replication (ie., the replicative polymerase) is DNA 
polymerase III. A typical bacterial cell contains 300 to 400 molecules 
of DNA polymerase I but only about 10 copies of DNA polymerase 
III. The presence of DNA polymerase III had been masked by the 
much greater amounts of DNA polymerase I in the cell. But the 
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FIGURE 13.8 The activity of a DNA polymerase. (a) The polymerization of a nucleotide onto the 3’ end of the primer strand. The enzyme selects 
nucleotides for incorporation based on their ability to pair with the nucleotide of the template strand. (b) A simplified model of the two-metal ion mecha- 


nism for the reaction in which nucleotides are incorporated into a growing DNA strand by a DNA polymerase. In this model, one of the magnesium ions 
draws the proton away from the 3’ hydroxyl group of the terminal nucleotide of the primer, facilitating the nucleophilic attack of the negatively charged 
3’ oxygen atom on the @ phosphate of the incoming nucleoside triphosphate. The second magnesium ion stabilizes the pyrophosphate, promoting its 
release. The two metal ions are bound to the enzyme by highly conserved aspartic acid residues of the active site. (c) Schematic diagram showing the 


direction of movement of each polymerase along the two template strands. 


discovery of other DNA polymerases did not answer the two basic 
questions posed above; none of the enzymes can initiate DNA 
chains, nor can any of them construct strands in a 3’ > 5’ direction. 


Semidiscontinuous Replication 


The lack of polymerization activity in the 3’— 5’ direction has a 
straightforward explanation: DNA strands cannot be synthesized in 
that direction. Rather, both newly synthesized strands are assembled 
ina 5’ — 3’ direction. During the polymerization reaction, the —OH 
group at the 3’ end of the primer carries out a nucleophilic attack 
on the 5’ a-phosphate of the incoming nucleoside triphosphate, as 
shown in Figure 13.8b. The polymerase molecules responsible for 
construction of the two new strands of DNA both move in a 3’-to-5’ 
direction along the template, and both construct a chain that grows 
by adding nucleotides to its 3’-OH terminus (FIGURE 13.8c). Conse- 
quently, one of the newly synthesized strands grows toward the 
replication fork where the parental DNA strands are being sepa- 
rated, while the other strand grows away from the fork. 

Although this solves the problem concerning an enzyme that 
synthesizes a strand in only one direction, it creates an even more 
complicated dilemma. It is apparent that the strand that grows 
toward the fork in Figure 13.8c can be constructed by the continuous 
addition of nucleotides to its 3’ end. But how is the other strand 
synthesized? Evidence was soon gathered to indicate that the strand 
that grows away from the replication fork is synthesized discontinu- 
ously, that is, as fragments (Figure 13.9). Before the synthesis of a 
fragment can be initiated, a suitable stretch of template must be 
exposed by movement of the replication fork. Once initiated, each 
fragment grows away from the replication fork toward the 5’ end of 
a previously synthesized fragment to which it is subsequently linked. 
Thus, the two newly synthesized strands of the daughter duplexes 
are synthesized by very different processes. The strand that is 
synthesized continuously is called the leading strand because its 


synthesis continues as the replication fork advances. The strand that 
is synthesized discontinuously is called the lagging strand because 
initiation of each fragment must wait for the parental strands to sep- 
arate and expose additional template (FIGURE 13.9). As discussed on 
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FIGURE 13.9 The two strands of a double helix are synthesized by a 
different sequence of events. DNA polymerase molecules move along a 
template only in a 3’ > 5’ direction. As a result, the two newly assembled 
strands grow in opposite directions, one growing toward the replication fork 
and the other growing away from it. One strand is assembled in continuous 
fashion, the other as fragments that are joined together enzymatically. The 
diagram shown here depicts the differences in synthesis of the two strands. 
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that the terms leading and lagging may not be as appropriate as 
thought when they were first coined. Because one strand is synthe- 
sized continuously and the other discontinuously, replication is said 
to be semidiscontinuous. 

The discovery that one strand was synthesized as small frag- 
ments was made by Reiji Okazaki of Nagoya University, Japan, fol- 
lowing various types of labeling experiments. Okazaki found that if 
bacteria were incubated in (?H]thymidine for a few seconds and 
immediately killed, most of the radioactivity could be found as part 
of small DNA fragments 1000 to 2000 nucleotides in length. In con- 
trast, if cells were incubated in the labeled DNA precursor for a 
minute or two, most of the incorporated radioactivity became part 
of much larger DNA molecules (FIGURE 13.10). These results indi- 
cated that a portion of the DNA was constructed in small segments 
(later called Okazaki fragments) that were rapidly linked to 
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FIGURE 13.10 Results of an experiment showing that part of the DNA 
is synthesized as small fragments. Sucrose density gradient profiles of 
DNA from a culture of phage-infected E. coli cells. The cells were labeled 
for increasing amounts of time, and the sedimentation velocity of the 
labeled DNA was determined. When DNA was prepared after very short 
pulses, a significant percentage of the radioactivity appeared in very short 
pieces of DNA (represented by the peak near the top of the tube on the 
left). After periods of 60-120 seconds, the relative height of this peak falls 
as labeled DNA fragments become joined to the ends of high-molecular- 
weight molecules. 

Source: From R. Okazaki et al., Cold Spring Harbor Symp. Quant. Biol. 
33:130, 1968. Reprinted with permission from Cold Spring Harbor 
Laboratory Press. 


longer pieces that had been synthesized previously. The enzyme 
that joins the Okazaki fragments into a continuous strand is called 
DNA ligase. 

The discovery that the lagging strand is synthesized in pieces 
raised a new set of perplexing questions about the initiation of DNA 
synthesis. How does the synthesis of each of these fragments begin 
when none of the DNA polymerases are capable of strand initia- 
tion? Further studies revealed that initiation is not accomplished by 
a DNA polymerase but, rather, by a distinct type of RNA polymer- 
ase, called primase, that constructs a short primer composed of 
RNA, not DNA. The leading strand, whose synthesis begins at the 
origin of replication, is also initiated by a primase molecule. The 
short RNAs synthesized by the primase at the 5’ end of the leading 
strand and at the 5’ end of each Okazaki fragment serve as the 
required primer for the synthesis of DNA by a DNA polymerase. 
The RNA primers are subsequently removed, and the resulting gaps 
in the strand are filled with DNA and then sealed by DNA ligase. 
These events are illustrated schematically in FIGURE 13.11. The for- 
mation of transient RNA primers during the process of DNA repli- 
cation is a curious activity. It is thought that the likelihood of 
mistakes is greater during initiation than during elongation, and the 
use of a short removable segment of RNA avoids the inclusion of 
mismatched bases. 
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FIGURE 13.11 The use of short RNA fragments as removable primers 
in initiating synthesis of each Okazaki fragment of the lagging strand. 


The major steps are indicated in the drawing and discussed in the text. The 


roles of various accessory proteins in these activities are indicated in the 


following figures. 


REVIEW | 


1. How is it possible to obtain mutants whose defects 
lie in genes that are required for an essential activity 
such as DNA replication? 


2. Describe the events that occur at an origin of 
replication during the initiation of replication in yeast 
cells. What is meant by replication being 
bidirectional? 


3. Why do the DNA molecules depicted in Figure 13.7a 
fail to stimulate the polymerization of nucleotides by 
DNA polymerase |? What are the properties of a 
DNA molecule that allow it to serve as a template for 
nucleotide incorporation by DNA polymerase |? 


4. Describe the mechanism of action of DNA 
polymerases operating on the two template strands 
and the effect this has on the synthesis of the lagging 
versus the leading strand. 


o—___—__- 


13.3 The Machinery Operating 
at the Replication Fork 


Replication involves more than incorporating nucleotides. Unwinding 
the duplex and separating the strands require the aid of two types of 
proteins that bind to the DNA, a helicase (or DNA unwinding 
enzyme) and single-stranded DNA-binding (SSB) proteins. 
DNA helicases unwind a DNA duplex in a reaction that uses energy 
released by ATP hydrolysis to move along one of the DNA strands, 
breaking the hydrogen bonds that hold the two strands together and 
exposing the single-stranded DNA templates. E. coli has at least 12 
different helicases for use in various aspects of DNA (and RNA) 
metabolism. One of these helicases—the product of the dnaB gene— 
serves as the major unwinding machine during replication. The 
DnaB helicase consists of six subunits arranged to form a ring- 
shaped protein that encircles a single DNA strand (FIGURE 13.12a). 
Initiation of replication begins in E. coli when multiple copies of the 
DnaA protein bind to the origin of replication (oriC) and separate 
(melt) the DNA strands at that site. The DnaB helicase is then loaded 
onto the single-stranded DNA of the lagging strand of oriC, with the 
help of the protein DnaC. The DnaB helicase then translocates in a 
5’ > 3’ direction along the lagging-strand template, unwinding the 
helix as it proceeds (Figure 13.12). DNA unwinding by the helicase 
is aided by the attachment of SSB proteins to the separated DNA 
strands (Figure 13.12). These proteins bind selectively to single- 
stranded DNA, keeping it in an extended state and preventing it 
from becoming rewound or damaged. A visual portrait of the com- 
bined action of a DNA helicase and SSB proteins on the structure of 
the DNA double helix is illustrated in the electron micrographs of 
Figure 13.120. 

Recall that an enzyme called primase initiates the synthesis of 
each Okazaki fragment. In bacteria, the primase and the helicase 
associate transiently to form what is called a “primosome.” Of the 
two members of the primosome, the helicase moves along the 
lagging-strand template processively (ie., without being released 
from the template strand during the lifetime of the replication fork). 


As the helicase “motors” along the lagging-strand template, opening 521 


the strands of the duplex, the primase periodically binds to the heli- 
case and synthesizes the short RNA primers that begin the forma- 
tion of each Okazaki fragment. As noted above, the RNA primers are 
subsequently extended as DNA by a DNA polymerase, specifically 
DNA polymerase III. 

A body of evidence suggests that the same DNA polymerase HI 
molecule synthesizes successive fragments of the lagging strand. To 
accomplish this, the polymerase HI molecule is recycled from the 
site where it has just completed one Okazaki fragment to the next 
site along the lagging-strand template closer to the site of DNA 
unwinding. Once at the new site, the polymerase attaches to the 3’ 
OH of the RNA primer that has just been laid down by a primase and 
begins to incorporate deoxyribonucleotides onto the end of the 
short RNA. 

How does a polymerase II molecule move from one site on the 
lagging-strand template to another site that is closer to the replica- 
tion fork? The enzyme does this by “hitching a ride” with the DNA 
polymerase that is moving in that direction along the leading-strand 
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FIGURE 13.12 The roles of the DNA helicase, single-stranded DNA- 
binding proteins, and primase at the replication fork. (a) The helicase 
moves along the DNA, catalyzing the ATP-driven unwinding of the duplex. 
As the DNA is unwound, the strands are prevented from reforming the 
duplex by tetrameric single-stranded DNA-binding proteins (SSBs). The 
primase associated with the helicase synthesizes the RNA primers that 
begin each Okazaki fragment. The RNA primers, which are about 10 
nucleotides long, are subsequently removed. (b) A series of five electron 
micrographs showing DNA molecules incubated with a viral DNA helicase 
(T antigen, page 528) and E. coli SSB proteins. The DNA molecules are 
progressively unwound from left to right. The helicase appears as the round 
particle at the fork, and the SSB proteins are bound to the single-stranded 
ends, giving them a thickened appearance. 

Source: (b) From Rainer Wessel, Johannes Schweizer, and Hans Stahl, 

J. Virol. 66:807, 1992. © American Society for Microbiology. 
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FIGURE 13.13 Replication of the leading and lagging strands in E. coli is accomplished by two DNA polymerases working together as part of a 
single complex. (a) The two DNA polymerase Ill molecules travel together, even though they are moving toward the opposite ends of their respective 
templates. This is accomplished by causing the lagging-strand template to form a loop. (b) The polymerase releases the lagging-strand template when it 
encounters the previously synthesized Okazaki fragment. (c) The polymerase that was involved in the assembly of the previous Okazaki fragment has now 
rebound the lagging-strand template farther along its length and is synthesizing DNA onto the end of RNA primer #3 that has just been constructed by 


the primase. 


Source: (a—c) From D. Voet and J. G. Voet, Biochemistry, 2d ed.; Copyright © 1995, John Wiley and Sons, Inc. Reprinted by permission of John Wiley and 


Sons, Inc. 


template. Thus even though the two polymerases are moving in 
opposite directions with respect to the linear axis of the DNA mole- 
cule, they are, in fact, part ofa single protein complex (FIGURE 13.13). 
The two tethered polymerases can replicate both strands by looping 
the DNA of the lagging-strand template back on itself, causing this 
template to have the same orientation as the leading-strand tem- 
plate. Both polymerases then can move together as part of a single 
replicative complex without violating the “5’ > 3’ rule” for synthesis 
of a DNA strand (Figure 13.13). Once the polymerase assembling 
the lagging strand reaches the 5’ end of the Okazaki fragment syn- 
thesized during the previous round, the lagging-strand template is 
released and the polymerase begins work at the 3’ end of the next 
RNA primer toward the fork. The model depicted in Figure 13.13 is 
often referred to as the “trombone model” because the looping DNA 
repeatedly grows and collapses during the replication of the lagging 
strand, reminiscent of the movement of the brass “loop” of a trom- 
bone as it is played. Recent studies have shown that the replication 
complex, or replisome, contains three copies of DNA polymerase HI 


rather than the traditional model of two copies that is presented 
here. According to this revised model, two of the three enzymes act on 
the lagging strand and one acts on the leading strand. Single-molecule 
experiments have found that the presence of the extra polymerase 
makes lagging strand synthesis more efficient and also helps the entire 
replisome to keep moving along the template processively without 
falling off. The manner in which the activities of the two lagging- 
strand polymerases are coordinated is unclear, but the mechanism of 
lagging-strand synthesis remains as discussed in this section. 


REVIEW | 


1. Contrast the role of DNA polymerases | and Ill in 
bacterial replication. 
2. What is the consequence of having the DNA of the 


lagging-strand template looped back on itself as in 
Figure 13.13a? 


o—____" 


13.4 The Structure and 
Functions of DNA Polymerases 


DNA polymerase III, the enzyme that synthesizes DNA strands dur- 
ing replication in E. coli, is part of a large “replication machine” 
called the DNA polymerase III holoenzyme (FIGURE 13.14). One of 
the noncatalytic components of the holoenzyme, called the § clamp, 
keeps the polymerase associated with the DNA template. DNA poly- 
merases (like RNA polymerases) possess two somewhat contrasting 
properties: (1) They must remain associated with the template over 
long stretches if they are to synthesize a continuous complementary 
strand, and (2) they must be attached loosely enough to the template 
to move from one nucleotide to the next. These contrasting proper- 
ties are provided by the doughnut-shaped B clamp that encircles the 
DNA (FIGURE 13.15a) and slides along it. As long as it is attached to 
a “sliding clamp,” a DNA polymerase can move processively from 
one nucleotide to the next without diffusing away from the template. 
The polymerase on the leading-strand template remains tethered to 
a single B clamp during replication. In contrast, when the polymer- 
ase on the lagging-strand template completes the synthesis of an 
Okazaki fragment, it disengages from the B clamp and is cycled to a 
new B clamp that has been assembled at an RNA primer-DNA tem- 
plate junction located closer to the replication fork (Figure 13.15). 
But how does a highly elongated DNA molecule get inside of a 
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FIGURE 13.14 Schematic representation of DNA polymerase III 
holoenzyme. The holoenzyme contains 10 different subunits organized into 
several distinct components. Included as part of the holoenzyme are (1) two 
core polymerases which replicate the DNA, (2) two or more B clamps, which 
allow the polymerase to remain associated with the DNA, and (3) a clamp 
loading (y) complex, which loads each sliding clamp onto the DNA. The 
clamp loader of an active replication fork contains two t subunits, which 
hold the core polymerases in the complex and also bind the helicase. 
Another term, the replisome, is often used to refer to the entire complex of 
proteins that is active at the replication fork, including the DNA polymerase 
Ill holoenzyme, the helicase, SSBs, and primase. (If the bacterial replisome 
contains three DNA polymerase III molecules, as discussed on page 522, 
then it would also contain three t subunits to hold all the proteins in a 
complex, as shown in Science 335:329, 2012). 


Source: Based on drawings by M. O'Donnell. 


ring-shaped clamp as in Figure 13.15a? The assembly of the B clamp 523 


around the DNA requires a multisubunit clamp loader that is also 
part of the DNA polymerase III holoenzyme (Figures 13.14, 13.15c). 
In the ATP-bound state, the clamp loader binds to a primer-tem- 
plate junction while holding the 8 clamp in an open conformation as 
illustrated in Figure 13.15c. Once the DNA has squeezed through 
the opening in the clamp wall, the ATP bound to the clamp loader is 
hydrolyzed, causing the release of the clamp, which closes around 
the DNA. The B clamp is then ready to bind polymerase III as 
depicted in Figure 13.150. 

DNA polymerase I, which consists of only a single subunit, is 
involved primarily in DNA repair, a process by which damaged sec- 
tions of DNA are corrected (Section 13.8). DNA polymerase I also 
removes the RNA primers at the 5’ end of each Okazaki fragment 
during replication and replaces them with DNA, as described in 
Figure 13.13b. The enzyme’s ability to accomplish this feat is dis- 
cussed in the following section. 


Exonuclease Activities of DNA 
Polymerases 


Now that we have explained several of the puzzling properties of 
DNA polymerase I, such as the enzyme’s inability to initiate strand 
synthesis, we can consider another curious observation. Kornberg 
found that DNA polymerase I preparations always contained exonu- 
clease activities; that is, they were able to degrade DNA polymers by 
removing one or more nucleotides from the end of the molecule. At 
first, Kornberg assumed this activity was due to a contaminating 
enzyme because the action of exonucleases is so dramatically opposed 
to that of DNA synthesis. Nonetheless, the exonuclease activity could 
not be removed from the polymerase preparation and was, in fact, a 
true activity of the polymerase molecule. It was subsequently shown 
that all of the bacterial DNA polymerases possess exonuclease activ- 
ity. Exonucleases can be divided into 5’—> 3’ and 3’ > 5’ exonucle- 
ases, depending on the direction in which the strand is degraded. 
DNA polymerase I has both 3’—> 5’ and 5’ — 3’ exonuclease activi- 
ties, in addition to its polymerizing activity (FIGURE 13.16). These 
three activities are found in different domains of the single polypep- 
tide. Thus, remarkably, DNA polymerase I is three different enzymes 
in one. The two exonuclease activities have entirely different roles in 
replication. We will consider the 5’ > 3’ exonuclease activity first. 
Most nucleases are specific for either DNA or RNA, but the 
5’ — 3’ exonuclease of DNA polymerase I can degrade either type of 
nucleic acid. Initiation of Okazaki fragments by the primase leaves a 
stretch of RNA at the 5’ end of each fragment (see RNA primer #1 of 
Figure 13.130), which is removed by the 5’ — 3’ exonuclease activity of 
DNA polymerase I (Figure 13.16a). As the enzyme removes ribonu- 
cleotides of the primer, its polymerase activity simultaneously fills the 
resulting gap with deoxyribonucleotides. The last deoxyribonucleo- 
tide incorporated is subsequently joined covalently to the 5’ end of the 
previously synthesized DNA fragment by DNA ligase. The role of the 
3’ > 5’ exonuclease activity will be apparent in the following section. 


Ensuring High Fidelity 
during DNA Replication 
The survival of an organism depends on the accurate duplication of 


the genome. A mistake made in the synthesis of a messenger RNA 
molecule by an RNA polymerase results in the synthesis of defective 
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(a) 


FIGURE 13.15 The 6 sliding clamp and clamp loader. (a) Space-filling model showing the two subunits 
that make up the doughnut-shaped sliding clamp in E. coli. Double-stranded DNA is shown in blue 
within the B clamp. (b) Schematic diagram of polymerase cycling on the lagging strand. The polymerase is 
held to the DNA by the § sliding clamp as it moves along the template strand and synthesizes the 
complementary strand. Following completion of the Okazaki fragment, the enzyme disengages from its 

B clamp and cycles to a recently assembled clamp “waiting” at an upstream RNA primer-DNA template 
junction. The original B clamp is left behind for a period on the finished Okazaki fragment, but it is 
eventually disassembled and reutilized. (c) A model of a complex between a sliding clamp and a clamp 
loader from an archaean prokaryote based on electron microscopic image analysis. The clamp loader 
(shown with red and green subunits) is bound to the sliding clamp (blue), which is held in an open, spiral 
conformation resembling a lock-washer. The DNA has squeezed through the gap in the clamp. The primer 
strand of the DNA terminates within the clamp loader, whereas the template strand extends through an 
opening at the top of the protein. The clamp loader has been described as a “screw-cap” that fits onto 
the DNA in such a way that the subdomains of the protein form a spiral that can thread onto the helical 
DNA backbone. The clamp-loading reaction is shown in detail in Science 334:1675, 2011. 
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Source: (a) From John Kuriyan, Cell 69:427, 1992; with permission from Elsevier; (b) From P. T. Stukenberg, 
J. Turner, and M. O'Donnell, Cell 78:878, 1994; Cell by Cell Press. Reproduced with permission of Cell Press in the format reuse in a book/textbook via 
Copyright Clearance Center; (c) From T. Miyata, et al., Proc. Nat'l. Acad. Sci. U.S.A. 102:13799, 2005, Fig. 4c. © 2005 National Academy of Sciences, U.S.A. 


Image provided Courtesy of K. Morikawa, Osaka, Japan. 


proteins, but an mRNA molecule is only one short-lived template 
among a large population of such molecules; therefore, little lasting 
damage results from the mistake. In contrast, a mistake made during 
DNA replication results in a permanent mutation and the possible 
elimination of that cell’s progeny. In E. coli, the chance that an incor- 
rect nucleotide will be incorporated into DNA during replication 
and remain there is less than 10~”, or fewer than 1 out of 1 billion 
nucleotides. Because the genome of E. coli contains approximately 
4x10° nucleotide pairs, this error rate corresponds to fewer than 1 
nucleotide alteration for every 100 replication cycles. This represents 
the spontaneous mutation rate in this bacterium. Humans are 
thought to have a similar spontaneous mutation rate for replication 
of protein-coding sequences. 

Incorporation of a particular nucleotide onto the end of a grow- 
ing strand depends on the incoming nucleoside triphosphate being 
able to form an acceptable base pair with the nucleotide of the tem- 
plate strand (see Figure 13.8b). Analysis of the distances between 
atoms and bond angles indicates that A-T and G-C base pairs have 
nearly identical geometry (i.e., size and shape). Any deviation from 
those pairings results in a different geometry, as shown in FIGURE 13.17. 
At each site along the template, DNA polymerase must discriminate 
among four different potential precursors as they move in and out of 


the active site. Among the four possible incoming nucleoside 
triphosphates, only one forms a proper geometric fit with the tem- 
plate, producing either an A-T or a G-C base pair that can fit into a 
binding pocket within the enzyme. This is only the first step in the 
discrimination process. If the incoming nucleotide is “perceived” by 
the enzyme as being correct, a conformational change occurs in 
which the “fingers” of the polymerase rotate toward the “palm” 
(FIGURE 13.18a), gripping the incoming nucleotide. This is an exam- 
ple of an induced fit as discussed on page 94. If the newly formed 
base pair exhibits improper geometry, the active site cannot achieve 
the conformation required for catalysis, and the incorrect nucleotide 
is not incorporated. In contrast, if the base pair exhibits proper 
geometry, the incoming nucleotide is covalently linked to the end of 
the growing strand. 

On occasion, the polymerase incorporates an incorrect nucleo- 
tide, resulting in a mismatched base pair, that is, a base pair other 
than A-T or G-C. It is estimated that an incorrect pairing of this sort 
occurs once for every 10°-10° nucleotides incorporated, a frequency 
that is 10°-10* times greater than the spontaneous mutation rate of 
approximately 10~°. How is the mutation rate kept so low? Part of the 
answer lies in the second of the two exonuclease activities mentioned 
above, the 3’ —> 5’ activity (Figure 13.16b). When an incorrect 


5' —> 3' Exonuclease 
hydrolysis site 


Exonuclease 
(b) hydrolysis site 


FIGURE 13.16 The exonuclease activities of DNA polymerase I. 

(a) The 5’ > 3’ exonuclease function removes nucleotides from the 5’ end 
of a single-strand nick. This activity also plays a key role in removing the 
RNA primers. (b) The 3’ > 5’ exonuclease function removes mispaired 
nucleotides from the 3’ end of the growing DNA strand. This activity plays 
a key role in maintaining the accuracy of DNA synthesis. 

Source: (a-b) From D. Voet and J. G. Voet, Biochemistry, 2d ed.; Copyright 
© 1995, John Wiley and Sons, Inc. Reprinted by permission of 

John Wiley and Sons, Inc. 


FIGURE 13.17 Geometry of proper (top row) and mismatched 
(bottom row) base pairs. 

Source: From Annual Review of Biochemistry. Vol. 60 by Richardson, Charles 
reproduced with permission of Annual Reviews, Incorporated in the format 
Republish in a book via Copyright Clearance Center. 


nucleotide is incorporated by DNA polymerase I, the enzyme stalls 525 


and the end of the newly synthesized strand has an increased ten- 
dency to separate from the template and form a single-stranded 
3’ terminus. When this occurs, the frayed end of the newly synthesized. 
strand is directed into the 3’ — 5’ exonuclease site (Figure 13.18), 
which removes the mismatched nucleotide. This job of “proofreading” 
is one of the most remarkable of all enzymatic activities and illus- 
trates the sophistication to which biological molecular machinery 
has evolved. The 3’ 5’ exonuclease activity removes approximately 
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FIGURE 13.18 Activation of the 3’ > 5’ exonuclease of DNA polymer- 
ase I. (a) A schematic model of a portion of DNA polymerase | known as 
the Klenow fragment, which contains the polymerase and 3’ > 5’ exonucle- 
ase active sites. The 5’ > 3’ exonuclease activity is located in a different 
portion of the polypeptide, which is not shown here. The regions of the 
Klenow fragment are often likened to the shape of a partially opened right 
hand, hence the portions labeled as “fingers,” “palm,” and “thumb.” The 
catalytic site for polymerization is located in the central “palm” subdomain. 
The 3’ terminus of the growing strand can be shuttled between the 
polymerase and exonuclease active sites. Addition of a mismatched base 
to the end of the growing strand produces a frayed (single-stranded) 3’ end 
that enters the exonuclease site, where it is removed. (The polymerase and 
exonuclease sites of polymerase III operate similarly but are located on 
different subunits.) (6) A molecular model of the Klenow fragment com- 
plexed to DNA. The template DNA strand being copied is shown in blue, 
and the primer strand to which the next nucleotides would be added is 
shown in red. 

Source: (a) After T. A. Baker and S. P. Bell, Cell 92:296, 1998, after a drawing 
by C. M. Joyce and T. A. Steitz. Cell by Cell Press. Reproduced with 
permission of Cell Press in the format reuse in a book/textbook via Copyright 
Clearance Center; (b) Courtesy of Thomas A. Steitz, Yale University. 
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10-’-10-8. In addition, bacteria possess a mechanism called mis- 
match repair that operates after replication (page 534) and corrects 
nearly all of the mismatches that escape the proofreading step. 
Together these processes reduce the overall observed error rate to 
about 10~. Thus the fidelity of DNA replication can be traced to 
three distinct activities: (1) accurate selection of nucleotides, (2) 
immediate proofreading, and (3) postreplicative mismatch repair. 

Another remarkable feature of bacterial replication is its rate. 
The replication of an entire bacterial chromosome in approximately 
40 minutes at 37°C requires that each replication fork move about 
1000 nucleotides per second, which is equivalent to the length of an 
entire Okazaki fragment. Thus the entire process of Okazaki frag- 
ment synthesis, including formation of an RNA primer, DNA elon- 
gation and simultaneous proofreading by the DNA polymerase, 
excision of the RNA, its replacement with DNA, and strand ligation, 
occurs within a few seconds. Although it takes E. coli approximately 
40 minutes to replicate its DNA, a new round of replication can 
begin before the previous round has been completed. Consequently, 
when these bacteria are growing at their maximal rate, they double 
their numbers in about 20 minutes. 


REVIEW | 


1. Describe the role of the DNA helicase, the SSBs, the 
B clamp, the DNA gyrase, and the DNA ligase during 
replication in bacteria. 

2. How do the two exonuclease activities of DNA 
polymerase | differ from one another? What are their 
respective roles in replication? 


3. Describe the factors that contribute to the high 
fidelity of DNA replication. 


ee _ 


13.5 Replication in Viruses 


Viruses often produce their own replication machinery. The replica- 
tion system of the T4 bacteriophage played a particularly important 
role as an experimental system for dissecting the mechanisms of DNA 
replication. Some viruses such as the HIV virus have RNA-based 
genomes, but as part of their replication cycle they produce a DNA 
copy of their genome using the enzyme reverse transcriptase (RT). 
As discussed in the Human Perspectives in Chapter 3 (Section 3.8) 
reverse transcriptase has a much higher error rate than bacterial 
or human DNA polymerases. But because the enzyme is less selec- 
tive in what nucleotides it incorporates, it is able to incorporate 
nucleotide analogs into the newly synthesized DNA chain. One such 
analog is azidothymidine, or AZT. AZT resembles thymidine, but 
the 3°OH group is replaced with an azide group. If AZT is incorpo- 
rated into DNA, it creates a block to further polymerization because 
there is no 3°OH with which to extend the chain. Because reverse 
transcriptase can incorporate AZT into DNA, while eukaryotic poly- 
merases (see Section 13.6) cannot, the drug selectively inhibits 
DNA synthesis by the HIV reverse transcriptase. The discovery that 
AZT blocks HIV reverse transcription was a major breakthrough, 
and it came at a time when the rapid growth of HIV/AIDS, com- 
bined with a complete lack of drugs to treat it, was creating a sense of 
terror in the medical community. AZT was approved by the FDA in 


1987, only six years after AIDS was recognized in the United States 
and after just one clinical trial showed it to be highly effective in 
preventing death from HIV infection. In fact, it was so effective that 
the trial was ended early so that the drug could be more rapidly 
approved. 


REVIEW | 


1. How does the distinct biochemistry of reverse 
transcriptase make viruses like HIV more vulnerable 
to nucleotide analogs? 


ee 


13.6 DNA Replication 
in Eukaryotic Cells 


As noted in Chapter 10, the nucleotide letters of the human genome 
sequence would fill a book roughly one million pages in length. 
While it took several years for hundreds of researchers to initially 
sequence the human genome, a single cell nucleus of approximately 
10m diameter can copy all of this DNA within a few hours. Given 
the fact that eukaryotic cells have large genomes and complex chro- 
mosomal structure, our understanding of replication in eukaryotes 
has lagged behind that in bacteria. This imbalance has been addressed 
by the development of eukaryotic experimental systems that parallel 
those used for decades to study bacterial replication. These include 


e The isolation of mutant yeast and animal cells unable to produce 
specific gene products required for various aspects of replication. 


e Analysis of the structure and mechanism of action of replication 
proteins from archaeal species (as in Figure 13.15c). Replication 
in these prokaryotes begins at multiple origins and requires 
proteins that are homologous to those of eukaryotic cells but are 
less complex and easier to study. 


e The development of in vitro systems where replication can occur 
in cellular extracts or mixtures of purified proteins. The most 
valuable of these systems has utilized Xenopus, an aquatic frog 
that begins life as a huge egg stocked with all of the proteins 
required to carry it through a dozen or so very rapid rounds of 
cell division. Extracts can be prepared from these frog eggs that 
will replicate any added DNA, regardless of sequence. Frog egg 
extracts will also support the replication and mitotic division of 
mammalian nuclei, which has made this a particularly useful 
cell-free system. Antibodies can be used to deplete the extracts of 
particular proteins, and the replication ability of the extract can 
then be tested in the absence of the affected protein. 


Initiation of Replication 
in Eukaryotic Cells 


Replication in E. coli begins at only one site along the single, circular 
chromosome (Figure 13.5). Cells of higher organisms may have a 
thousand times as much DNA as this bacterium, yet their polymer- 
ases incorporate nucleotides into DNA at much slower rates. To 
accommodate these differences, eukaryotic cells replicate their 
genome in small portions, termed replicons. Each replicon has its 
own origin from which replication forks proceed outward in both 


directions (see Figure 13.23a). In a human cell, replication begins at 
about 10,000 to 100,000 different replication origins. The existence 
of replicons was first demonstrated in autoradiographic experiments 
in which single DNA molecules were shown to be replicated simul- 
taneously at several sites along their length (FIGURE 13.19). 

Approximately 10 to 15 percent of replicons are actively engaged 
in replication at any given time during the S phase of the cell cycle 
(see Figure 14.1). Replicons located close together in a given chro- 
mosome undergo replication simultaneously (as evident in Figure 
13.19). Moreover, those replicons active at a particular time during 
one round of DNA synthesis tend to be active at a comparable time 
in succeeding rounds. In mammalian cells, the timing of replication 
of a chromosomal region is roughly correlated with the activity of 
the genes in the region and/or its state of compaction. The presence 
of acetylated histones, which is closely correlated with gene tran- 
scription (page 497), is a likely factor in determining the early repli- 
cation of active gene loci. The most highly compacted, least acetylated 
regions of the chromosome are packaged into heterochromatin 
(section 12.4), and they are the last regions to be replicated. This dif- 
ference in timing of replication is not related to DNA sequence 
because the inactive, heterochromatic X chromosome in the cells of 
female mammals (page 469) is replicated late in S phase, whereas the 
active, euchromatic X chromosome is replicated at an earlier stage. 
Similarly, the B-globin locus replicates early during S phase in eryth- 
roid cells where the gene is expressed, but much later in noneryth- 
roid cells where the gene remains silent. 

The mechanism by which replication is initiated in eukaryotes 
has been a focus of research over the past decade. The greatest pro- 
gress in this area has been made with budding yeast because the ori- 
gins of replication can be removed from the yeast chromosome and 
inserted into bacterial DNA molecules, conferring on them the abil- 
ity to replicate either within a yeast cell or in cellular extracts con- 
taining the required eukaryotic replication proteins. Because these 
sequences promote replication of the DNA in which they are con- 
tained, they are referred to as autonomous replicating sequences 
(ARSs). Those ARSs that have been isolated and analyzed share sev- 
eral distinct elements. The core element of an ARS consists of a con- 
served sequence of 11 base pairs, which functions as a specific 
binding site for an essential multiprotein complex called the origin 


FIGURE 13.19 Experimental demonstration that replication in 
eukaryotic chromosomes begins at many sites along the DNA. 

Cells were incubated in [2H] thymidine for a brief period before preparation 
of DNA fibers for autoradiography. The lines of black silver grains indicate 
sites that had incorporated the radioactive DNA precursor during the 
labeling period. It is evident that synthesis is occurring at separated sites 
along the same DNA molecule. As indicated in the accompanying line 
drawing, initiation begins in the center of each site of thymidine 
incorporation, forming two replication forks that travel away from each 
other until they meet a neighboring fork. 


Source: Courtesy of Joel Huberman. 


recognition complex (ORC) (see FIGURE 13.20). Ifthe ARS is mutated 
so that it is unable to bind the ORC, initiation of replication cannot 
occur. 

Replication origins have proven more difficult to study in verte- 
brate cells than in yeast. Part of the problem stems from the fact that 
virtually any type of purified, naked DNA is suitable for replication 
using extracts from frog eggs. These studies suggested that, unlike 
yeast, vertebrate DNA does not possess specific sequences (e.g., 
ARSs) at which replication is initiated. However, studies of replication 
of intact mammalian chromosomes in vivo suggest that replication 
does begin within defined regions of the DNA, rather than by ran- 
dom selection as occurs in the amphibian egg extract. It is thought 
that a DNA molecule contains many sites where DNA replication can 
be initiated, but only a subset of these potential sites are actually used 
at a given time in a given cell. Cells that reproduce via shorter cell 
cycles, such as those of early amphibian embryos, utilize a greater 
number of sites as origins of replication than cells with longer cell 
cycles. The actual selection of sites for initiation of replication is 
thought to be governed by local epigenetic factors (page 480), such as 
the positions of nucleosomes, the types of histone modifications, the 
state of DNA methylation, the degree of supercoiling, and the level of 
transcription. 


Restricting Replication to Once 
Per Cell Cycle 


It is essential that each portion of the genome is replicated once, and 
only once, during each cell cycle. Consequently, some mechanism 
must exist to prevent the reinitiation of replication at a site that has 
already been duplicated. The initiation of replication at a particular 
origin requires passage of the origin through several distinct states. 
Some of the steps that occur at an origin of replication in a yeast cell 
are illustrated in Figure 13.20. Similar steps requiring homologous 
proteins take place in plants and animals, indicating that the basic 
mechanism of initiation of replication is conserved among eukaryotes. 


1. In step 1 (Figure 13.20), the origin of replication is bound by an 
ORC protein complex, which in yeast cells remains associated 
with the origin throughout the cell cycle. The ORC has been 
described as a “molecular landing pad” because of its role in 
binding the proteins required in subsequent steps. 


2. The next major step (step 2) is the assembly of a protein-DNA 
complex, called the prereplication complex (pre-RC), that is 
“licensed” (competent) to initiate replication. Studies of the for- 
mation of the pre-RC have focused on a set of six related MCM 
proteins (Mcm2-Mcm7). The MCM proteins are loaded onto the 
replication origin at a late stage of mitosis, or soon thereafter, 
with the aid of accessory factors that have previously bound to 
the ORC. The six Mcm2-Mcm7 proteins interact with one 
another to form a hexameric (six-membered) ring-shaped com- 
plex (the MCM complex) that possesses helicase activity (as in 
step 4, Figure 13.20). Evidence strongly suggests that the MCM 
complex is the eukaryotic replicative helicase; that is, the helicase 
responsible for unwinding DNA at the replication fork (analo- 
gous to DnaB in E. coli). At the pre-RC stage shown in step 2, 
each of the origins contains a double hexameric MCM complex, 
that is, two complete replicative helicases, which remain inactive 
at this stage of the cell cycle. Each of these replicases will travel in 
opposite directions away from the origin once replication begins. 
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3. The assembly of a pre-RC marks that site on the genome as a 
potential origin of replication, but does not guarantee that it will 
actually be a site where replication will be initiated. During most 
cell cycles, many more pre-RCs are assembled than will be used 
and it is not clear what determines which of these potential sites 
of replication are subsequently selected. Regardless of the selec- 
tion mechanism, just before the beginning of S phase of the cell 
cycle, the activation of key protein kinases leads to the phospho- 
rylation of the MCM complex and other proteins and to the ini- 
tiation of replication at selected sites in the genome (step 3, 
Figure 13.20). One of these protein kinases is a cyclin-dependent 
kinase (Cdk) whose function is discussed at length in Chapter 14. 
Cdk activity remains high from S phase through mitosis, which 
suppresses the formation of new prereplication complexes. 
Consequently, each origin can only be activated once per cell 


FIGURE 13.20 Steps leading to the replication of a yeast replicon. 
Yeast origins of replication contain a conserved sequence (ARS) that binds 
the multisubunit origin recognition complex (ORC) (step 1). The presence of 
the bound ORC is required for initiation of replication. The ORC is bound 
to the origin throughout the yeast cell cycle. In step 2, a complex of six 
proteins (Mcm2-Mcm/7) binds to the origin during or following mitosis, 
establishing a prereplication complex (pre-RC) that is competent to initiate 
replication, given the proper stimulus. Loading of MCM proteins at the 
origin requires additional proteins (Cdcé and Cdt1, not shown). In step 3, 
DNA replication is initiated following activation of a cyclin-dependent 
kinase (Cdk) and a second protein kinase (DDK). Step 4 shows a stage 
where replication has proceeded a short distance in both directions from 
the origin. Each MCM complex forms a replicative DNA helicase that 
unwinds DNA at one of the oppositely directed replication forks. The other 
proteins required for replication are not shown in this illustration but are 
indicated in the next figure. In step 5, the two strands of the original duplex 
have been replicated, an ORC is present at both origins, and the replication 
proteins, including the MCM helicases, have been displaced from the DNA. 
In yeast, the MCM proteins are exported from the nucleus, and reinitiation 
of replication cannot occur until the cell has passed through mitosis. [In 
vertebrate cells, several events appear to prevent reinitiation of replication, 
including (1) release of the ORC complex after its use in S phase, (2) 
continued Cdk activity from S phase into mitosis, (3) phosphorylation of 
Cdcé and its subsequent export from the nucleus, and (4) degradation of 
Cdt1 or its inactivation by a bound inhibitor.] 


cycle. Cessation of Cdk activity at the end of mitosis permits the 
assembly of pre-RCs for the next cell cycle. 


4. Once replication is initiated at the beginning of S phase, the 
MCM helicase moves with the replication fork (step 4), although 
the mechanism of action of this ring-shaped protein is debated. 
The fate of the MCM proteins after replication depends on the 
species studied. In mammalian cells, the MCM proteins are dis- 
placed from the DNA but apparently remain in the nucleus. 
Regardless, MCM proteins cannot reassociate with an origin of 
replication that has already “fired” 


The Eukaryotic Replication Fork 


Overall, the activities that occur at replication forks are quite similar, 
regardless of the type of genome being replicated—whether viral, 
bacterial, archaeal, or eukaryotic. The various proteins in the repli- 
cation “tool kit” of eukaryotic cells are listed in Table 13.1 and 
depicted in FIGURE 13.21. All replication systems require helicases, 
single-stranded DNA-binding proteins, topoisomerases, primase, 
DNA polymerase, sliding clamp and clamp loader, and DNA ligase. 
When studying the initiation of eukaryotic replication in vitro, 
researchers often combine mammalian replication proteins with a 
viral helicase called the large T antigen, which is encoded by the 
SV40 genome. The large T antigen induces strand separation at the 
SV40 origin of replication and unwinds the DNA as the replication 
fork progresses (as in Figure 13.21a). 

As in bacteria, the DNA of eukaryotic cells is synthesized in a 
semidiscontinuous manner, although the Okazaki fragments of the 
lagging strand are considerably smaller than in bacteria, averaging 
about 150 nucleotides in length. As in E. coli, the leading and lagging 
strands are thought to be synthesized in a coordinate manner by a 
single replicative complex, or replisome (Figure 13.210). 


TABLE 13.1 Some of the Proteins Required for Replication 529 


E. coliprotein Eukaryotic protein Function 


DnaA 
Gyrase 
DnaB Mcm 
DnaC Cdc6, Cdt1 
SSB RPA 
y-complex RFC 

pol III core pol d/e 

6 clamp PCNA 


ORC proteins 
Topoisomerase I/II 


Recognition of origin of replication 

Relieves positive supercoils ahead of replication fork 

DNA helicase that unwinds parental duplex 

Loads helicase onto DNA 

Maintains DNA in single-stranded state 

Subunits of the DNA polymerase holoenzyme that load the clamp onto the DNA 

Primary replicating enzymes; synthesize entire leading strand and Okazaki fragments; have proofreading capability 
Ring-shaped subunit of DNA polymerase holoenzyme that clamps replicating polymerase to DNA; works with pol III in 


E. coli and pol Sore in eukaryotes 


Primase Primase Synthesizes RNA primers 
pola 
DNA ligase 


FEN-1 


DNA ligase 
poll 


To date, five “classic” DNA polymerases have been isolated from 
eukaryotic cells, and they are designated a, B, y, 6, and ¢. Of these 
enzymes, polymerase Y replicates mitochondrial DNA, and poly- 
merase B functions in DNA repair. The other three polymerases have 
replicative functions. Polymerase a is tightly associated with the pri- 
mase, and together they initiate the synthesis of each Okazaki frag- 
ment. The polymerase a-primase complex recognizes and binds to 
unwound DNA that is coated by a single-stranded DNA-binding 
protein called RPA. Primase initiates synthesis by assembly ofa short 
RNA primer, which is then extended by the addition of about 20 
deoxyribonucleotides by polymerase a. Polymerase § is thought to 
be the primary DNA-synthesizing enzyme during replication of the 
lagging strand, whereas polymerase is thought to be the primary 
DNA-synthesizing enzyme during replication of the leading strand. 
Like the major replicating enzyme of E. coli, both polymerase § and 
5’ require a “sliding clamp” that tethers the enzyme to the DNA, 
allowing it to move processively along a template. The sliding clamp 


Pol « 
DNA Ligase PCNA Topoisomerase 
“gs Primase 
\ RPA 
FEN-1 —4) 
(( Lagging-strand template 
Pol & 


(a) 


Helicase (T antigen) 


Synthesizes short DNA oligonucleotides as part of RNA-DNA primer 
Seals Okazaki fragments into continuous strand 
Removes RNA primers; pol I of E. coli also fills gap with DNA 


of eukaryotic cells is very similar in structure and function to the 
B clamp of E. coli polymerase II] illustrated in Figure 13.15. In eukar- 
yotes, the sliding clamp is called PCNA. The clamp loader that loads 
PCNA onto the DNA is called RFC and is analogous to the E. coli 
polymerase III clamp loader complex. After synthesizing an RNA- 
DNA primer, polymerase is replaced at each template—primer junction 
by the PCNA-polymerase § complex, which completes synthesis of 
the Okazaki fragment. When polymerase § reaches the 5’ end of the 
previously synthesized Okazaki fragment, the polymerase continues 
along the lagging-strand template, displacing the primer (shown as a 
green flap in Figure 13.21a). The displaced primer is cut from the 
newly synthesized DNA strand by an endonuclease (FEN-1) and the 
resulting nick in the DNA is sealed by a DNA ligase. FEN-1 and 
DNA ligase are thought to be recruited to the replication fork 
through an interaction with the PCNA sliding clamp. In fact, PCNA 
is thought to play a major role in orchestrating events that occur 
during DNA replication, repair, and recombination. Because of its 


Helicase (T antigen) 


Topoisomerase 


Primase 
(b) 


FIGURE 13.21 A schematic view of the major components at the eukaryotic replication fork. (a) The proteins required for eukaryotic replication. The 
viral T antigen is drawn as the replicative helicase in this figure because it is prominently employed in in vitro studies of DNA replication. DNA polymerases § 
and € are thought to be the primary DNA synthesizing enzymes of the lagging and leading strands, respectively. PCNA acts as a sliding clamp for both 
polymerases 6 and «. The sliding clamp is loaded onto the DNA by a protein called RFC (replication factor C), which is similar in structure and function to the 
y-clamp loader of E. coli. RPA is a trimeric single-stranded DNA-binding protein comparable in function to that of SSB utilized in E. coli replication. The 
RNA-DNA primers of the lagging strand that are synthesized by the polymerase a-primase complex are displaced by the continued movement of polymerase 
8, generating a flap of RNA-DNA that is removed by the FEN-1 endonuclease. The gap is sealed by a DNA ligase. As in E. coli, a topoisomerase is required 
to remove the positive supercoils that develop ahead of the replication fork. (b) A proposed version of events at the replication fork illustrating how the 
replicative polymerases on the leading- and lagging-strand templates might act together as part of a replisome. To date there is no firm evidence that the 
leading and lagging strands are replicated by a single replicative complex as in E. coli. 
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as a “molecular toolbelt” 

Like bacterial polymerases, all of the eukaryotic polymerases 
elongate DNA strands in the 5’— 3’ direction by the addition of 
nucleotides to a 3’ hydroxyl group, and none of them is able to initiate 
the synthesis of a DNA chain without a primer. Polymerases y, §, and 
€ possess a 3’ 5’ exonuclease, whose proofreading activity ensures 
that replication occurs with very high accuracy. Several other DNA 
polymerases (including n, «, and 1) have a specialized function that 
allows cells to replicate damaged DNA as described on page 535. 


Replication and Nuclear Structure 


Replication forks that are active at a given time are not distributed 
randomly throughout the cell nucleus, but instead are localized within 
50 to 250 sites, called replication foci (FIGURE 13.22). It is estimated 
that each of the purple regions indicated in Figure 13.22 contains 
approximately 10 to 100 replication forks incorporating nucleotides 
into DNA strands simultaneously. The clustering of replication forks 
may provide a mechanism for coordinating the replication of adjacent 
replicons on individual chromosomes (as in Figure 13.19). 


anti-RPA-647.STED 


FIGURE 13.22 Demonstration that replication activities do not occur 
randomly throughout the nucleus but are confined to distinct sites. 
Prior to the onset of DNA synthesis at the start of S phase, various factors 
required for the initiation of replication are assembled at discrete sites 
within the nucleus, forming prereplication centers. These sites appear as 
discrete purple dots in the micrograph, which has been stained with a 
fluorescent antibody against replication factor A (RPA), a single-stranded 
DNA-binding protein required for replication. Other replication factors, 
such as PCNA and the polymerase—primase complex, are also localized to 
these foci. The two boxes outlined are shown at higher magnification in the 
lower panels. Scale bar 2 um. 

Source: From Zoltan Cseresnyes, Ulf Schwartz, and Catherine Green, BMC 
Cell Biology, vol. 10, page 88, 2009. 
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1. What is the major difference between bacteria and 
eukaryotes that allows a eukaryotic cell to replicate its 


DNA in a reasonable amount of time? 
@ 


13.7 Chromatin Structure 
and Replication 


The chromosomes of eukaryotic cells consist of DNA tightly com- 
plexed to regular arrays of histone proteins that are present in the 
form of nucleosomes (page 466). Movement of the replication 
machinery along the DNA is thought to displace nucleosomes that 
reside in its path. Yet, examination of a replicating DNA molecule 
with the electron microscope reveals nucleosomes on both daughter 
duplexes very near the replication fork (Figure 13.23a), indicating 
that the reassembly of nucleosomes is a very rapid event. 

Collectively, the nucleosomes that form during the replication 
process are comprised of a roughly equivalent mixture of histone 
molecules that are inherited from parental chromosomes and his- 
tone molecules that have been newly synthesized. Recall from 
page 466 that the core histone octamer of a nucleosome consists of 
an (H3H4), tetramer together with a pair of H2A/H2B dimers. The 
way in which parental nucleosomes are distributed during replica- 
tion has been an area of recent debate. According to results from 
classic experiments, the (H3H4), tetramers present prior to replica- 
tion remain intact and are distributed randomly between the two 
daughter duplexes. As a result, old and new (H3H4), tetramers are 
thought to be intermixed on each daughter DNA molecule as indi- 
cated in the model shown in Figure 13.23b. According to this model, 
the two H2A/H2B dimers of each parental nucleosome fail to remain 
together as the replication fork moves through the chromatin. 
Instead, the H2A/H2B dimers of a nucleosome separate from one 
another and bind randomly to the new and old (H3H4), tetramers 
already present on the daughter duplexes (Figure 13.23b). Results of 
several recent experiments have raised the possibility of another 
model, one in which the (H3H4), tetramer from parental nucle- 
osomes is split into two H3/H4 dimers, each of which may combine 
with a newly synthesized H3/H4 dimer to form a “mixed” (H3H4), 
tetramer, which then assembles with H2A/H2B dimers. Regardless 
of the pattern by which it occurs, the stepwise assembly of nucle- 
osomes and their orderly spacing along the DNA is facilitated by a 
network of accessory proteins. Included among these proteins are a 
number of histone chaperones that are able to accept either newly 
synthesized or parental histones and transfer them to the daughter 
strands. The best studied of these histone chaperones, CAF-1, is 
recruited to the advancing replication fork through an interaction 
with the sliding clamp PCNA. 

For the most part, daughter cells carry out the same pattern of 
transcription as their parental cells; this is one of the cornerstones 
underlying the homeostatic functioning of tissues and organs. As 
discussed in Chapter 12, the transcriptional state of a cell depends to 
alarge degree upon the epigenetic state of the cell’s chromatin, which 
is inherited from one cell generation to the next. Epigenetic informa- 
tion is not encoded within a chromosomes DNA sequence, but 
rather is encoded in the pattern of methylated cytosine residues in a 
cell's DNA and in the pattern of posttranslational modifications of the 
core histones associated with the DNA. Consequently, it is essential 


(b) 


Gen. 11:285, 2010). 


that these patterns be faithfully transmitted from parental chromatin 
to the chromatin of daughter cells, yet very little is known about how 
such transmission occurs. 

As noted on page 500, DNA methylation patterns are apparently 
transmitted through the activities of the DNA methyltransferase 
DNMT1. Somehow, this enzyme appears capable of adding methyl 
groups to the cytosine residues of newly synthesized DNA strands 
using the pattern of such modifications on the parental DNA strands 
as a guide or template. Transmission of histone modifications is 
likely a much more complex challenge for the cell owing to the fact 
that there are quite a number of different types of such modifications 
as well as a number of different histone residues that can be modi- 
fied. If “old” and “new” histones are transmitted randomly from 
parental to daughter chromatin as depicted in Figure 13.23, then the 
transmission of histone modifications is likely to occur by a very dif- 
ferent mechanism than the transmission of DNA methylation pat- 
terns. It is likely, for example, that modifications present on old 
histones will guide the modification of new histones within neigh- 
boring nucleosomes on the same DNA strand. In the scheme that is 
simplest to envision, each particular type of mark, such as a trimeth- 
ylated H3K9 residue or an acetylated H4K12 residue, is part of a 
positive feedback loop that causes that modification to be copied to 
the histones of an adjacent nucleosome. This type of mechanism is 
basically similar to the one depicted in Figure 12.21, which describes 
a scheme for the spreading of heterochromatin. According to this 
scenario, a particular histone modification (e.g., H3K9me) would 
serve as a specific binding site for a protein (e.g., HP1) that would in 
turn recruit an enzyme (e.g., SUV39H1) that would catalyze the 
same modification (i.e., H3K9me) on an adjacent histone molecule. 


Nucleosome 


(H3H4), 


H2A/H2B 
H2A/H2B 


FIGURE 13.23 The distribution of histone core complexes to daughter strands following replication. 

(a) Electron micrograph of chromatin isolated from the nucleus of a rapidly cleaving Drosophila embryo showing a 
pair of replication forks (arrows) moving away from each other in opposite directions. Between the two forks one sees 
regions of newly replicated DNA that are already covered by nucleosomal core particles to the same approximate 
density as the parental strands that have not yet undergone replication. (b) Schematic model showing the distribu- 
tion of core histones after DNA replication. Each nucleosome core particle is shown schematically to be composed of 
a central (H3H4), tetramer flanked by two H2A/H2B dimers. Histones that were present in parental nucleosomes 
prior to replication are indicated in blue; newly synthesized histones are indicated in red. According to this model, 
which is supported by a body of experimental evidence, the parental (H3H4) tetramers remain intact and are 
distributed randomly to both daughter duplexes. In contrast, the pairs of HZA/H2B dimers present in parental 
nucleosomes separate and recombine randomly with the (H3H4), tetramers on the daughter duplexes. Other models 
have been presented in which the parental (H3H4), tetramer is split in half by a histone chaperone, and the two 
resulting H3/H4 dimers are distributed to different DNA strands (discussed in Cell 140:183, 2010 and Nature Revs. 


Source: (a) Courtesy of Steven L. Mcknight and Oscar L. Miller, Jr. 


Whether these ideas will hold up to experimental scrutiny remains 
to be seen. 
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1. What is the role of CAF-1 in rebuilding chromatin 
after DNA replication? 


oo 


13.8 DNA Repair 


Life on Earth is subject to a relentless onslaught of destructive forces 
that originate in both the internal and external environments of an 
organism. Of all the molecules in a cell, DNA is placed in the most 
precarious position. On one hand, it is essential that the genetic 
information remain mostly unchanged as it is passed from cell to cell 
and individual to individual. On the other hand, DNA is one of the 
molecules in a cell that is most susceptible to environmental damage. 
When struck by ionizing radiation, the backbone of a DNA molecule 
is often broken; when exposed to a variety of reactive chemicals, 
many of which are produced by a cell’s own metabolism, the bases of a 
DNA molecule may be altered structurally; when subjected to ultra- 
violet radiation, adjacent pyrimidines on a DNA strand have a ten- 
dency to interact with one another to form a covalent complex, that 
is, a dimer (FIGURE 13.24). Even the absorption of thermal energy 
generated by metabolism is sufficient to split adenine and guanine 
bases from their attachment to the sugars of the DNA backbone. The 
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FIGURE 13.24 A pyrimidine dimer that has formed within a DNA 


duplex following UV irradiation. 


magnitude of these spontaneous alterations, or lesions, can be appre- 
ciated from the estimate that each cell of a warm-blooded mammal 
loses approximately 10,000 bases per day! Failure to repair such 
lesions produces permanent alterations, or mutations, in the DNA. If 
the mutation occurs in a cell destined to become a gamete, the 
genetic alteration may be passed on to the next generation. Mutations 
also have effects in somatic cells (i-e., cells that are not in the germ 
line): They can interfere with transcription and replication, lead to 
the malignant transformation ofa cell, or speed the process by which 
an organism ages. 

Considering the potentially drastic consequences of alterations 
in DNA molecules and the high frequency at which they occur, it is 
essential that cells possess mechanisms for repairing DNA damage. 
In fact, cells have a bewildering arsenal of repair systems that correct 
virtually any type of damage to which a DNA molecule is vulnerable. 
It is estimated that less than one base change in a thousand escapes a 
cells repair systems. The existence of these systems provides an 
excellent example of the molecular mechanisms that maintain cel- 
lular homeostasis. The importance of DNA repair can be appreci- 
ated by examining the effects on humans that result from DNA 
repair deficiencies, a subject discussed in the Human Perspective in 
Section 13.10. 

Both prokaryotic and eukaryotic cells possess a variety of pro- 
teins that patrol vast stretches of DNA, searching for subtle chemical 
modifications or distortions of the DNA duplex. In some cases, 
damage can be repaired directly. Humans, for example, possess 
enzymes that can directly repair damage from cancer-producing 
alkylating agents. Most repair systems, however, require that a dam- 
aged section of the DNA be excised, that is, selectively removed. One 
of the great virtues of the DNA duplex is that each strand contains 
the information required for constructing its partner. Consequently, 
if one or more nucleotides are removed from one strand, the com- 
plementary strand can serve as a template for reconstruction of the 
duplex. The repair of DNA damage in eukaryotic cells is complicated 
by the relative inaccessibility of DNA within the folded chromatin 


fibers of the nucleus. As in the case of transcription, DNA repair 
involves the participation of chromatin-reshaping machines, such as 
the histone modifying enzymes and nucleosome remodeling com- 
plexes discussed on pages 497-499. Although important in DNA 
repair, the roles of these proteins will not be considered in the fol- 
lowing discussion. 


Nucleotide Excision Repair 


Nucleotide excision repair (NER) operates by a cut-and-patch 
mechanism that removes a variety of bulky lesions, including pyrim- 
idine dimers and nucleotides to which various chemical groups have 
become attached. Two distinct NER pathways can be distinguished: 


1. A transcription-coupled pathway in which the template strands of 
genes that are being actively transcribed are preferentially 
repaired. Repair of a template strand is thought to occur as the 
DNA is being transcribed, and the presence of the lesion may be 
signaled by a stalled RNA polymerase. This preferential repair 
pathway ensures that those genes of greatest importance to the 
cell, which are the genes the cell is actively transcribing, receive 
the highest priority on the “repair list.” 


2. A slower, less efficient global genomic pathway that corrects DNA 
strands in the remainder of the genome. 


Although recognition of the lesion is probably accomplished by 
different proteins in the two NER pathways (step 1, FIGURE 13.25), 
the steps that occur during repair of the lesion are thought to be very 
similar, as indicated in steps 2-6 of Figure 13.25. One of the key com- 
ponents of the NER repair machinery is TFIIH, a huge protein that 
also participates in the initiation of transcription. The discovery of 
the involvement of TFIIH established a crucial link between tran- 
scription and DNA repair, two processes that were previously 
assumed to be independent of one another. Included among the vari- 
ous subunits of TFIIH are two subunits (XPB and XPD) that possess 
helicase activity; these enzymes separate the two strands of the 
duplex (step 2, Figure 13.25) in preparation for removal of the lesion. 
The damaged strand is then cut on both sides of the lesion by a pair 
of endonucleases (step 3), and the segment of DNA between the inci- 
sions is released (step 4). Once excised, the gap is filled by a DNA 
polymerase (step 5), and the strand is sealed by DNA ligase (step 6). 


Base Excision Repair 


A separate excision repair system operates to remove altered nucleo- 
tides generated by reactive chemicals present in the diet or produced 
by metabolism. The steps in this repair pathway in eukaryotes, which 
is called base excision repair (BER), are shown in FIGURE 13.26. 
BER is initiated by a DNA glycosylase that recognizes the alteration 
(step 1, Figure 13.26) and removes the altered base by cleavage of the 
glycosidic bond holding the base to the deoxyribose sugar (step 2). 
A number of different DNA glycosylases have been identified, each 
more-or-less specific for a particular type of altered base, including 
uracil (formed by the hydrolytic removal of the amino group of cyto- 
sine), 8-oxoguanine (caused by damage from oxygen free radicals, 
page 34), and 3-methyladenine (produced by transfer of a methyl 
group from a methyl donor, page 414). 

Structural studies of the DNA glycosylase that removes the 
highly mutagenic 8-oxoguanine (oxoG) indicate that this enzyme 
diffuses rapidly along the DNA “inspecting” each of the G-C base 
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FIGURE 13.25 Nucleotide excision repair. The following steps are 
depicted in the drawing and discussed in the text: (1) Damage recognition 
in the global pathway is mediated by an XPC-containing protein complex, 
whereas damage recognition in the transcription-coupled pathway is 
thought to be mediated by a stalled RNA polymerase in conjunction with a 
CSB protein; (2) DNA strand separation (by XPB and XPD proteins, two 
helicase subunits of TFIIH); (3) incision (by XPG on the 3’ side and the 
XPF-ERCC1 complex on the 5’ side); (4) excision, (5) DNA repair synthesis 
(by DNA polymerase 6 and/or €); and (6) ligation (by DNA ligase |). 


pairs within the DNA duplex (FIGURE 13.27, step 1). In step 2, the 
enzyme has come across an oxoG-C base pair. When this occurs, the 
enzyme inserts a specific amino acid side chain into the DNA helix, 
causing the nucleotide to rotate (“flip”) 180 degrees out of the DNA 
helix and into the body of the enzyme (step 2). If the nucleotide does, 
in fact, contain an oxoG, the base fits into the active site of the 
enzyme (step 3) and is cleaved from its associated sugar. In contrast, 
if the extruded nucleotide contains a normal guanine, which only 
differs in structure by two atoms from oxoG, it is unable to fit into the 
enzyme’ active site (step 4) and it is returned to its appropriate posi- 
tion within the stack of bases. Once an altered purine or pyrimidine 
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FIGURE 13.26 Base excision repair. The steps are described in the text. 
Other pathways for BER are known, and BER also has been shown to have 
distinct transcription-coupled and global repair pathways. 


is removed by a glycosylase, the “beheaded” deoxyribose phosphate 
remaining in the site is excised by the combined action of a special- 
ized (AP) endonuclease and a DNA polymerase. AP endonuclease 
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FIGURE 13.27 Detecting damaged bases during BER. In step 1, a DNA glycosylase (named hOGG'1) is inspecting a nucleotide that is paired to a 
cytosine. In step 2, the nucleotide is flipped out of the DNA duplex. In this case, the base is an oxidized version of guanine, 8-oxoguanine, and it is able to 
fit into the active site of the enzyme (step 3) where it is cleaved from its attached sugar. The subsequent steps in BER were shown in Figure 13.26. In step 4, 
the extruded base is a normal guanine, which is unable to fit into the active site of the glycosylase and is returned to the base stack. Failure to remove oxoG 


would have resulted in a G-to-T mutation. 


Source: Based on S. S. David, with permission from Nature 434:569, 2005. Nature by Nature Publishing Group. Reproduced with permission of Nature 
Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 


cleaves the DNA backbone (Figure 13.26, step 3) and a phosphodi- 
esterase activity of polymerase g removes the sugar—phosphate 
remnant that had been attached to the excised base (step 4). 
Polymerase f then fills the gap by inserting a nucleotide comple- 
mentary to the undamaged strand (step 5), and the strand is sealed 
by DNA ligase ITI (step 6). 

The fact that cytosine can be converted to uracil may explain 
why natural selection favored the use of thymine, rather than uracil, 
as a base in DNA, even though uracil was presumably present in 
RNA when it served as genetic material during the early evolution of 
life (page 429). If uracil had been retained as a DNA base, it would 
have caused difficulty for repair systems to distinguish between a 
uracil that “belonged” at a particular site and one that resulted from 
an alteration of cytosine. 


Mismatch Repair 


It was noted earlier that cells can remove mismatched bases that are 
incorporated by the DNA polymerase and escape the enzyme’s 
proofreading exonuclease. This process is called mismatch repair 
(MMR). A mismatched base pair causes a distortion in the geometry 
of the double helix that can be recognized by a repair enzyme. But 
how does the enzyme “recognize” which member of the mismatched 
pair is the incorrect nucleotide? If it were to remove one of the nucle- 
otides at random, it would make the wrong choice 50 percent of the 
time, creating a permanent mutation at that site. Thus, for a mis- 
match to be repaired after the DNA polymerase has moved past a 
site, it is important that the repair system distinguish the newly syn- 
thesized strand, which contains the incorrect nucleotide, from the 
parental strand, which contains the correct nucleotide. In E. coli, the 
two strands are distinguished by the presence of methylated adeno- 
sine residues on the parental strand. DNA methylation does not 
appear to be utilized by the MMR system in eukaryotes, and the 
mechanism of identification of the newly synthesized strand remains 
unclear. Several different MMR pathways have been identified and 
will not be discussed. 


Double-Strand Breakage Repair 


X-rays, gamma rays, and particles released by radioactive atoms are 
all described as ionizing radiation because they generate ions as they 
pass through matter. Millions of gamma rays pass through our bod- 
ies every minute. When these forms of radiation collide with a fragile 


DNA molecule, they often break both strands of the double helix. 
Double-strand breaks (DSBs) can also be caused by certain 
chemicals, including several (e.g., bleomycin) used in cancer chemo- 
therapy, and free radicals produced by normal cellular metabolism 
(page 34). DSBs are also introduced during replication of damaged 
DNA. A single double-strand break can cause serious chromosome 
abnormalities, which can have grave consequences for the cell. DSBs 
can be repaired by several alternate pathways. The predominant 
pathway in mammalian cells is called nonhomologous end joining 
(NHE)J), in which a complex of proteins binds to the broken ends of 
the DNA duplex and catalyzes a series of reactions that rejoin the 
broken strands. The major steps that occur during NHEJ are shown 
in FIGURE 13.28a and described in the accompanying legend. Figure 
13.28b shows the nucleus of a human fibroblast that had been treated 
with a laser to induce a localized cluster of double-strand breaks and 
then stained for the presence of the protein Ku at various times after 
laser treatment. This NHEJ repair protein is seen to localize at the 
site of the DSBs immediately following their appearance. 

Another DSB repair pathway known as homologous recombina- 
tion (HR) requires a homologous chromosome to serve as a template 
for repair of the broken strand. The steps that occur during homolo- 
gous recombination, which include excision of the damaged DNA, 
are similar to those of genetic recombination depicted in Figure 
14.47. A comparison between these two DSB repair pathways shows 
major differences. Homologous recombination is a more accurate 
pathway; that is, there are fewer errors in the base sequence of the 
repaired DNA than NHEJ. However, because it requires that a 
homologous chromosome be present in the nucleus, HR can only be 
employed during the cell cycle after DNA replication takes place 
(i.e., during late S or G, phase). Defects in both repair pathways have 
been linked to increased cancer susceptibility. For example, the 
BRCAI protein is part of the homologous repair pathway, and as 
discussed in the Chapter Opener, inherited mutations in the BRCA1 
gene greatly increase the risk of breast cancer. 
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1. Contrast the events of nucleotide excision repair and 
base excision repair. 


2. Why is it important in mismatch repair that the cell 
distinguish the parental strands from the newly 
synthesized strands? 
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FIGURE 13.28 Repairing double-strand breaks (DSBs) by nonhomolo- 
gous end joining (NHEJ). (a) In this simplified model of double-strand 
break repair, the lesion (step 1) is detected by a heterodimeric, ring-shaped 
protein called Ku that binds to the broken ends of the DNA (step 2). The 
DNA-bound ku recruits another protein, called DNA-PK,,, which is the 
catalytic subunit of a DNA-dependent protein kinase (step 3). Most of the 
substrates phosphorylated by this protein kinase have not been identified. 
These proteins bring the ends of the broken DNA together in such a way 
that they can be joined by DNA ligase IV to regenerate an intact DNA 
duplex (step 4). The NHEJ pathway may also involve the activities of 
nucleases and polymerases (not shown) and is more error prone than is the 
homologous recombination pathway of DSB repair. (b) Time course analysis 
of Ku localization at sites of DSB formation induced by laser microbeam 
irradiation at a site indicated by the arrowheads. The NHEJ protein Ku 
becomes localized at the damage site immediately following irradiation but 
remains there just briefly as the damage is presumably repaired. 
Micrographs were taken (1) immediately, (2) 2 hours, and (3) 8 hours 

after irradiation. 


Source: (6) From Jong-Soo Kim et al., J. Cell Biol. 170:344, 2005, Fig. 3, 
reproduced with permission of the Rockefeller University Press. 


13.9 Between Replication 
and Repair 
The Human Perspective (Section 13.10) describes an inherited dis- 


ease xeroderma pigmentosum (XP)—that leaves patients with an 
inability to repair certain lesions caused by exposure to ultraviolet 


radiation. Patients described as having the “classical” form of XP 535 


have a defect in one of seven different genes involved in nucleotide 
excision repair (page 536). These genes are designated XPA, XPB, 
XPC, XPD, XPE, XPF, and XPG, and some of their roles in NER are 
indicated in the legend of Figure 13.25. Another group of patients 
was identified that, like those with XP, were highly susceptible to 
developing skin cancer as the result of sun exposure. However, 
unlike the cells from XP sufferers, cells from these patients were 
capable of nucleotide excision repair and were only slightly more 
sensitive to UV light than normal cells. This heightened UV sensitiv- 
ity revealed itself during replication, because these cells often pro- 
duced fragmented daughter strands following UV irradiation. Patients 
in this group were classified as having a variant form of XP, designated 
XP-V. We will return to the basis of the XP-V defect in a moment. 

We have seen in the previous section that cells can repair a great 
variety of DNA lesions. On occasion, however, a DNA lesion is not 
repaired by the time that segment of DNA is scheduled to undergo 
replication. On these occasions, the replication machinery arrives at 
the site of damage on the template strand and becomes stalled there. 
When this happens, some type of signal is emitted that leads to the 
recruitment of a specialized polymerase that is able to bypass the 
lesion.’ Suppose the lesion in question is a thymidine dimer (Figure 
13.24) in a skin cell that was caused by exposure to UV radiation. 
When the replicative polymerase (pol § or €) reaches the obstacle, 
the enzyme is temporarily replaced by a “specialized” DNA poly- 
merase designated pol , which is able to insert two A residues into 
the newly synthesized strand across from the two T residues that are 
covalently linked as part of the dimer. An alternative bypass poly- 
merase called PrimPol can also be recruited to polymerize past rep- 
lication blocks, and it has the interesting feature of being able to 
produce and use a DNA-based primer instead of the usual RNA- 
based primer that most replicative polymerases use. Regardless of 
which system is used to polymerize past the lesion, once this “dam- 
age bypass” is accomplished, the cell switches back to the normal 
replicative polymerase and DNA synthesis continues without leav- 
ing any trace that a serious problem had been resolved. Patients 
afflicted with XP-V have a mutation in the gene encoding pol n and 
thus have difficulty replicating past thymidine dimers, but they can 
still repair a certain level of thymidine dimers using the PrimPol 
polymerase. 

Discovered in 1999, polymerase 1 is a member of a family of 
DNA polymerases in which each member is specialized for incorpo- 
rating nucleotides opposite particular types of DNA lesions in the 
template strand. The polymerases of this family are said to engage in 
translesion synthesis (TLS). X-ray crystallographic studies reveal that 
the TLS polymerases have an unusually spacious active site that is 
able to physically accommodate altered nucleotides that would not 
fit in the active site of a replicative polymerase. These TLS polymer- 
ases are only capable of incorporating one to a few nucleotides into a 
DNA strand (they lack processivity); they have no proofreading 
capability; and they are much more likely to incorporate an incorrect 
(ie., noncomplementary) nucleotide when copying undamaged 
DNA than the classic polymerases. 

The stalling of replication forks that allows damage to be 
repaired can itself be a source of damage and genome alteration. 


3A cell has other options to deal with a stalled replication fork, but they are more 
complex and will not be discussed. 
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536 When a fork stalls, the helicase can continue to unwind the DNA 
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while the polymerases stay behind with the stalled fork, creating a 
single stranded region that can then be broken or cut by nucleases, 
creating double stranded breaks. These breaks create the potential 
for genomic rearrangements when the cell tries to fix the break by 
recombination. Such genome rearrangements have been implicated 
in a variety of diseases. For example, DNA double strand breaks 
associated with stalled forks are thought to be one cause of sponta- 
neous copy number variation (CNV), in which a section of DNA can 
become repeated several times. CNV is implicated in more than 20 
neurodegenerative or neurodevelopmental diseases, as well as in 


complex conditions such as schizophrenia. Similarly, the variation in 
trinucleotide repeat number discussed in Chapter 10 has also been 
proposed to result from stalling of replication forks caused by unu- 
sual DNA structures formed by the repeats. Maintaining the integ- 
rity of the genome requires careful coordination between DNA 


replication, repair, and recombination. 


REVIEW | 


1. What is the role of translesion synthesis? 


13.10 THE HUMAN PERSPECTIVE 


Consequences of DNA Repair Deficiencies 


We owe our lives to light from the sun, which provides the 
energy captured during photosynthesis. But the sun also emits a 
constant stream of ultraviolet rays that ages and mutates the cells 
of our skin. The hazardous effects of the sun are most dramati- 
cally illustrated by the rare recessive genetic disorder, xeroderma 
pigmentosum (XP). Patients with XP possess a deficient nucleo- 
tide excision repair system that cannot remove segments of DNA 
damaged by ultraviolet radiation. As a result, persons with XP 
are extremely sensitive to sunlight; even very limited exposure 
to the direct rays of the sun can produce large numbers of dark- 
pigmented spots on exposed areas of the body and a greatly ele- 
vated risk of developing disfiguring and fatal skin cancers. 

XP is not the only genetic disorder characterized by nucle- 
otide excision repair deficiency. Cockayne syndrome (CS) is an 
inherited disorder characterized by acute sensitivity to light, neu- 
rological dysfunction due to demyelination of neurons, and dwarf- 
ism, but no evident increase in the frequency of skin cancer. Cells 
from persons with CS are deficient in the pathway by which tran- 
scriptionally active DNA is repaired (page 532). The remainder of 
the genome is repaired at the normal rate, presumably accounting 
for the normal levels of skin cancer. But why are persons with a 
defective repair mechanism subject to specific abnormalities 
such as dwarfism? Most cases of CS can be traced to a mutation 
in one of two genes, either CSA or CSB, which are thought to 
be involved in coupling transcription to DNA repair (see Figure 
13.25). Mutations in these genes, in addition to impacting DNA 
repair, may also disturb the transcription of certain genes, leading 
to growth retardation and abnormal development of the nervous 
system. This possibility is strengthened by the finding that, in rare 
cases, the symptoms of CS can also occur in persons with XP who 
carry specific mutations in the XPD gene. As noted on page 532, 
XPD encodes a subunit of the transcription factor TFIIH required 
for transcription initiation. Mutations in XPD could lead to defects 
in both DNA repair and transcription. Certain other mutations 
in the XPD gene are responsible for another disease, trichothio- 
dystrophy (TTD), which also combines symptoms suggestive 
of both DNA repair and transcription defects. Like CS patients, 
individuals with TTD exhibit increased sun sensitivity without the 


increased risk of development of cancer. TTD patients have addi- 
tional symptoms, including brittle hair and scaly skin. These find- 
ings indicate that three distinct disorders—XP, CS, and TTD—can 
be caused by defects in a single gene, with the particular disease 
outcome likely determined by the specific mutation present in 
that gene. Structural studies of mutant XPD molecules suggest 
that these different mutations affect different functions of the 
protein. 

Elsewhere in this text, we have described circumstances that 
lead to premature (or accelerated) aging in humans or animal 
models: As the result of (1) increased free radicals (page 34), (2) 
increased mitochondrial DNA mutations (page 196), and (3) muta- 
tions in a protein of the nuclear envelope (page 461). In 2006, a 
15-year-old boy who suffered from frequent sunburns and certain 
characteristics of premature aging came to the attention of 
clinical researchers. Genetic analysis determined that the boy car- 
ried a mutation in the XPF gene whose encoded protein makes 
one of the cuts during the NER pathway (Figure 13.25). Patients 
with mild mutations in XPF develop XP and have impaired NER. 
This individual had a more severe mutation in the XPF gene, caus- 
ing his cells to be unable to repair covalent cross-links that form 
occasionally between the two strands of a DNA duplex. Studies 
on the cells of this individual, and on mice with a corresponding 
mutation, suggested that the unrepaired cross-links lead to 
increased cell death (apoptosis), which either directly or indi- 
rectly promotes premature aging. According to one hypothesis, 
defects in DNA repair systems that result primarily in an increased 
mutation rate in the body's cells are associated with an increased 
susceptibility to cancer, whereas defects in DNA repair systems 
that result primarily in cell death are associated with accelerated 
aging.’ Whether any of these premature-aging syndromes pro- 
vides insight into the mechanisms of normal aging remains a 
matter of debate. 

Persons with DNA-repair disorders are not the only indi- 
viduals who should worry about exposure to the sun. Even in a 
skin cell whose repair enzymes are functioning at optimal levels, 
a small fraction of the lesions fail to be excised and replaced. 
Alterations in DNA lead to mutations that can cause a cell 


to become malignant. Thus, one of the consequences of the 
failure to correct UV-induced damage is the risk of skin cancer. 
Consider the following statistics: More than one million per- 
sons develop one of three forms of skin cancer every year in 
the United States, and most of these cases are attributed to 
overexposure to the sun's ultraviolet rays. Fortunately, the two 
most common forms of skin cancer—basal cell carcinoma and 
squamous cell carcinoma—rarely spread to other parts of the 
body and can usually be excised in a doctor's office. Both of 
these types of cancer originate from the skin's epithelial cells. 

However, malignant melanoma, the third type of skin cancer, 
is a potential killer. Unlike the others, melanomas develop from 
pigment cells (melanocytes) in the skin. The number of cases of 
melanoma diagnosed in the United States is climbing at the alarm- 
ing rate of 4 percent per year due to the increasing amount of time 
people have spent in the sun over the past few decades. Studies 
suggest that one of the greatest risk factors to developing mela- 
noma as an adult is the occurrence of a severe, blistering sunburn 
as a child or adolescent. Individuals at greatest risk are Caucasians 
with extremely light skin. Many of these individuals have pigment 
cells whose surfaces lack a functioning receptor (called MC1R) 
for a hormone that is secreted by nearby epithelial cells of the 
skin in response to ultraviolet radiation. Melanocytes respond to 
MCIR activation by producing the dark pigment melanin, thereby 
providing the individual with a tan. Tanned skin is more protected 
from UV rays than is light, untanned skin, even though it is UV 
radiation that is responsible for triggering the tanning response. 
What if it were possible to develop a tanned skin without having 
to suffer UV exposure? A number of research groups are working 
on such an approach by using various means other than exposure 
to UV-containing sunlight to stimulate the tanning response in 
pigment cells. Whether any of these approaches will prove safe 
and effective remains to be seen. 

Skin cancer is not the only disease that is promoted by defi- 
cient or overworked DNA repair systems. It is estimated that up to 
15 percent of colon cancer cases can be attributed to mutations in 


the genes that encode the proteins required for mismatch repair. 
Mutations that cripple the mismatch repair system inevitably lead 
to a higher mutation rate in other genes because mistakes made 
during replication are not corrected. 

Cancer is also one of the consequences of double-strand 
DNA breaks that have either gone unrepaired or been repaired 
incorrectly. Breaks in DNA can be caused by a variety of envi- 
ronmental agents to which we are commonly exposed, including 
X-rays, gamma rays, and radioactive emissions. The most serious 
environmental hazard in this regard is probably radon (specifically 
222Rn), a radioactive isotope formed during the disintegration of 
uranium. Some areas of the planet contain relatively high levels of 
uranium in the soil, and houses built in these regions can contain 
dangerous levels of radon gas. When the gas is breathed into the 
lungs, it can lead to double-strand DNA breaks that increase the 
risk of lung cancer. A significant fraction of lung-cancer deaths in 
nonsmokers is probably due to radon exposure. 


References 


1. It has not been mentioned in this discussion, for a number of reasons, 
that two of the genes most often responsible for premature aging syn- 
dromes encode members of a particular type of DNA helicase family 
called RecQ helicases. The genes in question are WRN and BLM, which, 
when mutated, are responsible for the inherited diseases Werner 
syndrome and Bloom syndrome, respectively, which are characterized 
by both increased cancer risk and features of accelerated aging. It is 
suggested that these helicases are involved in certain types of base 
excision and DSB repair pathways. They appear to be particularly 
important in resolving situations where a replicative DNA polymerase 
becomes stalled at a lesion and the replication fork “collapses” 
(disassembles). 
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Analytic Questions RK 


Suppose that Meselson and Stahl had carried out their experi- 
ment by growing cells in medium with 'N and then transferring 
the cells to medium containing '°N. How would the bands within 
the centrifuge tubes have appeared if replication were semicon- 
servative? if replication were conservative? if replication were 
dispersive? 

Suppose you isolated a mutant strain of yeast that replicated its 
DNA more than once per cell cycle. In other words, each gene in 
the genome was replicated several times between successive 
cell divisions. How might you explain such a phenomenon? 


How would the chromosomes from the experiment on eukary- 
otic cells depicted in Figure 13.4 have appeared if replication 
occurred by a conservative or a dispersive mechanism? 


We have seen that cells possess a special enzyme to remove 
uracil from DNA. What do you suppose would happen if 
the uracil groups were not removed? (You might consider the 


information presented in Figure 11.42 on the pairing properties 


of uracil.) 


5 Draw a partially double-stranded DNA molecule that would not 


serve as a template for DNA synthesis by DNA polymerase I. 


6 Some temperature-sensitive bacterial mutants stop replication 


immediately following elevation of temperature, whereas others 
continue to replicate their DNA for a period of time before they 
cease this activity, and still others continue until a round of repli- 
cation is completed. How might these three types of mutants 
differ? 

Suppose the error rate during replication in human cells were 
the same as that of bacteria (about 107). How would this impact 
the two cells differently? 


Consider a bacterium that divides once every half hour, has a 
genome that is 10° bp, and that has an error rate for DNA repli- 
cation of 10°. If a single cell of this bacterium starts growing in 
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a tube of culture medium and grows for a whole day (24 hours), 
approximately how many different point mutations will be 
present among the bacteria in the tube after 24 hours? 
(See Quantitative Tutorial Video). 


Figure 13.19 shows the results from an experiment in which cells 
were incubated with [?H]thymidine for less than 30 minutes prior 
to fixation. How would you expect this photograph to appear 
after a one-hour labeling period? Can you conclude that the 
entire genome is replicated within an hour? If not, why not? 


Origins of replication tend to have a region that is very rich in 
A-T base pairs. What function do you suppose these sections 
might serve? 
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13 
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What are the advantages of replication occurring in a small num- 
ber of replication foci rather than in the general nucleoplasm? 


What are some of the reasons you might expect human cells to 
have more efficient repair systems than those of a frog? 


Suppose you were to compare autoradiographs of two cells that 
had been exposed to [°H]thymidine, one that was engaged in 
DNA replication (S phase) and another that was not. How would 
you expect autoradiographs of these cells to differ? 

Construct a model that would explain how transcriptionally 


active DNA is repaired preferentially over transcriptionally silent 
DNA. 
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Cell Division 


SEEKING IMMORTALITY 


—— couple of months, all of the skin cells on the 
surface of your body are sloughed away and replaced by 
new cells. The epithelial lining your lower intestine is 
renewed at an even faster rate, turning over completely 
in three to five days. Like a majority of cells in our body, 
these epithelial cells are terminal or post-mitotic, 
meaning that they will no longer divide to give rise to 
new cells. As they get older and damaged by wear and 
tear, they are continuously replaced through the work of 
stem cells (as described in Chapter 1). In contrast, other 
cells in the body are extremely long-lived and do not 
appear to be replaced at all, including some neuronal 
populations. 


Some organisms do not appear to have the ability to 
regenerate new cells and are born with all the cells they 
will ever have. This includes the model organism 
Caenorhabditis elegans, where the role and location of 
each of the 959 cells of the adult worm have been 
mapped. 


In contrast, other organisms appear to have extremely 
active stem cells, giving rise to an impressive ability to Dividing cells in an adult planarian are highlighted in orange/red; 
regenerate limbs or even larger sections of their body. cellular nuclei are colored blue. The dividing cells are all derived from 


: ee a single pluripotent stem cell. These active populations of stem cells 
Newts, for example, are well known for their ability to Nia pop 


provide planaria with remarkable regenerative capabilities. 


continues Source: From D. Wagner., et al. Science vol 332. Cover of May 13, 2011. 
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at the wound site and undergo a series of division and 
differentiation to give rise to new tissue, such as skin, muscle, 
and blood vessels. 


Perhaps the most remarkable regenerative ability belongs to 
small flatworms of the Planaria genus. As demonstrated in 
numerous teaching laboratories around the world, a planar- 
ian can be sliced and diced into numerous small pieces, 
each of which will eventually form a complete worm. 
Research has shown that a mass of stem cells, called a 
blastema, migrate to the sites of injury and coordinate the 


14.1. The Cell Cycle 


The process by which new cells arise from other living cells is called 
cell division. For a multicellular organism, such as a human or an 
oak tree, countless divisions of a single-celled zygote produce an 
organism of astonishing cellular complexity and organization. Cell 
division does not stop with the formation of the mature organism 
but continues in certain tissues throughout life. Millions of cells 
residing within the marrow of your bones or the lining of your intes- 
tinal tract are undergoing division at this very moment. This enor- 
mous output of cells is needed to replace cells that have aged or died. 

Although cell division occurs in all organisms, it takes place very 
differently in prokaryotes and eukaryotes. We will restrict discussion 
to the eukaryotic version. Two distinct types of eukaryotic cell division 


Interphase 
a Gi: ES 
7 _- Cell grows and DNA replication 
a carries out normal and chromosome 
a | metabolism; | duplication 


organelles duplicate 


ps 
Cell grows 
and prepares 
for mitosis 


destruction of older tissues and the formation of new tissue 
to reform what has been lost. The blastema is thought 

to “know” what to build based on a complex system 

of signals and interactions that is currently not well 
understood. 


Like stem cells, cancer cells are also well known for their 
ability to undergo numerous cell divisions rapidly, but often 
with dire consequences. Understanding the mechanisms of 
cell division and how the cell cycle is regulated is key to 
potentially harnessing the power of regenerative therapeu- 
tics, as well as to keeping cancer in check. 


will be discussed in this chapter. Mitosis leads to production of cells 
that are genetically identical to their parent, whereas meiosis leads to 
production of cells with half the genetic content of the parent. Mitosis 
serves as the basis for producing new cells, meiosis as the basis for 
producing new sexually reproducing organisms. Together, these two 
types of cell division form the links in the chain between parents and 
their offspring and, in a broader sense, between living species and 
the earliest eukaryotic life forms present on Earth. 


Phases of the Cell Cycle 


In a population of dividing cells, whether inside the body or in a 
culture dish, each cell passes through a series of defined stages, which 
constitutes the cell cycle (FIGURE 14.1). The cell cycle can be divided 


FIGURE 14.1 An overview of the eukaryotic cell cycle. This diagram of the cell cycle 
indicates the stages through which a cell passes from one division to the next. The cell cycle is 
divided into two major phases: M phase and interphase. M phase includes the successive events 
of mitosis and cytokinesis. Interphase is divided into G,, S, and Gz phases, with S phase being 
equivalent to the period of DNA synthesis. The division of interphase into three separate phases 
based on the timing of DNA synthesis was first proposed in 1953 by Alma Howard and Stephen 
Pelc of Hammersmith Hospital, London, based on their experiments on plant meristem cells. 


into two major phases based on cellular activities readily visible with 
a light microscope: M phase and interphase. M phase includes (1) 
the process of mitosis, during which duplicated chromosomes are 
separated into two nuclei, and (2) cytokinesis, during which the 
entire cell divides into two daughter cells. Interphase, the period 
between cell divisions, is a time when the cell grows and engages in 
diverse metabolic activities. Whereas M phase usually lasts only an 
hour or so in mammalian cells, interphase may extend for days, 
weeks, or longer, depending on the cell type and the conditions. 

Although M phase is the period when the contents of a cell are 
actually divided, numerous preparations for an upcoming mitosis 
occur during interphase, including replication of the cells DNA. 
One might guess that a cell engages in replication throughout inter- 
phase. However, studies in the early 1950s on asynchronous cultures 
(ie., cultures whose cells are randomly distributed throughout the 
cell cycle) showed that this is not the case. As described in Chapter 13, 
DNA replication can be monitored by the incorporation of [*H]thy- 
midine into newly synthesized DNA. If [*H]thymidine is given to a 
culture of cells for a short period (e.g., 30 minutes) and a sample of 
the cell population is fixed, dried onto a slide, and examined by auto- 
radiography, only a fraction of the cells are found to have radioactive 
nuclei. Among cells that were engaged in mitosis at the time of fixa- 
tion (as evidenced by their compacted chromosomes), none is found 
to have a radioactively labeled nucleus. These mitotic cells have 
unlabeled chromosomes because they were not engaged in DNA 
replication during the labeling period. 

If labeling is allowed to continue for one or two hours before the 
cells are sampled, there are still no cells with labeled mitotic chromo- 
somes (FIGURE 14.2). We can conclude from these results that there 
is a definite period of time between the end of DNA synthesis and 
the beginning of M phase. This period is termed G, (for second gap). 
The duration of G, is revealed as one continues to take samples of 
cells from the culture until labeled mitotic chromosomes are 
observed. The first cells whose mitotic chromosomes are labeled 
must have been at the last stages of DNA synthesis at the start of the 
incubation with [*H]thymidine. The length of time between the start 
of the labeling period and the appearance of cells with labeled mitotic 
figures corresponds to the duration of G5. 
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FIGURE 14.2 Experimental results demonstrating that replication 
occurs during a defined period of the cell cycle. HeLa cells were cultured 
for 30 minutes in medium containing [>H]thymidine and then incubated 
(chased) for various times in unlabeled medium before being fixed and 
prepared for autoradiography. Each culture dish was scanned for cells that 
were in mitosis at the time they were fixed, and the percentage of those 
mitotic cells whose chromosomes were labeled was plotted as shown. 


Source: From a study by R. Baserga and F. Wiebel. 


DNA replication occurs during a period of the cell cycle termed 541 


S phase. S phase is also the period when the cell synthesizes the 
additional histones that will be needed as the cell doubles the number 
of nucleosomes in its chromosomes (see Figure 13.23). The length of 
S phase can be determined directly. In an asynchronous culture, the 
percentage of cells engaged in a particular activity is an approximate 
measure of the percentage of time that this activity occupies in the 
lives of cells. Thus, if we know the length of the entire cell cycle, the 
length of S phase can be calculated directly from the percentage of 
the cells whose nuclei are radioactively labeled during a brief pulse 
with [*H]thymidine. Similarly, the length of M phase can be calcu- 
lated from the percentage of cells in the population that are seen to be 
engaged in mitosis or cytokinesis. When one adds up the periods of 
G, +S + M, it is apparent that there is an additional period in the cell 
cycle yet to be accounted for. This other phase, termed G, (for first 
gap), is the period following mitosis and preceding DNA synthesis. 


Cell Cycles in Vivo 


One of the properties that distinguishes various types of cells within 
a multicellular plant or animal is their capacity to grow and divide. 
We can recognize three broad categories of cells: 


1. Cells, such as neurons, muscle cells, or red blood cells, that are 
highly specialized and lack the ability to divide. Once these cells 
have differentiated, they remain in that state until they die. 


2. Cells that normally do not divide but can be induced to begin 
DNA synthesis and divide when given an appropriate stimulus. 
Included in this group are liver cells, which can be induced to 
proliferate by the surgical removal of part of the liver, and lym- 
phocytes, which can be induced to proliferate by interaction with 
an appropriate antigen. 


3. Cells that normally possess a relatively high level of mitotic 
activity. Included in this category are stem cells of various adult 
tissues, such as hematopoietic stem cells that give rise to red and 
white blood cells (Figure 17.6) and stem cells at the base of numer- 
ous epithelia that line the body cavities and the body surface 
(Figure 7.1). The relatively unspecialized cells of apical meri- 
stems located near the tips of plant roots and stems also exhibit 
rapid and continual cell division. Stem cells have an important 
property that is not shared by most cells; they are able to divide 
asymmetrically. An asymmetric cell division is one in which 
the two daughter cells have different sizes, components, or fates. 
The asymmetric division of a stem cell produces one daughter 
cell that remains an uncommitted stem cell like its parent and 
another daughter cell that has taken a step toward becoming a 
differentiated cell of that tissue. In other words, asymmetric divi- 
sions allow stem cells to engage in both self-renewal and the for- 
mation of differentiated tissue cells. Some types of nonstem cells 
can also engage in asymmetric cell divisions, as illustrated by the 
formation of oocytes and polar bodies in Figure 14.41b. 


Cell cycles can range in length from as short as 30 minutes in a 
cleaving frog embryo, whose cell cycles lack both G, and G) phases, 
to several months in slowly growing tissues, such as the mammalian 
liver. Many cells in the body are said to be quiescent, which means 
that they are in a state that will not lead them to an upcoming cell 
division, but they retain the capability to divide if conditions should 
change. With a few notable exceptions, cells that have stopped divid- 
ing are arrested in a stage preceding the initiation of DNA synthesis. 
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guish them from the typical G,-phase cells that may soon enter 
S phase. A cell must receive a growth-promoting signal to proceed 
from Gy into G, phase and thus reenter the cell cycle. 


REVIEW | 


1. What is the cell cycle? What are the stages of the cell 
cycle? How does the cell cycle vary among different 
types of cells? 

2. Describe how [°H]thymidine and autoradiography 
can be used to determine the length of the various 
periods of the cell cycle. 


14.2 Regulation of the Cell Cycle 


The study of the cell cycle is not only important in basic cell biology, 
but it also has enormous practical implications in combating cancer, a 
disease that results from a breakdown in a cell’s ability to regulate its 
own division. In 1970, a series of cell fusion experiments carried out 
by Potu Rao and Robert Johnson of the University of Colorado helped 
open the door to understanding how the cell cycle is regulated. 

Rao and Johnson wanted to know whether the cytoplasm of cells 
contains regulatory factors that affect cell cycle activities. They 
approached the question by fusing mammalian cells that were in dif- 
ferent stages of the cell cycle. In one experiment, they fused mitotic 
cells with cells in other stages of the cell cycle. The mitotic cell always 
induced compaction of the chromatin in the nucleus of the nonmi- 
totic cell (FIGURE 14.3). Ifa G,-phase and an M-phase cell were fused, 
the chromatin of the G,-phase nucleus underwent premature chromo- 
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somal compaction to form a set of elongated compacted chromo- 
somes (Figure 14.3a). If a G)-phase and M-phase cell were fused, the 
G, chromosomes also underwent premature chromosome compaction, 
but unlike those of a G, nucleus, the compacted G, chromosomes 
were visibly doubled, reflecting the fact that replication had already 
occurred (Figure 14.3c). If a mitotic cell was fused with an S-phase 
cell, the S-phase chromatin also became compacted (Figure 14.3b). 
However, replicating DNA is especially sensitive to damage, so that 
compaction in the S-phase nucleus led to the formation of “pulver- 
ized” chromosomal fragments rather than intact, compacted chromo- 
somes. The results of these experiments suggested that the cytoplasm 
of a mitotic cell contained diffusible factors that could induce mitosis 
in a nonmitotic (ie., interphase) cell. This finding suggested that the 
transition from G, to M was under positive control; that is, the transi- 
tion was induced by the presence of some stimulatory agent. 

While the cell-fusion experiments revealed the existence of fac- 
tors that regulated the cell cycle, they provided no information about 
the biochemical properties of these factors. Insights into the nature of 
the agents that promote entry of a cell into mitosis (or meiosis) were 
first gained in a series of experiments on the oocytes and early 
embryos of frogs and invertebrates. These experiments are described 
in the Experimental Pathways in Section 14.3. To summarize here, it 
was shown that entry of a cell into M phase is initiated by a protein 
called maturation-promoting factor (MPF). MPF consists of two sub- 
units: (1) a subunit with kinase activity that transfers phosphate groups 
from ATP to specific serine and threonine residues of specific protein 
substrates and (2) a regulatory subunit called cyclin. The term cyclin 
was coined because the concentration of this regulatory protein rises 
and falls in a predictable pattern with each cell cycle (FIGURE 14.4). 
When the cyclin concentration is low, the kinase lacks the cyclin sub- 
unit and, as a result, is inactive. When the cyclin concentration rises, 
the kinase is activated, causing the cell to enter M phase. These results 
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FIGURE 14.3 Experimental demonstration that cells contain factors that stimulate entry into mitosis. The photographs show the results of the fusion 
of an M-phase HeLa cell with a rat kangaroo PtK2 cell that had been in (a) G; phase, (b) S phase, or (c) G2 phase at the time of cell fusion. As described in 
the text, the chromatin of the Gj;-phase and G>-phase PtK2 cells undergoes premature compaction, whereas that of the S-phase cell becomes pulverized. 
The elongated chromatids of the G2-phase cell in c are doubled in comparison with those of the G; cell in a. 


Source: From Karl Sperling and Potu N. Rao, Humangenetik 23:237, 1974. With kind permission of Springer Science + Business Media. 


Mitosis Mitosis Mitosis FIGURE 14.4 Fluctuation of cyclin and MPF levels during the cell cycle. This drawing 
depicts the cyclical changes that occur during early frog development when mitotic divisions 
occur rapidly and synchronously in all cells of the embryo. The top tracing shows the 
alternation between periods of mitosis and interphase, the middle tracing shows the cyclical 
changes in MPF kinase activity, and the lower tracing shows the cyclical changes in the 


concentrations of cyclins that control the relative activity of the MPF kinase. 

Source: From A. W. Murray and M. W. Kirschner, Science 246:616, 1989; copyright 1989, 
AAAS. Science by Moses King, reproduced with permission of American Association for the 
Advancement of Science in the format reuse in a Book/Textbook via Copyright Clearance 


Interphase reas inierenase! 


MPF activity 


High 


Low Center. 


I I 

; I I 

Cyclin I I High 
I I 


Low 


suggested that (1) progression of cells into mitosis depends on an 
enzyme whose sole activity is to phosphorylate other proteins, and 
(2) the activity of this enzyme is controlled by a subunit whose con- 
centration varies from one stage of the cell cycle to another. 


REVIEW | 


1. What is the effect of fusing a G,-phase cell with one 
in M; of fusing a G>- or S-phase cell with one in M? 


2. How does the activity of MPF vary throughout the cell 
cycle? How is this correlated with the concentration 
of cyclins? How does the cyclin concentration affect 
MPF activity? 


eo -—_- 


14.3 EXPERIMENTAL PATHWAYS 


The Discovery and Characterization of MPF 


As an amphibian oocyte nears the end of oogenesis, the large 
nucleus (called a germinal vesicle) moves toward the periphery of 
the cell. In subsequent steps, the nuclear envelope disassembles, the 
compacted chromosomes become aligned along a metaphase plate 
near one end (the animal pole) of the oocyte, and the cell undergoes 
the first meiotic division to produce a large secondary oocyte and a 
small polar body. The processes of germinal vesicle breakdown and 
first meiotic division are referred to as maturation and can be induced 
in fully grown oocytes by treatment with the steroid hormone proges- 
terone. The first sign of maturation in the hormone-treated amphibian 
oocyte is seen 13-18 hours following progesterone treatment as the 
germinal vesicle moves near the oocyte surface. Germinal vesicle 
breakdown soon follows, and the oocyte reaches metaphase of the 
second meiotic division by about 36 hours after hormone treatment. 
Progesterone induces maturation only if it is applied to the external 
medium surrounding the oocyte; if the hormone is injected into the 
oocyte, the oocyte shows no response.' It appears that the hormone 
acts at the cell surface to trigger secondary changes in the cytoplasm 


of the oocyte that lead to germinal vesicle breakdown and the other 
changes associated with maturation. 

To learn more about the nature of the cytoplasmic change 
that was responsible for triggering maturation, Yoshio Masui of 
the University of Toronto and Clement Markert of Yale University 
began a series of experiments in which they removed cytoplasm 
from isolated frog oocytes at various stages following progesterone 
treatment and injected 40-60 nanoliters (nl) of the donor cytoplasm 
into fully grown, immature oocytes that had not been treated with 
the hormone.” They found that cytoplasm taken from oocytes during 
the first 12 hours following progesterone treatment had little or no 
effect on recipient oocytes. After this period, however, the cytoplasm 
gained the ability to induce maturation in the recipient oocyte. The 
cytoplasm from the donor oocyte was maximally effective about 20 
hours after progesterone treatment, and its effectiveness declined by 
40 hours (FIGURE 1). However, cytoplasm taken from early embryos 
continued to show some ability to induce oocyte maturation. Masui 
and Markert referred to the cytoplasmic substance(s) that induce 


FIGURE 1. Change of activity of the maturation-promoting factor in 
the oocyte cytoplasm of Rana pipiens during the course of maturation 
and early development. Ordinate: the ratio of frequency of induced 
maturation to volume of injected cytoplasm. The higher the ratio, the 
more effective the cytoplasm. nl, nanoliters of injected cytoplasm. 
Abscissa: age of the donors (hours after administration of 
progesterone). 

Source: Y. Masui and C. L. Markert, Journal Exp Zoology 177:142, 
1971. Reprinted with Permission from John Wiley & Sons 

Publishers, Inc. 
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maturation in recipient oocytes as “maturation promoting factor,” 
which became known as MPF. 

Because it was assumed that MPF was involved specifically in 
triggering oocyte maturation, relatively little interest was paid at 
first to the substance or its possible mechanism of action. Then in 
1978, William Wasserman and Dennis Smith of Purdue University 
published a report on the behavior of MPF during early amphibian 
development.’ It had been assumed that MPF activity present in 
early embryos was simply a residue of activity that had been pre- 
sent in the oocyte. But Wasserman and Smith discovered that MPF 
activity undergoes dramatic fluctuations in cleaving eggs that cor- 
relate with changes in the cell cycle. It was found, for example, that 
cytoplasm taken from cleaving frog eggs within 30-460 minutes 
after fertilization contains little or no detectable MPF activity, as 
assayed by injection into immature oocytes (FIGURE 2). However, 
if cytoplasm is taken from an egg at 90 minutes after fertilization, 
MPF activity can again be demonstrated. MPF activity reaches a 
peak at 120 minutes after fertilization and starts to decline again 
at 150 minutes (Figure 2). At the time the eggs undergo their first 
cytokinesis at 180 minutes, no activity is detected in the eggs. 
Then, as the second cleavage cycle gets underway, MPF activity 
once again reappears, reaching a peak at 225 minutes postfertil- 
ization, and then declining again to a very low level. Similar results 
were found in Xenopus eggs, except that the fluctuations in MPF 
activity occur more rapidly than in Rana and correlate with the 
more rapid rate of cleavage divisions in the early Xenopus embryo. 
Thus, MPF activity disappears and reappears in both amphibian 
species on a time scale that correlates with the length of the cell 
cycle. In both species, the peak of MPF activity corresponds to the 
time of nuclear membrane breakdown and the entry of the cells 
into mitosis. These findings suggested that MPF does more than 
simply control the time of oocyte maturation and, in fact, may play 
a key role in regulating the cell cycle of dividing cells. 
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FIGURE 2 Cycling of MPF activity in fertilized R. pipiens eggs. 
Ordinate: percent recipient oocytes undergoing germinal vesicle 
breakdown in response to 80 nanoliters of cytoplasm from fertilized 
eggs. Abscissa: time after fertilization when the Rana egg cytoplasm 
was assayed for MPF activity. Arrows indicate time of cleavage divisions. 
Source: W. J. Wasserman and L. D. Smith, Journal Cell Biology 78:R15, 
1978. The Journal of Cell Biology by Rockefeller Institute; American 
Society for Cell Biology Copyright 1978. Reproduced with permission of 
Rockefeller University Press in the format republish in a textbook via 
Copyright Clearance Center. 


It became apparent about this same time that MPF activity 
is not limited to amphibian eggs and oocytes but is present 
in a wide variety of organisms. It was found, for example, that 
mammalian cells growing in culture also possess MPF activity as 
assayed by the ability of mammalian cell extracts to induce ger- 
minal vesicle breakdown when injected into amphibian cocytes.* 
MPF activity of mammalian cells fluctuates with the cell cycle, as 
it does in dividing amphibian eggs. Extracts from cultured HeLa 
cells prepared from early G,-, late Gj-, or S-phase cells lack MPF 
activity (FIGURE 3). MPF appears in early Go, rises dramatically in 
late Gy, and reaches a peak in mitosis. 

Another element of the machinery that regulates the cell cycle 
was discovered in studies on sea urchin embryos. Sea urchin eggs 
are favorite subjects for studies of cell division because the mitotic 
divisions following fertilization occur rapidly and are separated by 
highly predictable time intervals. If sea urchin eggs are fertilized in 
seawater containing an inhibitor of protein synthesis, the eggs fail 
to undergo the first mitotic division, arresting at a stage prior to 
chromosome compaction and breakdown of the nuclear envelope. 
Similarly, each of the subsequent mitotic divisions can also be 
blocked if an inhibitor of protein synthesis is added to the medium 
at a time well before the division would normally occur. This finding 
had suggested that one or more proteins must be synthesized dur- 
ing each of the early cell cycles if the ensuing mitotic division is to 
occur. But early studies on cleaving sea urchin eggs failed to reveal 
the appearance of new species of proteins during this period. 

In 1983, Tim Hunt and his colleagues at the Marine Biological 
Laboratory at Woods Hole reported on several proteins that 
are synthesized in fertilized sea urchin eggs but not unfertilized 
eggs.° To study these proteins further, they incubated fertilized 
eggs in seawater containing [S]methionine and withdrew sam- 
ples at 10-minute intervals beginning at 16 minutes after fertiliza- 
tion. Crude protein extracts were prepared from the samples and 
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FIGURE 3 Maturation-promoting activity of HeLa cell extracts during 
different stages of the cell cycle. Because 228 ng of mitotic protein 
induced germinal vesicle breakdown (GVBD) in 100 percent of the 
cases, the percent activity for other phases of the cell cycle was 
normalized to that amount of protein. E, early; M, mid-, and L, late. 


Source: P. S. Sunkara, D. A. Wright, and P. N. Rao, Proc. Natl Acad. 
Science USA 76:2801, 1979. 


subjected to polyacrylamide gel electrophoresis, and the labeled 
proteins were located autoradiographically. Several prominent 
bands were labeled in gels from fertilized egg extracts that were 
not evident in comparable extracts made from unfertilized eggs. 
One of the bands that appeared strongly labeled at early stages 
after fertilization virtually disappeared from the gel by 85 minutes 
after fertilization, suggesting that the protein had been selectively 
degraded. This same band then reappeared in gels from eggs 
sampled at later times and disappeared once again in a sample 
taken at 127 minutes after fertilization. The fluctuations in the 
amount of this protein are plotted in FIGURE 4 (protein band A) 
together with the cleavage index, which indicates the time course 
of the first two cell divisions. The degradation of the protein 
occurs at about the same time that the cells undergo the first and 
second division. A similar protein was found in the eggs of the 
surf clam, another invertebrate whose eggs are widely studied. 
Hunt and colleagues named the protein “cyclin” and noted the 
striking parallel in behavior between the fluctuations in cyclin 
levels in their investigation and MPF activity in the earlier studies. 
Subsequent studies showed that there were two distinct cyclins, 
A and B, which are degraded at different times during the cell 
cycle. Cyclin A is degraded during a 5-6 minute period beginning 
just before the metaphase-anaphase transition, and cyclin B 
is degraded a few minutes after this transition. These studies 
provided the first indication of the importance of controlled pro- 
teolysis (page 578) in the regulation of a major cellular activity. 
The first clear link between cyclin and MPF was demon- 
strated by Joan Ruderman and her colleagues at the Woods Hole 
Marine Biological Laboratory.® In these studies, an mRNA encod- 
ing cyclin A was transcribed in vitro from a cloned DNA fragment 
that contained the entire cyclin A coding sequence. The identity 
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FIGURE 4 Correlation of the level of cyclin with the cell division cycle. 
A suspension of eggs was fertilized, and after 6 minutes, [25S] methionine 
was added. Samples were taken for analysis by gel electrophoresis at 
10-minute intervals, starting at 16 minutes after fertilization. The 
autoradiograph of the electrophoretic gel was scanned for label density, 
and the data were plotted as shown. Protein A, which varies according 
to the cell cycle and is named cyclin, is shown as solid circles. Protein B 
(not to be confused with cyclin B), which shows no cell cycle fluctuation, 
is plotted as solid triangles. The percentage of cells undergoing division 
at any given time period is given as the cleavage index (open squares). 
Source: T. Evans et al., Cell 33:391, 1983. Cell by Cell Press. Reproduced 
with permission of Cell Press in the format reuse in a book/textbook via 
Copyright Clearance Center. 


of this mRNA was verified by translating it in vitro and finding 
that it encoded authentic clam cyclin A. When the synthetic cyclin 
mRNA was injected into Xenopus oocytes, the cells underwent 
germinal vesicle breakdown and chromosome compaction over 
a time course not unlike that induced by progesterone treatment 
(FIGURE 5). These results suggested that the rise in cyclin A, which 
occurs normally during meiosis and mitosis, has a direct role in 
promoting entry into M phase. The amount of cyclin A normally 
drops rapidly and must be resynthesized prior to the next division 
or the cells cannot reenter M phase. 

But what is the relationship between cyclins and MPF? One 
of the difficulties in answering this question was the use of differ- 
ent organisms. MPF had been studied primarily in amphibians, 
and cyclins in sea urchins and clams. Evidence indicated that frog 
oocytes contain a pool of inactive pre-MPF molecules, which are 
converted to active MPFs during meiosis |. Cyclin, on the other 
hand, is totally absent from clam oocytes but appears soon after 
fertilization. Ruderman considered the possibility that cyclin A is 
an activator of MPF. We will return to this shortly. 

Meanwhile, another line of research was initiated to purify 
and characterize the substance responsible for MPF activity. In 
1980, Michael Wu and John Gerhart of the University of California, 
Berkeley, accomplished a 20- to 30-fold purification of MPF by 
precipitating the protein in ammonium sulfate and subjecting the 
redissolved material to column chromatography. In addition to 
stimulating oocyte maturation, injections of the partially purified 
MPF stimulated the incorporation of **P into proteins of the 
amphibian oocyte.’ When partially purified MPF preparations 
were incubated with [°@P]ATP in vitro, proteins present within the 
sample became phosphorylated, suggesting that MPF induced 
maturation by acting as a protein kinase. 
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FIGURE 5. Kinetics of Xenopus oocyte activation by progesterone 
and cyclin AmRNA. Large, immature oocytes were isolated from 
fragments of ovaries and were incubated with progesterone or 
microinjected with varying amounts of cyclin A mRNA. At 3 to 4 hours 
after injection, obviously damaged oocytes were removed (about 2-4 
per starting group of 20), and the remaining ones (which represent the 
100% value) were allowed to develop. Germinal vesicle breakdown 
(GVBD) and oocyte activation were indicated by the formation of a 
white spot in the region of the animal pole and were confirmed by 
dissection of the oocytes. 
Source: From K. |. Swenson, K. M. Farrell, and J. V. Ruderman, Cell 47: 
865, 1986. Cell by Cell Press. Reproduced with permission of Cell Press 
in the format reuse in a book/textbook via Copyright Clearance Center. 
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MPF was finally purified in 1988 by a series of six successive 
chromatographic steps.® MPF activity in these purified prepa- 
rations was consistently associated with two polypeptides, one 
having a molecular mass of 32 kDa and the other of 45 kDa. 
The purified MPF preparation possessed a high level of protein 
kinase activity, as determined by the incorporation of radioactivity 
from [@PJATP into proteins. When the purified preparation was 
incubated in the presence of [°*PJATP, the 45-kDa polypeptide 
became labeled. 

By the end of the 1980s, the efforts to uncover the role 
of cyclins and MPF had begun to merge with another line of 
research that had been conducted on fission yeast by Paul Nurse 
and his colleagues at the University of Oxford.’ It had been shown 
that yeast produced a protein kinase with a molecular weight of 
34 kDa whose activity was required for these cells to enter M 
phase (discussed in Section 14.4). The yeast protein was called 
p34°% or simply cdc2. The first evidence of a link between cdc2 
and MPF came as the result of a collaboration between yeast 
and amphibian research groups.'°'' Recall from the previous 
study that MPF was found to contain a 32- and 45-kDa protein. 
Antibodies formed against cdc2 from fission yeast were shown 
to react specifically with the 32-kDa component of MPF isolated 
from Xenopus eggs. These findings indicate that this component 
of MPF is a homologue of the 34-kDa yeast kinase and, therefore, 
that the machinery controlling the cell cycle in yeast and verte- 
brates contains evolutionarily conserved components. 

A similar study using antibodies against yeast cdc2 showed 
that the homologous protein in vertebrates does not fluctuate 
during the cell cycle.'* This supports the proposal that the 32- 
kDa protein kinase in vertebrate cells depends on another pro- 
tein. The modulator was predicted to be cyclin, which rises in 
concentration during each cell cycle and is then destroyed as cells 
enter anaphase. This proposal was subsequently verified in a number 
of studies in which the MPF was purified from amphibians, clams, 
and starfish, and its polypeptide composition analyzed.'*"° In all 
of these cases, it was shown that the active MPF present in M-phase 
animal cells is a complex consisting of two types of subunits: (1) a 
32-kDa subunit that contains the protein kinase active site and 
is homologous to the yeast cdc2 protein kinase, and (2) a larger 
subunit (45 kDa) identified as a cyclin whose presence is required 
for kinase activity. The studies described in this Experimental 
Pathways provided a unified view of the regulation of the cell 
cycle in all eukaryotic organisms. In addition, they set the stage 
for analysis of the numerous factors that control the activity of 


MPF (cdc2) at various points during yeast and mammalian cell 
cycles, which have become the focus of attention in the past 
few years. Many of the most important findings of these recent 
studies are discussed in the first section of this chapter. 
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14.4 Control of the Cell Cycle: 
The Role of Protein Kinases 


Over the past two decades, a large number of laboratories have 
focused on MPF-like enzymes, called cyclin-dependent kinases 
(Cdks). It has been found that Cdks are not only involved in M 
phase but are the key agents that orchestrate activities throughout 
the cell cycle. Cdks carry out this function by phosphorylating a 
diverse array of proteins. Each phosphorylation event occurs at an 
appropriate point during the cell cycle, thereby stimulating or inhib- 
iting a particular cellular process involved in cell division. Yeast cells 
have been particularly useful in studies of the cell cycle, at least in 
part because of the availability of temperature-sensitive mutants 
whose abnormal proteins affect various cell cycle processes. As dis- 
cussed on page 516, temperature-sensitive mutants can be grown in 


a relatively normal manner at a lower (permissive) temperature and 
then shifted to a higher (restrictive) temperature to study the effect 
of the mutant gene product. 

Researchers studying the genetic control of the cell cycle have 
focused on two distantly related yeast species, the budding yeast 
Saccharomyces cerevisiae, which reproduces through the formation 
of buds at one end of the cell (see Figure 1.180), and the fission yeast, 
Schizosaccharomyces pombe, which reproduces by elongating itself 
and then splitting into two equal-sized cells (see Figure 14.6). The 
molecular basis of cell cycle regulation has been remarkably 
conserved throughout the evolution of eukaryotes. Once a gene 
involved in cell cycle control has been identified in one of the two 
yeast species, homologues are sought—and usually found—in the 
genomes of higher eukaryotes, including humans. By combining 
genetic, biochemical, and live-cell analyses, investigators have gained 


a comprehensive understanding of the major activities that allow a 
cell to grow and reproduce in a laboratory culture dish. Research 
into the genetic control of the cell cycle in yeast began in the 1970s 
in two laboratories, initially that of Leland Hartwell at the University 
of Washington working on budding yeast and subsequently that of 
Paul Nurse at the University of Oxford working on fission yeast. 
Both laboratories identified a gene that, when mutated, would cause 
the growth of cells at elevated temperature to stop at certain points 
in the cell cycle. The product of this gene, which was called cdc2 in 
fission yeast (and CDC28 in budding yeast), was eventually found to 
be homologous to the catalytic subunit of MPF; in other words, it 
was a cyclin-dependent kinase. Subsequent research on yeast as well 
as many different vertebrate cells has supported the concept that the 
progression of a eukaryotic cell through its cell cycle is regulated at 
distinct stages. One of the primary stages of regulation occurs near 
the end of G, and another near the end of G,. These stages represent 
points in the cell cycle where a cell becomes committed to beginning 
a crucial event—either initiating replication or entering mitosis. 

We will begin our discussion with fission yeast, which has the 
least complex cell cycle. In this species, the same Cdk (cdc2) is 
responsible for passage through both points of commitment, though 
in partnership with different cyclins. A simplified representation of 
cell cycle regulation in fission yeast is shown in FIGURE 14.5. The 
first transition point, which is called START, occurs in late G;. Once 
a cell has passed START, it is irrevocably committed to replicating its 
DNA and, ultimately, completing the cell cycle.' Passage through 
START requires the activation of cdc2 by one or more G,/S cyclins, 
whose levels rise during late G; (Figure 14.5). 

Passage from G, to mitosis requires activation of cdc2 by a 
different group of cyclins—the mitotic cyclins. Cdks containing a 


‘Mammalian cells pass through a comparable point during Gy, referred to as the 
restriction point, at which time they become committed to DNA replication and ulti- 
mately to completing mitosis. Prior to the restriction point, mammalian cells require 
the presence of growth factors in their culture medium if they are to progress in the 
cell cycle. After they have passed the restriction point, these same cells will continue 
through the remainder of the cell cycle without external stimulation. 
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FIGURE 14.5 A simplified model for cell cycle regulation in fission 
yeast. The cell cycle is controlled primarily at two points, START and the 
G>-M transition. Passage of a cell through these two critical junctures 
(purple arrowheads) requires the activation of the same cdc2 kinase by 
different classes of cyclins, either G,/S or mitotic cyclins. A third major 
transition occurs at the end of mitosis and is triggered by a rapid drop in 
concentration of mitotic cyclins. (Note: cdc2 is also known as Cdk1.) 
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substrates that are required for the cell to enter mitosis. Included 
among the substrates are proteins required for the dynamic changes 
in organization of both the chromosomes and cytoskeleton that 
characterize the shift from interphase to mitosis. Cells make a third 
commitment during the middle of mitosis, which determines 
whether they will complete cell division and reenter G, of the next 
cycle. Exit from mitosis and entry into G, depends on a rapid 
decrease in Cdk activity that results from a plunge in concentration 
of the mitotic cyclins (Figure 14.5), an event that will be discussed in 
Section 14.10 in conjunction with other mitotic activities. 

Cyclin-dependent kinases are often described as the “driver” 
that advances the cell cycle through its various stages. The activities 
of these enzymes are regulated by a variety of “brakes” and “accelera- 
tors” that operate in combination with one another. 


Cyclin Binding 


As indicated in Figure 14.5, the levels of particular cyclins rise over 
time. When a cyclin reaches a sufficient concentration in the cell, it 
binds to the catalytic subunit of a Cdk, causing a major change in the 
conformation of the enzyme’s active site. X-ray crystallographic 
structures of various cyclin-Cdk complexes indicate that cyclin 
binding causes the movement of a flexible loop of the Cdk polypep- 
tide chain away from the opening of the active site, allowing the Cdk 
to phosphorylate its protein substrates. 


Cdk Phosphorylation/Dephosphorylation 


We have already seen in other chapters that many events that take 
place in a cell are regulated by the addition and removal of phosphate 
groups from proteins. The same is true for the events that lead to the 
onset of mitosis. We can see from Figure 14.5 that the level of mitotic 
cyclins rises through S and G,. The mitotic cyclins present in a yeast 
cell during this period bind to the Cdk to form a cyclin-Cdk com- 
plex, but the complex shows little evidence of kinase activity. Then, 
late in G), the cyclin-Cdk becomes activated and mitosis is triggered. 
To understand this change in Cdk activity, we have to look at the 
activity of three other regulatory enzymes—two kinases and a phos- 
phatase. We will look briefly at the events that occur in fission yeast 
(FIGURE 14.6a). The roles of these enzymes in the fission yeast cycle, 
which is illustrated in Figure 14.6b, was revealed through a combina- 
tion of genetic and biochemical analyses. In step 1, one of the kinases, 
called CAK (Cdk-activating kinase), phosphorylates a critical threo- 
nine residue (Thr 161 of cdc2 in Figure 14.6b). Phosphorylation of 
this residue is necessary, but not sufficient, for the Cdk to be active. A 
second protein kinase shown in step 1, called Weel, phosphorylates a 
key tyrosine residue in the ATP-binding pocket of the enzyme (Tyr 15 
of cdc2 in Figure 14.60). If this residue is phosphorylated, the enzyme 
is inactive, regardless of the phosphorylation state of any other resi- 
due. In other words, the effect of Weel overrides the effect of CAK, 
keeping the Cdk in an inactive state. Line 2 of Figure 14.6c shows the 
phenotype of cells with a mutant weel gene. These mutants cannot 
maintain the Cdk in an inactive state and divide at an early stage in 
the cell cycle producing smaller cells, hence the name “wee. In 
normal (wild-type) cells, Weel keeps the Cdk inactive until the end 
of G,. Then, at the end of G,, the inhibitory phosphate at Tyr 15 is 
removed by the third enzyme, a phosphatase named Cdc25 (step 2, 
Figure 14.6b). Removal of this phosphate switches the stored cyclin- 
Cdk molecules into the active state, allowing it to phosphorylate key 
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(b) During Go, the cdc2 kinase interacts with a mitotic cyclin but remains 
inactive as the result of phosphorylation of a key tyrosine residue (Tyr 15 in 
fission yeast) by Wee'l (step 1). A separate kinase, called CAK, transfers a 
phosphate to another residue (Thr 161), which is required for cdc2 kinase 


activity later in the cell cycle. When the cell reaches a critical size, an enzyme 


called Cdc25 phosphatase is activated, which removes the inhibitory 
phosphate on the Tyr 15 residue. The resulting activation of the cdc2 kinase 
drives the cell into mitosis (step 2). By the end of mitosis (step 3), the 
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stimulatory phosphate group is removed from Thr 161 by another phosphatase. The free cyclin is subsequently degraded, and the cell begins another cycle. 
(The mitotic Cdk in mammalian cells is phosphorylated and dephosphorylated in a similar manner.) (c) Identification of Wee1 kinase and Cdc25 phosphatase 
was made by studying mutants that behaved as shown in this figure. Line 1 shows the Gz and M stages of a wild-type cell. Line 2 shows the effect of a 
mutant wee7 gene; the cell divides prematurely, forming small (wee) cells. Line 3 shows the effect of a mutant cdc25 gene; the cell does not divide but 
continues to grow. The red arrow marks the time when the temperature is raised to inactivate the mutant protein. 


Source: (a) Steve Gschmeissner/Photo Researchers, Inc.; (6) After T. R. Coleman and W. G. Dunphy, Curr. Opin. Cell Biol. 76:877, 1994. Current Opinion in 
Cell Biology by Elsevier Ltd. Reproduced with permission of Elsevier Ltd. in the format reuse in a Book/Textbook via Copyright Clearance Center. 


substrates and drive the yeast cell into mitosis. Line 3 of Figure 14.6c 
shows the phenotype of cells with a mutant cdc25 gene. These mutants 
cannot remove the inhibitory phosphate from the Cdk and cannot 
enter mitosis. The balance between Weel kinase and Cdc25 phos- 
phatase activities, which normally determines whether the cell will 
remain in G, or progress into mitosis, is regulated by still other 
kinases and phosphatases. As we will see shortly, these pathways can 
stop the cell from entering mitosis under conditions that might lead 
to an abnormal cell division. 


Cdk Inhibitors 


Cdk activity can be blocked by a variety of inhibitors. In budding 
yeast, for example, a protein called Sicl acts as a Cdk inhibitor dur- 
ing G,. The degradation of Sicl allows the cyclin—Cdk that is present 
in the cell to initiate DNA replication. The role of Cdk inhibitors in 
mammalian cells is discussed on page 552. 


Controlled Proteolysis 


It is evident from Figures 14.4 and 14.5 that cyclin concentrations 
oscillate during each cell cycle, which leads to changes in the activity 
of Cdks. Cells regulate the concentration of cyclins and other key cell 
cycle proteins by adjusting both the rate of synthesis and the rate of 
destruction of the molecule at different points in the cell cycle. 
Degradation is accomplished by means of the ubiquitin-proteasome 
pathway described in Section 12.21. Unlike other mechanisms that 
control Cdk activity, degradation is an irreversible event that helps 
drive the cell cycle in a single direction. Regulation of the cell cycle 
requires two classes of multisubunit complexes (SCF and APC 


complexes) that function as ubiquitin ligases. These complexes rec- 
ognize proteins to be degraded and link these proteins to a polyubiq- 
uitin chain, which ensures their destruction in a proteasome. The SCF 
complex is active from late G, through early mitosis (see Figure 14.26a) 
and mediates the destruction of G,/S cyclins, Cdk inhibitors, and 
other cell cycle proteins. These proteins become targets for an SCF 
after they are phosphorylated by the protein kinases (i.e., Cdks) that 
regulate the cell cycle. Mutations that inhibit SCFs from mediating 
proteolysis of key proteins, such as G,/S cyclins or the Cdk inhibitor 
Sicl mentioned above, can prevent cells from entering S phase and 
replicating their DNA. The APC complex acts in mitosis and 
degrades a number of key mitotic proteins, including the mitotic 
cyclins. Destruction of the mitotic cyclins allows a cell to exit mitosis 
and enter a new cell cycle (Section 14.10). 


Subcellular Localization 


Cells contain a number of different compartments in which regula- 
tory molecules can either be united with or separated from the pro- 
teins they interact with. Subcellular localization is a dynamic 
phenomenon in which cell cycle regulators are moved into different 
compartments at different stages. For example, one of the major 
mitotic cyclins in animal cells (cyclin B1) shuttles between the 
nucleus and cytoplasm until G,, when it accumulates in the nucleus 
just prior to the onset of mitosis (FIGURE 14.7). According to one 
proposal, nuclear accumulation of cyclin B1 is facilitated by phos- 
phorylation of one or more serine residues that reside in its nuclear 
export signal (NES, page 462). In this model, phosphorylation 
blocks subsequent export of the cyclin back to the cytoplasm. 
According to an alternate proposal, cyclin B1-Cdk1 stimulates it own 


translocation into the nucleus by phosphorylating and activating As noted above, the proteins and processes that control the cell 549 
components of the nuclear import machinery. Regardless of the cycle are remarkably conserved among eukaryotes. As in yeast, 
mechanism, if nuclear accumulation of the cyclin is blocked, cells successive waves of synthesis and degradation of different cyclins play 
fail to initiate cell division. a key role in driving mammalian cells from one stage to the next. 

Unlike yeast cells, which have a single Cdk, mammalian cells produce 


several different versions of this protein kinase. Different cyclin-Cdk 
complexes target different groups of substrates at different points 
within the cell cycle. The pairing between individual cyclins and Cdks 
is specific, and only certain combinations are found (FIGURE 14.8a). 
In mammalian cells, for example, the activity of a cyclin E-Cdk2 
complex drives the cell into S phase, whereas activity of a cyclin B1- 
Cdk1 complex (the mammalian MPF) drives the cell into mitosis. 
Cdks do not always stimulate activities, but can also inhibit inappro- 
priate events. For example, cyclin B1-Cdk1 activity during G, pre- 
vents a cell from rereplicating DNA that has already been replicated 
(a) (b) 


earlier in the cell cycle (page 527). This helps ensure that each region 
of the genome is replicated once and only once per cell cycle. 

The roles of the various cyclin-Cdk complexes shown in Figure 
14.84 have been determined by a wide range of biochemical studies 
carried out on mammalian cells for more than three decades. In 
recent years, the roles of these proteins have been reexamined in 
knockout mice, with some surprising results (Figure 14.8b). As 


FIGURE 14.7 Experimental demonstration of subcellular localization 
during the cell cycle. Micrographs of a living HeLa cell that has been 
injected with cyclin B1 linked to the green fluorescent protein (page 260). 
The cell shown in a is in the Gp phase of its cell cycle, and the fluorescently 
labeled cyclin B1 is localized almost entirely in the cytoplasm. The 
micrograph in b shows the cell in prophase of mitosis, and the labeled 


SOSEUIY Ul@}O1g JO BJOY SY :9[9AD |[9D SY} 4O |O1]UOD e ppl 


cyclin B1 is concentrated in the cell nucleus. The basis for this change in expected, the phenotype of a particular knockout mouse depends 
localization is discussed in the text. on the gene that has been eliminated. Mice that are unable to syn- 
Source: From Paul Clute and Jonathan Pines, Nature Cell Biol. 1:83, 1999. thesize Cdk1, cyclin B1, cyclins E1 and E2, or cyclin A2 die as early 
Reprinted by permission from Macmillan Publishers Limited. embryos, suggesting that the proteins encoded by these genes are 
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FIGURE 14.8 Cyclin-Cdks in the mammalian cell cycle. (a) Combinations between various cyclins and cyclin-dependent kinases at different stages in the 
mammalian cell cycle. Cdk activity during early G, is very low, which promotes the formation of prereplication complexes at the origins of replication (see Figure 
13.20). By mid-G,, Cdk activity is evident due to the association of Cdk4 and Cdké with the D-type cyclins (D1, D2, and D3). Among the substrates for these 
Cdks is an important regulatory protein called pRb (Section 16.3, Figure 16.12). The phosphorylation of pRb leads to the transcription of a number of genes, 
including those that code for cyclins E and A, Cdk1, and proteins involved in replication. The G,-S transition, which includes the initiation of replication, is 
driven by the activity of the cyclin E-Cdk2 and cyclin A-Cdk2 complexes. The transition from Gz to M and passage through early M is driven by the sequential 
activity of cyclin A-Cdk1 and cyclin B1-Cdk1 complexes, which phosphorylate such diverse substrates as cytoskeletal proteins, histones, and proteins of the 
nuclear envelope. (The mammalian Cdk1 kinase is equivalent to the fission yeast cdc2 kinase, and its inhibition and activation are similar to that indicated in 
Figure 14.6.) (b) The effects on mouse development of the deletion of genes (shown in red) encoding various Cdks. Of the four primary mammalian Cdks, only 
Cdk1 is absolutely required for cell division. Embryos that express only Cdk1 die during the course of embryonic development. Mice expressing both Cdk1 and 
Cdk4 develop into adults that are sterile, owing to defects in the meiotic cell cycles. E, embryonic day number; P, postnatal day number. 

Source: (a) C. J. Sherr, Cell 73:1060, 1993; Cell by Cell Press. Reproduced with permission of Cell Press in the format reuse in a book/textbook via Copyright 
Clearance Center. H. A. Coller, Nature Revs. Mol. Cell Biol. 8:667, 2007. Nature Reviews Molecular Cell Biology by Nature Publishing Group. Reproduced 
with permission of Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center; (b) Malumbers and Barbacid, 

Nat Revs Cancer 9, 160, 2009 Figure 2. Nature Reviews Cancer by Nature Publishing Group. Reproduced with permission of Nature Publishing Group 

in the format reuse in a book/textbook via Copyright Clearance Center. 
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lacks the genes encoding all of the other cell cycle Cdks (namely, 
Cdks 2, 4, and 6) is capable of developing to a stage with fully formed 
organs, although the animal does not survive to birth (Figure 14.8). 
Cells taken from such embryos are capable of proliferating in cul- 
ture, although more slowly than normal cells. This finding indicates 
that, as in yeast, Cdk1 is the only Cdk required to drive a mamma- 
lian cell through all of the stages of the cell cycle. In other words, 
even though the other Cdks are normally expressed at specific times 
during the mammalian cell cycle, Cdk1 is able to “cover” for their 
absence, ensuring that all of the required substrates are phosphoryl- 
ated at each stage of the cell cycle. This is a classic case of redun- 
dancy, in which a protein is able to carry out functions that it would 
not normally perform. Still, the absence of one of these “nonessen- 
tial” cyclins or Cdks typically results in distinct cell cycle abnormali- 
ties, at least in certain types of cells. Mice lacking a gene for cyclin 
D1, for example, are smaller than control animals, which stems 
from a reduction in the level of cell division throughout the body. In 
addition, cyclin D1-deficient animals display a particular lack of cell 
proliferation during development of the retina. Mice lacking Cdk4 
develop without insulin-producing cells in their pancreas. Mice 
lacking Cdk2 appear to develop normally but exhibit specific defects 
during meiosis (Figure 14.8b), which reinforces the important dif- 
ferences in the regulation of mitotic and meiotic divisions. 


REVIEW | 


1. What are the respective roles of CAK, Wee1, and 
Cdec25 in controlling Cdk activity in fission yeast cells? 
What is the effect of mutations in the wee or cdc25 
genes in these cells? 


14.5 Control of the Cell Cycle: 
Checkpoints, Cdk Inhibitors, 
and Cellular Responses 


Ataxia-telangiectasia (AT) is an inherited recessive disorder charac- 
terized by a host of diverse symptoms, including a greatly increased 
risk for certain types of cancer.* During the late 1960s following the 
deaths of several individuals undergoing radiation therapy, it was 
discovered that patients with AT are extremely sensitive to ionizing 
radiation (page 531). So too are cells from these patients, which 
lack a crucial protective response found in normal cells. When nor- 
mal cells are subjected to treatments that damage DNA, such as ion- 
izing radiation or DNA-altering drugs, their progress through the 
cell cycle stops while the damage is repaired. If, for example, a nor- 
mal cell is irradiated during the G, phase of the cell cycle, it delays 
progression into S phase. Similarly, cells irradiated in S phase delay 
further DNA synthesis, whereas cells irradiated in G, delay entry 
into mitosis. 


Other symptoms of the disease include unsteady posture (ataxia) resulting from 
degeneration of nerve cells in the cerebellum, permanently dilated blood vessels (tel- 
angiectasia) in the face and elsewhere, susceptibility to infection, and cells with an 
abnormally high number of chromosome aberrations. The basis for the first two 
symptoms has yet to be determined. 


Studies of this type carried out in yeast gave rise to a concept, 
formulated by Leland Hartwell and Ted Weinert in 1988, that cells 
possess checkpoints as part of their cell cycle. Checkpoints are sur- 
veillance mechanisms that halt the progress of the cell cycle if (1) any 
of the chromosomal DNA is damaged or (2) certain critical pro- 
cesses, such as DNA replication during S phase or chromosome 
alignment during M phase, have not been properly completed. 
Checkpoints ensure that each of the various events that make up the 
cell cycle occurs accurately and in the proper order. Many of the pro- 
teins of the checkpoint machinery have no role in normal cell cycle 
events and are only called into action when an abnormality appears. 
In fact, the genes encoding several checkpoint proteins were first 
identified in mutant yeast cells that continued their progress through 
the cell cycle, despite suffering DNA damage or other abnormalities 
that caused serious defects. 

Checkpoints are activated throughout the cell cycle by a system 
of sensors that recognize DNA damage or cellular abnormalities. If a 
sensor detects the presence of a defect, it triggers a response that 
temporarily arrests further cell cycle progress. The cell can then use 
the delay to repair the damage or correct the defect rather than con- 
tinuing to the next stage. This is especially important because mam- 
malian cells that undergo division with genetic damage run the risk 
of becoming transformed into a cancer cell. If the DNA is damaged 
beyond repair, the checkpoint mechanism can transmit a signal that 
leads either to (1) the death of the cell or (2) its conversion to a state 
of permanent cell cycle arrest (known as senescence). 

We have seen in numerous places in this text where the study of 
a rare human disease has led to a discovery of basic importance in 
cell and molecular biology. The cell’s DNA damage response pro- 
vides another example of this path to discovery. The gene responsi- 
ble for ataxia-telangiectasia (the ATM gene) encodes a protein kinase 
that is activated by certain DNA lesions, particularly double-stranded 
breaks (page 534). Remarkably, the presence of a single break in one 
of the cell’s DNA molecules is sufficient to cause rapid, large-scale 
activation of ATM molecules, causing cell cycle arrest. A related pro- 
tein kinase called ATR is also activated by DNA breaks as well as 
other types of lesions, including those resulting from incompletely 
replicated DNA or UV irradiation. Both ATM and ATR are part of 
multiprotein complexes capable of binding to chromatin that con- 
tains damaged DNA. Once bound, ATM and ATR can phosphoryl- 
ate a remarkable variety of proteins that participate in cell cycle 
checkpoints and DNA repair. 

How does a cell stop its progress from one stage of the cell cycle 
to the next? We will briefly examine two well-studied pathways 
available to mammalian cells to arrest their cell cycle in response to 
DNA damage. 


1. Ifacell preparing to enter mitosis is subjected to UV irradiation, 
ATR kinase is activated and the cell arrests in G,. ATR kinase 
molecules are thought to be recruited to sites of protein-coated, 
single-stranded DNA (step 1, FIGURE 14.9), such as those present 
as UV-damaged DNA is repaired (Figure 13.25). ATR phospho- 
rylates and activates a checkpoint kinase, called Chk1 (step 2), 
which in turn phosphorylates Cdc25 on a particular serine resi- 
due (step 3), making the Cdc25 molecule a target for a special 
adaptor protein that binds to Cdc25 in the cytoplasm (steps 4, 5). 
This interaction inhibits Cdc25’s phosphatase activity and pre- 
vents it from being reimported into the nucleus. As discussed on 
page 547, Cdc25 normally plays a key role in the G,/M transi- 
tion by removing inhibitory phosphates from Cdk1. Thus, the 


FIGURE 14.9 Models for the mechanism of action of two DNA-damage 
checkpoints. ATM and ATR are protein kinases that become activated following 
specific types of DNA damage. Each of these proteins acts through checkpoint 
signaling pathways that lead to cell cycle arrest. ATM becomes activated in 
response to double-strand breaks, which are detected by the MRN protein 
complex (step a). ATR, on the other hand, becomes activated by protein-coated 
ssDNA (step 1) that forms when replication forks become stalled or the DNA is 
being repaired after various types of damage. In the Gz pathway shown here, 
ATR phosphorylates and activates the checkpoint kinase Chk1 (step 2), which 
phosphorylates and inactivates the phosphatase Cdc25 (step 3), which normally 
shuttles between the nucleus and cytoplasm (step 4). Once phosphorylated, 
Cdc25 is bound by an adaptor protein in the cytoplasm (step 5) and cannot be 
reimported into the nucleus, which leaves the Cdk in its inactivated, phosphoryl- 
ated state (step 6). In the G; pathway shown here, ATM phosphorylates and 


activates the checkpoint kinase Chk2 (step b), which phosphorylates p53 (step c). 


p53 is normally very short-lived, but phosphorylation by Chk2 stabilizes the 
protein, enhancing its ability to activate p21 transcription (step d). Once 
transcribed and translated (step e), 921 directly inhibits the Cdk (step f). Many 
other proteins, including histone-modifying enzymes, chromatin remodeling 
complexes, and histone variants are involved in mediating the response to DNA 
damage but are not discussed. 

Source: See Curr. Opin. Cell Biol. 21:245, 2009; Nature Revs. Mol. Cell Biol. 10: 
243, 2009; Nature Cell Biol. 13:1161, 2011; and Genes Develop. 25:409, 2011. 


absence of Cdc25 from the nucleus leaves the Cdk in an inactive 
state (step 6) and the cell arrested in G3. 


2. Damage to DNA also leads to the synthesis of proteins that 
directly inhibit the cyclin-Cdk complex that drives the cell cycle. 
For example, cells exposed to ionizing radiation in G, synthesize 
a protein called p21 (molecular mass of 21 kDa) that inhibits the 
kinase activity of the G, Cdk. This prevents the cells from 
phosphorylating key substrates and from entering S phase. ATM 
is involved in this checkpoint mechanism. In this particular 
DNA-damage response, the breaks in DNA that are caused by 
ionizing radiation serve as sites for the recruitment of a protein 
complex termed MRN (step a, Figure 14.9). MRN can be consid- 
ered as a sensor of DNA breaks. MRN recruits and activates 
ATM, which phosphorylates and activates another checkpoint 
kinase called Chk2 (step b). Chk2 in turn phosphorylates a 
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transcription factor (p53) (step c), which leads to the transcrip- 
tion and translation of the p21 gene (steps d and e) and subse- 
quent inhibition of Cdk (step f). Approximately 50 percent of all 
human tumors show evidence of mutations in the gene that 
encodes p53, which reflects its importance in the control of cell 
growth. The role of p53 is discussed at length in Chapter 16. 


p21 is only one of at least seven known Cdk inhibitors. The 


interaction between a related Cdk inhibitor (p27) and one of the 
cyclin-Cdk complexes is shown in FIGURE 14.10a. In this structural 


FIGURE 14.10 p27: A Cdk inhibitor that arrests cell cycle progression. (a) Three-dimensional structure of a complex between p27 and cyclin A-Cdk2. 
Interaction with p27 alters the conformation of the Cdk catalytic subunit, inhibiting its protein kinase activity. (b) A pair of littermates at 12 weeks of age. In 
addition to possessing different genes for coat color, the mouse with dark fur has been genetically engineered to lack both copies of the p27 gene (denoted 
as p27”), which accounts for its larger size. (c) Comparison of the thymus glands from a normal (left) and a p27” mouse (right). The gland from the p27 


knockout mouse is much larger owing to an increased number of cells. 


Source: (a) From Alicia A. Russo et al., Nature 382:327, 1996, Fig. 2a. Courtesy of Nikola Pavletich, Howard Hughes Medical Institute; reprinted by permission 
of Macmillan Publishers Limited; (b) From Keiko Nakayama et al., courtesy of Kei-ichi Nakayama, Cell 85:710, 711, 1996; with permission from Elsevier; 
(c) From Keiko Nakayama et al., courtesy of Kei-ichi Nakayama, Cell 85:710, 711, 1996; with permission from Elsevier. 
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552 model, the p27 molecule drapes itself across both subunits of the 


UOISIAIG ||8D e¢ PL YALdVHO 


cyclin A~Cdk2 complex, changing the conformation of the catalytic 
subunit and inhibiting its kinase activity. In many cells, p27 must be 
phosphorylated and then degraded before progression into S phase 
can occur. 

Cdk inhibitors, such as p21 and p27, are also active in cell dif- 
ferentiation. Just before cells begin to differentiate—whether into 
muscle cells, liver cells, blood cells, or some other type—they typi- 
cally withdraw from the cell cycle and stop dividing. Cdk inhibitors 
are thought to either allow or directly induce cell cycle withdrawal. 
Just as the functions of specific Cdks and cyclins have been studied 
in knockout mice, so too have their inhibitors. Knockout mice that 
lack the p27 gene show a distinctive phenotype: they are larger than 
normal (Figure 14.10b), and certain organs, such as the thymus 
gland and spleen, contain a significantly greater number of cells than 
those of a normal animal (Figure 14.10c). In normal mice, the cells 
of these particular organs synthesize relatively high levels of p27, and 
it is presumed that the absence of this protein in the p27-deficient 
animals allows the cells to divide several more times before they 
differentiate. 


REVIEW | 


1. What is meant by a cell cycle checkpoint? What is its 
importance? How does a cell stop its progress at one 
of these checkpoints? 


e—______ 


14.6 Overview of M Phase: 
Mitosis and Cytokinesis 


Whereas our understanding of cell cycle regulation rests largely on 
genetic studies in yeast, our knowledge of M phase is based on more 
than a century of microscopic and biochemical research on animals 
and plants. The name “mitosis” comes from the Greek word mitos, 
meaning “thread” The name was coined in 1882 by the German 
biologist Walther Flemming to describe the threadlike chromo- 
somes that mysteriously appeared in animal cells just before they 
divided in two. The beauty and precision of cell division is best 
appreciated by watching a time-lapse video of the process (e.g., 
www.bio.unc.edu/faculty/salmon/lab/mitosis/mitosismovies.html) 
rather than reading about it in a textbook. 

Mitosis is a process of nuclear division in which the replicated 
DNA molecules of each chromosome are faithfully segregated into 
two nuclei. Mitosis is usually accompanied by cytokinesis, a process 
by which a dividing cell splits in two, partitioning the cytoplasm into 
two cellular packages. The two daughter cells resulting from mitosis 
and cytokinesis possess a genetic content identical to each other and 
to the mother cell from which they arose. Mitosis, therefore, main- 
tains the chromosome number and generates new cells for the 
growth and maintenance of an organism. Mitosis can take place in 
either haploid or diploid cells. Haploid mitotic cells are found in 
fungi, plant gametophytes, and a few animals (including male bees 
known as drones). Mitosis is a stage of the cell cycle when the cell 
devotes virtually all of its energy to a single activity—chromosome 
segregation. As a result, most metabolic activities of the cell, includ- 
ing transcription and translation, are curtailed during mitosis, and 
the cell becomes relatively unresponsive to external stimuli. 


We have seen in previous chapters how much can be learned 
about the factors responsible for a particular process by studying 
that process outside of a living cell (page 263). Our understanding 
of the biochemistry of mitosis has been greatly aided by the use of 
extracts prepared from frog eggs. These extracts contain stockpiles 
of all the materials (histones, tubulin, etc.) necessary to support 
mitosis. When chromatin or whole nuclei are added to the egg 
extract, the chromatin is compacted into mitotic chromosomes, 
which are segregated by a mitotic spindle that assembles spontane- 
ously within the cell-free mixture. In many experiments, the role of 
a particular protein in mitosis can be studied by removing that pro- 
tein from the egg extract by addition of an antibody (immunodeple- 
tion) and determining whether the process can continue in the 
absence of that substance (see Figure 14.21 for an example). 

Mitosis is generally divided into five stages (FIGURE 14.11), pro- 
phase, prometaphase, metaphase, anaphase, and telophase, each char- 
acterized by a particular series of events. Keep in mind that each of 
these stages represents a segment of a continuous process; the divi- 
sion of mitosis into arbitrary phases is done only for the sake of dis- 
cussion and experimentation. 


REVIEW | 


1. What takes place during the mitotic stage of cell 
cycle? What happens during cytokinesis? 


oe 


14.7 Prophase 


During the first stage of mitosis, prophase, the duplicated chromo- 
somes are prepared for segregation and the mitotic machinery is 
assembled. 


Formation of the Mitotic Chromosome 


The nucleus of an interphase cell contains tremendous lengths of 
chromatin fibers. The extended state of interphase chromatin is ide- 
ally suited for the processes of transcription and replication but not 
for segregation into two daughter cells. Before segregating its chro- 
mosomes, a cell converts them into much shorter, thicker structures 
by a remarkable process of chromosome compaction (or chro- 
mosome condensation), which occurs during early prophase 
(Figures 14.11 and 14.12). 

As described on page 467, the chromatin of an interphase cell is 
organized into fibers approximately 30 nm in diameter. Although 
there is debate on this issue, mitotic chromosomes are thought to be 
composed of similar types of fibers as seen by electron microscopic 
examination of whole chromosomes isolated from mitotic cells 
(FIGURE 14.13a). According to this viewpoint, chromosome com- 
paction does not alter the nature of the chromatin fiber, but rather 
the way that the chromatin fiber is packaged. Treatment of mitotic 
chromosomes with solutions that solubilize the histones and the 
majority of the nonhistone proteins reveals a structural framework 
or scaffold that retains the basic shape of the intact chromosome 
(Figure 14.13b). Loops of DNA are attached at their base to the 
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. Centromeres split, and chromatids 


separate. 
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. Daughter cells formed by cytokinesis. 


spindle poles. 
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Source: Micrographs Courtesy of Andrew Bajer. 


Telophase 


FIGURE 14.11 The stages of mitosis in an animal cell (left drawings) and a plant cell (right photos). 
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FIGURE 14.12 Prophase nuclear morphology. Light-optical section 
through two mouse cell nuclei in prophase, recorded with super-resolution 
3D-structured illumination microscopy (3D-SIM). Condensed chromosomes 
are shown in red, the nuclear envelope in blue and microtubules, in green. 
Scale bar is 5mm. 

Source: Courtesy of Lothar Schermelleh. 


nonhistone proteins that make up this chromosome scaffold (shown 
at higher magnification in Figure 12.15). 

Research on chromosome compaction has focused on an abun- 
dant multiprotein complex called condensin. The proteins of con- 
densin were discovered by incubating nuclei in frog egg extracts and 
identifying those proteins that associated with the chromosomes as 
they underwent compaction. Removal of condensin from the 
extracts prevented normal chromosome compaction. How is con- 
densin involved in such dramatic changes in chromatin architecture? 
There is very little data available from in vivo studies to answer this 
question, but there is considerable speculation. 

Supercoiled DNA occupies a much smaller volume than relaxed 
DNA (see Figure 10.13), and studies suggest that DNA supercoiling 
plays a key role in compacting a chromatin fiber into the tiny volume 
occupied by a mitotic chromosome. In the presence of a topoisomer- 
ase and ATP, condensin is able to bind to DNA in vitro and curl the 
DNA into positively supercoiled loops. This finding fits nicely with 
the observation that chromosome compaction at prophase requires 
topoisomerase II, which along with condensin is present as part of the 
mitotic chromosome scaffold (Figure 14.13b). A speculative model 
for condensin action is shown in Figure 14.14. Condensin is activated 
at the onset of mitosis by phosphorylation of several of its subunits by 
the cyclin-Cdk responsible for driving cells from G, into mitosis. 
Thus condensin is one of the targets through which Cdks are able to 
trigger cell cycle activities. The subunit structure of a V-shaped con- 
densin molecule is shown in the right inset of FIGURE 14.14. 

As the result of compaction, the chromosomes of a mitotic cell 
appear as distinct, rod-like structures. Close examination of mitotic 
chromosomes reveals each of them to be composed of two mirror- 
image, “sister” chromatids (Figure 14.13a). Sister chromatids are a 
result of replication in the previous interphase. 

Prior to replication, the DNA of each interphase chromosome 
becomes associated at sites along its length with a multiprotein 
complex called cohesin (Figure 14.14). Following replication, 
cohesin holds the two sister chromatids together continuously 


(a) 


(b) 3 um 


FIGURE 14.13 The mitotic chromosome. (a) Electron micrograph of a 
whole-mount preparation of a human mitotic chromosome. The structure is 
seen to be composed of a knobby fiber 30 nm in diameter, which is similar 
to that found in interphase chromosomes. (b) Appearance of a mitotic 
chromosome after the histones and most of the nonhistone proteins have 
been removed. The residual proteins form a scaffold from which loops of DNA 
are seen to emerge (the DNA loops are shown more clearly in Figure 12.15). 
Source: (a) Courtesy of Gunther F. Bahr, Armed Forces Institute of 
Pathology, Washington, D.C. (b) From James R. Paulson and Ulrich K. 
Laemmli, Cell 12:820, 1977, with permission from Elsevier. 


through G, and into mitosis when they are ultimately separated. As 
indicated in the insets of Figure 14.14, condensin and cohesin have a 
similar structural organization. A number of experiments support 
the hypothesis that the cohesin ring encircles two sister DNA mole- 
cules as shown in both the left and right portions of Figure 14.14. 
In vertebrates, cohesin is released from the chromosomes in two 
distinct stages. Most of the cohesin is removed from the arms of the 


FIGURE 14.14 Model for the roles of condensin and cohesin in the 
formation of mitotic chromosomes. Just after replication, the DNA helices 
of a pair of sister chromatids would be held in association by cohesin 
molecules that encircled the sister DNA helices, as shown at the left of the 
drawing. As the cell entered mitosis, the compaction process would begin, 
aided by condensin molecules, as shown in the right part of the drawing. In 
this model, condensin brings about chromosome compaction by forming a 
ring around supercoiled loops of DNA within chromatin. Cohesin molecules 
would continue to hold the DNA of sister chromatids together. It is 
proposed (but not shown in this drawing) that cooperative interactions 
between condensin molecules would then organize the supercoiled loops 
into larger coils, which are then folded into a mitotic chromosome fiber. 
The top left and right insets show the subunit structure of an individual 
cohesin and condensin complex, respectively. Both complexes are built 
around a pair of SMC subunits. Each of the SMC polypeptides folds back 
on itself to form a highly elongated antiparallel, coiled coil with an ATP- 
binding globular domain where the N- and C-termini come together. 
Cohesin and condensin also have two or three non-SMC subunits that 
complete the ring-like structure of these proteins. 


chromosomes as they become compacted during prophase. 
Dissociation is induced by phosphorylation of cohesin associated 
proteins by two important mitotic enzymes called Polo-like kinase 
and Aurora B kinase. The associated cohesins are subsequently 
removed from chromosome arms through the action of the protein 
WAPL. In the wake of this event, the chromatids of each mitotic 
chromosome are held relatively loosely along their extended arms, 
but much more tightly at their centromeres (Figure 14.13a and 
FIGURE 14.15). Cohesin remains at the centromeres because of the 
presence there of a phosphatase that removes any phosphate groups 
added to the protein by the kinases. Release of cohesin from the 
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centromeres is normally delayed until anaphase as described on 
page 562 and is mediated by the protein separase. If the phosphatase 
is experimentally inactivated, sister chromatids separate from one 
another prematurely prior to anaphase. 


Centromeres and Kinetochores 


The most notable landmark on a mitotic chromosome is an indenta- 
tion or primary constriction, which marks the position of the cen- 
tromere (Figure 14.13a). The centromere is the residence of highly 
repeated DNA sequences (see Figure 10.20) that serve as the binding 


(a) 


(b) 


FIGURE 14.15 Each mitotic chromosome is comprised of a pair of sister chromatids connected to one another by the protein complex cohesin. 
(a) Colorized scanning electron micrograph of several metaphase chromosomes showing the paired identical chromatids associated loosely along their 
length and joined tightly at the centromere. The chromatids are not split apart from one another until anaphase. (b) Fluorescence micrograph of a meta- 
phase chromosome in a cultured human cell. The DNA is stained blue, the kinetochores are green, and cohesin is red. At this stage of mitosis, cohesin has 
been lost from the arms of the sister chromatids but remains concentrated at the centromeres where the two sisters are tightly joined. 


Source: (a) Andrew Syred/Photo Researchers, Inc.; (b) By S. Hauf and Jan-Michael Peters, Nature Cell Biol. 3:E17, 2001 Fig. 1c. Reprinted by permission from 


Macmillan Publishers Limited. 
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chromosome reveals the presence of a proteinaceous, button-like 
structure called the kinetochore at the outer surface of the cen- 
tromere of each chromatid (FIGURE 14.16a,b). Most of the 100-plus 
proteins that make up the kinetochore assemble at the centromere at 
early prophase. Kinetochore proteins are thought to be recruited to 
the centromere because of the presence there of the novel nucle- 
osomes containing the histone variant CENP-A (page 477). As will 
be apparent shortly, the kinetochore functions as (1) the site of 
attachment of the chromosome to the dynamic microtubules of 
the mitotic spindle (as in Figure 14.30), (2) the residence of several 
motor proteins involved in chromosome motility (Figure 14.16c), 
and (3) a key component in the signaling pathway of an important 
mitotic checkpoint (see Figure 14.31). 


Formation of the Mitotic Spindle 


We discussed in Chapter 9 how microtubule assembly in animal cells 
is initiated by a special microtubule-organizing structure, the centro- 
some (Section 9.5). As a cell progresses past G, and into mitosis, 
the microtubules of the cytoskeleton undergo sweeping disassembly 
in preparation for their reassembly as components of a complex, 
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micro-sized machine called the mitotic spindle. The rapid 
disassembly of the interphase cytoskeleton is thought to be accom- 
plished by the inactivation of proteins that stabilize microtubules 
(e.g., microtubule-associated proteins, or MAPs) and the activation 
of proteins that destabilize these polymers. 

To understand the formation of the mitotic spindle, we need to 
first examine the centrosome cycle as it progresses in concert with the 
cell cycle (FIGURE 14.17a). When an animal cell exits mitosis, the 
cytoplasm contains a single centrosome containing two centrioles 
situated at right angles to one another. Even before cytokinesis has 
been completed, the two centrioles of each daughter cell lose their 
close association to one another (they are said to “disengage”). This 
event is triggered by the enzyme separase, which becomes activated 
late in mitosis (page 562) and cleaves a proteinaceous link holding 
the centrioles together. Later, as DNA replication begins in the 
nucleus at the onset of S phase, each centriole of the centrosome 
initiates its “replication” in the cytoplasm. The process begins with 
the appearance of a small procentriole next to each preexisting 
(maternal) centriole and oriented at right angles to it (Figure 14.170). 
Subsequent microtubule elongation converts each procentriole into 
a full-length daughter centriole. At the beginning of mitosis, the 
centrosome splits into two adjacent centrosomes, each containing a 
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FIGURE 14.16 The kinetochore. (a) Electron micrograph of a section through one of the kinetochores of a mammalian metaphase chromosome, showing 
its three-layered (trilaminar) structure. Microtubules of the mitotic spindle can be seen to terminate at the kinetochore. (b) Schematic representation of the 
kinetochore, which contains an electron-dense inner and outer plate separated by a lightly staining interzone. Proposed functions of the inner and outer 
plates are indicated in part a. The inner plate contains a variety of proteins attached to the centromeric heterochromatin of the chromosome. Associated 
with the outer plate is the fibrous corona, which binds motor proteins involved in chromosome movement. (c) A schematic model showing a proposed 
disposition of several of the proteins found at the outer surface of the kinetochore. Among the motor proteins associated with the kinetochore, cytoplasmic 
dynein moves toward the minus end of a microtubule, whereas CENP-E moves toward the plus end. These motors may also play a role in tethering the 
microtubule to the kinetochore. The protein labeled “depolymerase” is a member of the kinesin superfamily that functions in depolymerization of microtu- 
bules rather than motility. In this drawing, the depolymerases are in an inactive state (the microtubule is not depolymerizing). Ndc80 is a protein complex 
consisting of four different protein subunits that form a 57-nm-long, rod-shaped molecule extending outward from the body of the kinetochore. Globular 
domains at either end of the complex mediate attachment to the microtubule and kinetochore. These Ndc80 fibrils have been implicated as couplers of the 


kinetochore to the plus end of a dynamic microtubule. 


Source: (a) From Don W. Cleveland, UCSD, et al., Cell 112:408, 2003 Fig. 1c, with permission from Elsevier. Image Courtesy of Kevin Sullivan. 
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FIGURE 14.17 The centrosome cycle of an animal cell. (a) During Gy, 
the centrosome contains a single pair of centrioles that are no longer as 
tightly associated as they were during mitosis. During S phase, daughter 
procentrioles form adjacent to maternal centrioles so that two pairs of 
centrioles become visible within the centrosome (see b). The daughter 
procentrioles continue to elongate during Gz phase, and at the beginning 
of mitosis, the centrosome splits, with each pair of centrioles becoming part 
of its own centrosome. As they separate, the centrosomes organize the 


microtubule fibers that make up the mitotic spindle. (b) The centrosome of (c) 
this cell is seen to contain two mother centrioles, each with an associated 


daughter procentriole (arrows). (c) Fluorescence micrograph of a normal 
spindle in early metaphase, stained for microtubules (green), chromosomes 
(blue), and kinetochore protein (red). (d) This mouse mammary cancer cell 
contains more than the normal complement of two centrosomes (red) and 
has assembled a multipolar spindle apparatus (green). Additional cen- 
trosomes lead to chromosome missegregation and abnormal numbers of 
chromosomes (blue), which are characteristic of malignant cells. 


Source: (a) After D. R. Kellogg et al., reproduced with permission from the 
Annual Review of Biochemistry, Vol. 63; Copyright 1994, Annual Review of 
Biochemistry by Annual Reviews. Reproduced with permission of Annual 
Reviews in the format republish in a book via Copyright; (b) From Jerome 

B. Rattner and Stephanie G. Phillips, J. Cell Biol.57:363, 1973, Fig. 4. 
Reproduced with permission of The Rockefeller University Press; (c) Courtesy 
of Jason Stumpf; (d) Courtesy of Thea Goepfert and W. R. Brinkley, Baylor 


College of Medicine, Houston, Tx. 


pair of mother-daughter centrioles. The initiation of centrosome 
duplication at the G,-S transition is normally triggered by phospho- 
rylation of a centrosomal protein by Cdk2, the same agent responsi- 
ble for the onset of DNA replication (Figure 14.8). Centrosome 
duplication is a tightly controlled process so that each mother centri- 
ole produces only one daughter centriole during each cell cycle. The 
formation of additional centrioles can lead to abnormal cell division 
and may contribute to the development of cancer (Figure 14.17c). 
The first stage in the formation of the mitotic spindle in a typical 
animal cell is the appearance of microtubules in a “sunburst” arrange- 
ment, or aster (FIGURE 14.18), around each centrosome during early 
prophase. As discussed in Chapter 9, microtubules grow by addition 
of subunits to their plus ends, while their minus ends remain associ- 
ated with the pericentriolar material (PCM) of the centrosome (page 
321). Phosphorylation of proteins of the PCM by Polo-like kinase is 
thought to play a key role in stimulating nucleation of spindle micro- 


(d) 


tubules during prophase. The process of aster formation is followed 
by separation of the centrosomes from one another and their subse- 
quent movement around the nucleus toward opposite ends of the 
cell. Centrosome separation is driven by motor proteins associated 
with the adjacent microtubules. As the centrosomes separate, the 
microtubules stretching between them increase in number and elon- 
gate (Figure 14.18). Eventually, the two centrosomes reach points 
opposite one another, thus establishing the two poles of a bipolar 
mitotic spindle (as in Figure 14.17a). Following mitosis, one centro- 
some will be distributed to each daughter cell. 

A number of different types of animal cells (including those of 
the early mouse embryo) lack centrosomes, as do the cells of higher 
plants, yet all of these cells construct a bipolar mitotic spindle and 
undergo a relatively typical mitosis. Functional mitotic spindles can 
even form in mutant Drosophila cells that lack centrosomes or in mam- 
malian cells in which the centrosome has been experimentally 
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FIGURE 14.18 Formation of the mitotic spindle. During prophase, as the 
chromosomes are beginning to condense, the centrosomes move apart from one 
another as they organize the bundles of microtubules that form the mitotic 
spindle. This micrograph shows a cultured newt lung cell in early prophase that 
has been stained with fluorescent antibodies against tubulin, which reveals the 
distribution of the cell’s microtubules (green). The microtubules of the developing 
mitotic spindle are seen to emanate as asters from two sites within the cell. These 
sites correspond to the locations of the two centrosomes that are moving toward 
opposite poles at this stage of prophase. The centrosomes are situated above the 
cell nucleus, which appears as an unstained dark region. 

Source: From Jennifer Waters, Richard W. Cole, and Conly L. Rieder, J. Cell Biol. 
122:364, 1993 Fig. 2. Reproduced with permission of the Rockefeller University 
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removed. In all of these cases, the microtubules of the mitotic spindle 
are nucleated near the chromosomes rather than at the poles where 
centrosomes would normally reside. Then, once they have polymer- 
ized, the minus ends of the microtubules are brought together (i.e., 
focused) at each spindle pole through the activity of motor proteins 
(FIGURE 14.19). These types of experiments suggested that cells possess 
two fundamentally different mechanisms—one centrosome-dependent 
and the other centrosome-independent—to achieve the same end 
result. Recent studies have indicated that both pathways to spindle 
formation operate simultaneously in the same cell and that even cells 
with functional centrosomes nucleate a significant fraction of their 
spindle microtubules at the chromosomes. 


The Dissolution of the Nuclear 
Envelope and Partitioning 
of Cytoplasmic Organelles 


In most eukaryotic cells, the mitotic spindle is assembled in the cyto- 
plasm and the chromosomes are compacted in the nucleoplasm. 
Interaction between the spindle and chromosomes is made possible 
by the breakdown of the nuclear envelope at the end of prophase. 
The three major components of the nuclear envelope—the nuclear 
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FIGURE 14.19 Formation of a spindle pole in the absence of cen- 
trosomes. In this model, each motor protein has multiple heads, which are 
bound to different microtubules. The movement of these motor proteins 
causes the minus ends of the microtubules to converge to form a distinct 
spindle pole. This type of mechanism is thought to facilitate the formation 
of spindle poles in the absence of centrosomes but may also play a role 
when centrosomes are present. 

Source: From A. A. Hyman and E. Karsenti, Cell 84:406, 1996; by permission 
of Cell Press. Cell by Cell Press. Reproduced with permission of Cell Press in 
the format reuse in a Book/Textbook via Copyright Clearance Center. 
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pore complexes, nuclear lamina, and nuclear membranes—are dis- 
assembled in separate processes. All of these processes are thought 
to be initiated by phosphorylation of key substrates by mitotic 
kinases, particularly cyclin B—Cdk1. The nuclear pore complexes are 
disassembled as the interactions between nucleoporin subcomplexes 
are disrupted, and the subcomplexes dissociate into the surrounding 
medium. The nuclear lamina is disassembled by depolymerization 
of the lamin filaments. The integrity of the nuclear membranes is 
first disrupted mechanically as holes are torn into the nuclear envelope 
by cytoplasmic dynein molecules associated with the outer nuclear 
membrane. The subsequent fate of the membranous portion of the 
nuclear envelope has been the subject of controversy. According to 
the classical view, the nuclear membranes are fragmented into a pop- 
ulation of small vesicles that disperse throughout the mitotic cell. 
Alternatively, the membranes of the nuclear envelope may be absorbed 
into the membranes of the ER. 

Some of the membranous organelles of the cytoplasm remain 
relatively intact through mitosis; these include mitochondria, lys- 
osomes, and peroxisomes, as well as the chloroplasts of a plant cell. 
Considerable debate has been generated in recent years over the 
mechanism by which the Golgi complex and endoplasmic reticulum 
are partitioned during mitosis. According to one view, the contents 
of the Golgi complex become incorporated into the ER during pro- 
phase, and the Golgi complex ceases to exist briefly as a distinct 
organelle. According to an alternate view, the Golgi membranes 
become fragmented to form a distinct population of small vesicles 
that are partitioned between daughter cells. A third view based pri- 
marily on studies in algae and protists has the entire Golgi complex 
splitting in two, with each daughter cell receiving half of the original 
structure. Ultimately, we may learn that different types of cells or 
organisms utilize different mechanisms of Golgi inheritance. Our 
ideas about the fate of the ER have also changed. Recent studies on 
living, cultured mammalian cells suggest that the ER network 
remains relatively intact during mitosis. This view challenges earlier 
studies performed largely on eggs and embryos that suggested the 
ER undergoes extensive fragmentation during prophase. 


REVIEW| 
1. How do the events of mitotic prophase prepare the 
chromatids for later separation at anaphase? 


2. What are some of the activities of the kinetochore 
during mitosis? 
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14.8 Prometaphase 


The dissolution of the nuclear envelope marks the start of the second 
phase of mitosis, prometaphase, during which mitotic spindle 
assembly is completed and the chromosomes are moved into posi- 
tion at the center of the cell. The following discussion provides a 
generalized picture of the steps of prometaphase; many variations on 
these events have been reported. 

At the beginning of prometaphase, compacted chromosomes 
are scattered throughout the space that was the nuclear region 
(FIGURE 14.20a). As the microtubules of the spindle penetrate into 
the central region of the cell, the free (plus) ends of the microtubules 
are seen to grow and shrink in a dynamic fashion, as if they were 
“searching” for a chromosome. It is not certain whether searching is 
entirely random, as evidence suggests that microtubules may grow 
preferentially toward a site containing chromatin. Those microtu- 
bules that contact a kinetochore are “captured” and stabilized. 

A kinetochore typically makes initial contact with the sidewall 
of a microtubule rather than its end (step 1, Figure 14.20b). Once 
initial contact is made, some chromosomes move actively along the 
wall of the microtubule, powered by motor proteins located in the 
kinetochore. Soon, however, the kinetochore tends to become stably 
associated with the plus end of one or more spindle microtubules 
from one of the spindle poles (step 2). A chromosome that is attached 
to microtubules from only one spindle pole represents an unstable 
intermediate stage in the course of prometaphase. Eventually, the 
unattached kinetochore on the sister chromatid captures its own 
microtubules from the opposite spindle pole (step 3). 

Recent evidence has shown that microtubules can also be nucle- 
ated through a chromatin-based pathway. These microtubules grow 
out from the chromosome by incorporation of tubulin subunits 
proximal to the chromatin and, once assembled, they become incor- 
porated into the mitotic spindle. Microtubule nucleation from the 
sides of existing microtubules has also recently been observed. 
These mechanisms all contribute to the high density of microtu- 
bules within the spindle, which helps ensure that the two sister chro- 
matids of each mitotic chromosome ultimately become connected 
by their kinetochores to microtubules that extend from opposite 
poles. 

Observations in living cells indicate that prometaphase 
chromosomes associated with spindle microtubules are not moved 
directly to the center of the spindle but rather oscillate back and 
forth in both a poleward and antipoleward direction. Ultimately, the 
chromosomes of a prometaphase cell are moved by a process called 
congression toward the center of the mitotic spindle, midway 
between the poles (step 4, Figure 14.20b). The forces required for 
chromosome movements during prometaphase are generated by 
motor proteins associated with both the kinetochores and arms of 
the chromosomes (depicted in Figure 14.33a and discussed in the 
legend). FIGURE 14.21 shows the consequences of a deficiency of a 
chromosomal motor protein whose activity pushes chromosomes 
away from the poles. 

Microtubule dynamics also play a key role in facilitating chro- 
mosome movements during prometaphase. As the chromosomes 
congress toward the center of the mitotic spindle, the longer micro- 
tubules attached to one kinetochore are shortened, while the shorter 
microtubules attached to the sister kinetochore are elongated. These 
changes in microtubule length are thought to be governed by differ- 
ences in pulling force (tension) on the two sister kinetochores. 
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FIGURE 14.20 Prometaphase. (a) Fluorescence micrograph of a 
cultured newt lung cell at the early prometaphase stage of mitosis, just 
after the nuclear envelope has broken. The microtubules of the mitotic 
spindle are now able to interact with the chromosomes. The mitotic 
spindle appears green after labeling with a monoclonal antibody against 
tubulin, whereas the chromosomes appear blue after labeling with a 
fluorescent dye. (b) Schematic view of some of the successive steps in 
chromosome-microtubule interactions during prometaphase. In step 1, 
a kinetochore has made contact with the sidewall of a microtubule and 
is capable of utilizing kinetochore-bound motors to slide in either 
direction along the microtubule. In step 2, a chromosome has become 
attached to the plus end of a microtubule from one spindle pole (an 
end-on attachment forming a mono-oriented chromosome). In step 3, the 
chromosome has become attached in an end-on orientation to 
microtubules from both poles (forming a bi-oriented chromosome). 

In step 4, the bi-oriented chromosome has been moved to the center 

of the cell and will become part of the metaphase plate. Chromosomes 
at this stage are under tension (as indicated by the space between the 
chromatids) due to the opposing pulling forces exerted by the 
microtubules from opposite poles. The chromosome in step 3a has both 
of its kinetochores attached to microtubules from the same spindle pole. 
This abnormal syntelic attachment is discussed on page 566. 


Source: (a) Courtesy of Alexey Khodjakov, Wadsworth Center, NY. 
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FIGURE 14.21 The consequence of a missing motor protein on 
chromosome alignment during prometaphase. The top micrograph 
shows a mitotic spindle that has assembled in a complete frog egg 
extract. The lower micrograph shows a mitotic spindle that has assembled 
in a frog egg extract that has been depleted of a particular kinesin-related 
protein called Kid that is present along the arms of prometaphase 
chromosomes. In the absence of this motor protein, the chromosomes 
fail to align at the center of the spindle and instead are found stretched 
along spindle microtubules and clustered near the poles. Kid normally 
provides force for moving chromosomes away from the poles 

(see Figure (14.33a). 

Source: From Celia Antonio et al., Cell vol. 102, cover #4, 2000; with 
permission from Elsevier. Courtesy of Isabelle Vernos. 


Shortening and elongation of microtubules occur primarily by loss 
or gain of subunits at the plus end of the microtubule (FIGURE 14.22). 
Remarkably, this dynamic activity occurs while the plus end of each 
microtubule remains attached to a kinetochore. 

Eventually, each chromosome moves into position along a plane 
at the center of the spindle, so that microtubules from each pole are 
equivalent in length. The movement of a wayward chromosome 
from a peripheral site near one of the poles to the center of the 
mitotic spindle during prometaphase is shown in the series of photos 
in FIGURE 14.23. 


REVIEW | 


1. Describe the movement of chromosomes during 


prometaphase. What forces cause this movement? 
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FIGURE 14.22 Microtubule behavior during formation of the meta- 
phase plate. Initially, the chromosome is connected to microtubules from 
opposite poles that may be very different in length. As prometaphase 
continues, this imbalance is corrected as the result of the shortening of 
microtubules from one pole, due to the rapid loss of tubulin subunits at 
the kinetochore, and the lengthening of microtubules from the opposite 
pole, due to the rapid addition of tubulin subunits at the kinetochore. 
These changes are superimposed over a much slower polymerization 
and depolymerization apparent in the lower drawing that occur 
continually during prometaphase and metaphase, causing the subunits 
of the microtubule to move toward the poles in a process known as 
microtubule flux. 


14.9 Metaphase 


Once all of the chromosomes have become aligned at the spindle 
equator—with one chromatid of each chromosome connected by its 
kinetochore to microtubules from one pole and its sister chromatid 
connected by its kinetochore to microtubules from the opposite 
pole—the cell has reached the stage of metaphase (FIGURE 14.24). 
The plane of alignment of the chromosomes at metaphase is referred 
to as the metaphase plate. The mitotic spindle of the metaphase cell 
contains a highly organized array of microtubules that is ideally 
suited for the task of separating the duplicated chromatids posi- 
tioned at the center of the cell. Functionally and spatially, the micro- 
tubules of the metaphase spindle of an animal cell can be divided 
into three groups (Figure 14.24): 


1. Astral microtubules that radiate outward from the centrosome 
into the region outside the body of the spindle. They help posi- 
tion the spindle apparatus in the cell and may help determine the 
plane of cytokinesis. 


2. Chromosomal (or kinetochore) microtubules that extend between 
the centrosome and the kinetochores of the chromosomes. In 
mammalian cells, each kinetochore is attached to a bundle of 
20-30 microtubules, which forms a spindle fiber (or k-fiber). 
During metaphase, the chromosomal microtubules exert a pulling 


force on the kinetochores. As a result, the chromosomes are within the kinetochore and cause oscillations of the chromo- 561 


maintained in the equatorial plane by a “tug-of-war” between somes situated at the metaphase plate. During anaphase, chro- 
balanced pulling forces exerted by chromosomal spindle fibers mosomal microtubules are required for the movement of the 
from opposite poles. These pulling forces generate deformations chromosomes toward the poles. 
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FIGURE 14.23 The engagement of a chromosome during prometaphase and its movement to the metaphase plate. This series of photographs 
taken from a video recording shows the movements of the chromosomes of a newt lung cell over a period of 100 seconds during prometaphase. Although 
most of the cell’s chromosomes were nearly aligned at the metaphase plate at the beginning of the sequence, one of the chromosomes (arrow) had failed to 
become attached to spindle fibers from both poles. The wayward chromosome has become attached to spindle fibers from opposite poles in B and then 
moves toward the spindle equator with variable velocity until it reaches a stable position in F. The position of one pole is indicated by the arrowhead in A. 
Source: From Stephen P. Alexander and Conly L. Rieder, J. Cell Biol. 113:807, 1991 Fig. 1. Reproduced with permission of The Rockefeller University Press. 


Courtesy Conly L. Rieder. 
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FIGURE 14.24 The mitotic spindle of an animal cell. Each spindle pole contains a pair of centrioles surrounded by amorphous pericentriolar material 

at which the microtubules are nucleated. Three types of spindle microtubules—astral, chromosomal, and polar spindle microtubules—are evident, and their 
functions are described in the text. All of the spindle microtubules, which can number in the thousands, have their minus ends pointed toward the centro- 
some. Although not shown here, spindles may also contain shorter microtubules that do not make contact with either a kinetochore or a spindle pole. 


562 3. Polar (or interpolar) microtubules that extend from the centro- 
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some past the chromosomes. Polar microtubules from one cen- 
trosome overlap with their counterparts from the opposite 
centrosome. The polar microtubules form a structural basket 
that maintains the mechanical integrity of the spindle. 


As one watches films or videos of mitosis, metaphase appears as 
a stage during which the cell pauses for a brief period, as if all mitotic 
activities suddenly come to a halt. However, experimental analysis 
reveals that metaphase is a time when important events occur. 

Even though there is no obvious change in length of the chro- 
mosomal microtubules as the chromosomes are aligned at the meta- 
phase plate, studies using fluorescently labeled tubulin indicate that 
the microtubules exist in a highly dynamic state (See this chapter's 
Experimental Walkthrough Video: Fluorescence microscopy of living cells to 
learn more about fluorescence microscopy). Subunits are rapidly lost and 
added at the plus ends of the chromosomal microtubules, even 
though these ends are attached to the kinetochore. Thus, the kine- 
tochore does not act like a cap at the end of the microtubule, block- 
ing the entry or exit of terminal subunits, but rather it is the site of 
dynamic activity. Because more subunits are added to the plus end 
than are lost, there is a net addition of subunits at the kinetochore. 
Meanwhile, the minus ends of the microtubules experience a net 
loss, and thus subunits move along the chromosomal microtubules 
from the kinetochore toward the pole. This poleward flux of tubulin 
subunits in a mitotic spindle is indicated in the experiment illus- 
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FIGURE 14.25 Tubulin flux through the microtubules of the mitotic 
spindle at metaphase. Even though the microtubules appear stationary at 
this stage, injection of fluorescently labeled tubulin subunits indicates that 
the components of the spindle are in a dynamic state of flux. Subunits are 
incorporated preferentially at the kinetochores of the chromosomal 
microtubules and the equatorial ends of the polar microtubules, and they 
are lost preferentially from the minus ends of the microtubules in the region 
of the poles. Tubulin subunits move through the microtubules of a 
metaphase spindle at a rate of about 1 um/min. 


trated in FIGURE 14.25. Loss of tubulin subunits at the poles is likely 
aided by a member of the kinesin-13 family of motor proteins whose 
function is to promote microtubule depolymerization rather than 
movement (page 317). 


REVIEW | 


1. Describe the roles of astral, kinetochore, and polar 
microtubules in the mitotic spindle. 


14.10 Anaphase 


Anaphase begins when the sister chromatids of each chromosome 
split apart and start their movement toward opposite poles. 


The Role of Proteolysis in Progression 
through Mitosis 


We have seen how important it is that specific activities take place in 
their proper order throughout the cell cycle. This orderliness 
depends to a large degree on the selective destruction of cell cycle 
regulatory proteins at precise times during the cell cycle. It was 
pointed out earlier that two distinct multiprotein complexes, SCF 
and APC, add ubiquitin to proteins at different stages of the cell 
cycle, targeting them for destruction by a proteasome. The periods 
during the cell cycle in which the SCF and APC complexes are active 
are shown in FIGURE 14.26a. As illustrated in Figure 14.26a, SCF acts 
primarily during interphase. In contrast, the anaphase promoting 
complex, or APC, plays a key role in regulating events that occur dur- 
ing mitosis. The APC contains about a dozen core subunits, in addi- 
tion to an “adaptor protein” that plays a key role in determining 
which proteins serve as the APC substrate. Two alternate versions of 
this adaptor protein—Cdc20 and Cdh1—determine substrate selec- 
tion during mitosis. APC complexes containing one or the other of 
these adaptors are known as APC@*° or APC“*"! (Figure 14.26a). 
APC“*° becomes activated prior to metaphase (Figure 14.26a) 
and ubiquitinates a key anaphase inhibitor called securin—so named 
because it secures the attachment between sister chromatids. The 
ubiquitination and destruction of securin at the end of metaphase 
release an active protease called separase. Separase then cleaves the 
Sccl subunit of the cohesin molecule that holds sister chromatids 
together (Figure 14.26b). Cleavage of cohesin triggers the separation 
of sister chromatids to mark the onset of anaphase. Experimental 
support for the role of cohesin in maintaining the attachment of sis- 
ter chromatids comes from studies in which proteolytic enzymes 
have been injected into cells that had been arrested in metaphase. 
Cleavage of cohesin by such enzymes leads rapidly to the separation 
of chromatids and their anaphase-like movement toward the poles. 
Near the end of mitosis, Cdc20 is inactivated, and the alternate 
adaptor, Cdh1, takes control of the APC’s substrate selection (Figure 
14.26a). When Cdh1 is associated with the APC, the enzyme com- 
pletes the ubiquitination of cyclin B that was begun by APC, 
Destruction of the cyclin leads to a precipitous drop in activity of the 
mitotic Cdk (cyclin B—Cdk1) and progression of the cell out of mitosis 
and into the G, phase of the next cell cycle. The importance of protein 
degradation in regulating the events of mitosis and the reentry of cells 
into G, is best revealed with the use of inhibitors (FIGURE 14.27). 


If the destruction of cyclin B is prevented with an inhibitor of the its normal activities and continue through mitosis and cytokinesis. 563 
proteasome, cells remain arrested in a late stage of mitosis. If such | The completion of mitosis clearly requires the cessation of activity of 
cells that are arrested in mitosis are subsequently treated with acom- Cdk! (either by the normal destruction of its cyclin activator or by 
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FIGURE 14.26 SCF and APC activities during the cell cycle. SCF and APC are multisubunit complexes that ubiquitinate substrates, leading to their 
destruction by proteasomes. (a) SCF is active primarily during interphase, whereas APC (anaphase promoting complex) is active during mitosis and G,. Two 
different versions of APC are indicated. These two APCs differ in containing either a Cdc20 or a Cdh1 adaptor protein, which alters the substrates recog- 
nized by the APC. APC°%? js active early in mitosis, at a time when Cdh1 is inhibited by Cdk1-mediated phosphorylation. As Cdk1 activity drops sharply in 
late mitosis, Cdh1 is activated, leading to the activation of APC“4". The label SAC stands for spindle assembly checkpoint, which is discussed on page 566. 
The SAC prevents APC“%*° from triggering anaphase until all the chromosomes are properly aligned at the metaphase plate. (b) APC°%”° is responsible for 
destroying proteins, such as securin, that inhibit anaphase. Destruction of these substrates promotes the metaphase—anaphase transition. APCS" jg 
responsible for ubiquitinating proteins, such as mitotic cyclins, that inhibit exit from mitosis. Destruction of these substrates promotes the mitosis—G, 
transition. APC©*" activity during early G, helps maintain the low cyclin-Cdk activity (Figure 14.8) required to assemble prereplication complexes at the 
origins of replication (Figure 13.20). (Although not discussed here, the activation of phosphatases that remove the phosphate groups added by Cdk1 also 
plays a substantial part in driving the events that occur during the latter stages of mitosis and reentry into G;; see Nature Revs. Mol. Cell Biol. 12:469, 2011.) 
Source (a) After J-M Peters, Curr. Opin. Cell Biol. 10:762, 1998; Copyright 1998. Current Opinion in Cell Biology by Elsevier Ltd. Reproduced with 
permission of Elsevier Ltd. in the format reuse in a book/textbook via Copyright Clearance Center. See also Nature Revs. Mol. Cell Biol. 7:650, 2006. 


00:37:16 


FIGURE 14.27 Experimental demonstration of the importance of proteolysis in a cell's irreversible exit from mitosis. This illustration shows frames 
from a video of a cell that had been arrested in mitosis by the presence of a proteasome inhibitor (MG132). At time 0, a Cdk1 inhibitor (flavopiridol) was 
added to the medium, which caused the cell to complete mitosis and initiate cytokinesis. At 25 min, the cell was washed free of the Cdk1 inhibitor. Because 
the cell still contained cyclin B (which would normally have been destroyed by proteasomes), the cell reentered mitosis and progressed to metaphase as 
seen in the last five frames of the video. The upper row shows phase-contrast images of the cell at various times, and the lower row shows the correspond- 
ing fluorescence micrographs with times indicated. Video 3, from which these frames were taken, can be seen here: http://www.ncbi.nlm.nih.gov/pmc/ 
articles/PMC1513549/bin/NIHMS10478-supplement-Video003.mov. Bar in lower right image equals 10 um. 


Soure: From Tamara A. Potapova, et al., Nature 440:954, 2006 © 2006. Reprinted with permission from Macmillan Publishers Limited. Courtesy of Gary J. Gorbsky. 
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that have already exited mitosis are washed free of the Cdk1 inhibitor 
(Figure 14.27). Such cells, which now contain both cyclin B and active 
Cdk1, actually progress in the reverse direction and head back into 
mitosis. This reversal is characterized by compaction of the chromo- 
somes, breakdown of the nuclear envelope, assembly of a mitotic spin- 
dle, and movement of the chromosomes back to the metaphase plate, 
as shown in Figure 14.27. All of these events are triggered by the inap- 
propriate reactivation of Cdk] activity by removal of its inhibitor. This 
finding dramatically illustrates the importance of proteolysis in mov- 
ing the normal cell cycle in a single, irreversible direction. 


The Events of Anaphase 


All the chromosomes of the metaphase plate are split in synchrony at 
the onset of anaphase, and the chromatids (now referred to as chro- 
mosomes, because they are no longer attached to their sisters) begin 
their poleward migration (see Figure 14.11). As each chromosome 
moves during anaphase, its centromere is seen at its leading edge 
with the arms of the chromosome trailing behind (FIGURE 14.28a). 
The movement of chromosomes toward opposite poles is very slow 
relative to other types of cellular movements, proceeding at approxi- 
mately 1 um per minute, a value calculated by one mitosis researcher 
to be equivalent to a trip from North Carolina to Italy that would 
take approximately 14 million years. The slow rate of chromosome 
movement ensures that the chromosomes segregate accurately and 
without entanglement. The forces thought to power chromosome 
movement during anaphase are discussed in the following section. 
The poleward movement of chromosomes is accompanied by 
obvious shortening of chromosomal microtubules. It has long been 


FIGURE 14.28 The mitotic spindle and chromosomes at anaphase. 

(a) Fluorescence micrograph of a cell at late anaphase, showing the highly 
compacted arms of the chromosomes at this stage. The arms are seen to trail 
behind as the kinetochores lead the way toward the respective poles. The 
chromosomal spindle fibers at this late anaphase stage are extremely short 
and are less evident between the forward edges of the chromosomes and the 
poles. The polar spindle microtubules are visible in the interzone between 


appreciated that tubulin subunits are lost from the plus (kinetochore- 
based) ends of chromosomal microtubules during anaphase (Figure 
14.28b). Subunits are also lost from the minus ends of these micro- 
tubules as a result of the continued poleward flux of tubulin subunits 
that occurs during prometaphase and metaphase (Figures 14.22 
and 14.25). The primary difference in microtubule dynamics between 
metaphase and anaphase is that subunits are added to the plus ends of 
microtubules during metaphase, keeping the length of the chromo- 
somal fibers constant (Figure 14.25), whereas subunits are lost from 
the plus ends during anaphase, resulting in shortening of the chromo- 
somal fibers (Figure 14.28b). This change in behavior at the microtu- 
bule plus ends is thought to be triggered by the loss of tension on the 
kinetochores following separation of the sister chromatids. 

The movement of the chromosomes toward the poles is referred 
to as anaphase A to distinguish it from a separate but simultaneous 
movement, called anaphase B, in which the two spindle poles 
move farther apart. The elongation of the mitotic spindle during 
anaphase B is accompanied by the net addition of tubulin subunits to 
the plus ends of the polar microtubules. Thus, subunits can be pref- 
erentially added to polar microtubules and removed from chromo- 
somal microtubules at the same time in different regions of the same 
mitotic spindle (Figure 14.28b). 


Forces Required for Chromosome 
Movements at Anaphase 


In the early 1960s, Shinya Inoué of the Marine Biological Laboratory 
at Woods Hole proposed that the depolymerization of chromosomal 
microtubules during anaphase was not simply a consequence of 
chromosome movement but the cause of it. Inoué suggested that 
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(b) Anaphase continues 


the separating chromosomes. Relative movements of the polar microtubules are thought to be responsible for causing the separation of the poles that 
occurs during anaphase B. (b) Microtubule dynamics during anaphase. Tubulin subunits are lost from both ends of the chromosomal microtubules, resulting 
in shortening of chromosomal fibers and movement of the chromosomes toward the poles during anaphase A. Meanwhile, tubulin subunits are added to the 


plus ends of polar microtubules, which also slide across one another, leading to separation of the poles during anaphase B. 


Source: (a) From Nasser Rusan, UNC Chapel Hill, 2005 Olympus Bioscapes Competition. 


FIGURE 14.29 Experimental demonstration that microtubule depolymerization can move attached chromosomes in vitro. The structure at the lower 
left is the remnant of a lysed protozoan. In the presence of tubulin, the basal bodies at the surface of the protozoan were used as sites for the initiation of 
microtubules, which grew outward into the medium. Once the microtubules had formed, condensed mitotic chromosomes were introduced into the 
chamber and allowed to bind to the ends of the microtubules. The arrow shows a chromosome attached to the end of a bundle of microtubules. The 
concentration of soluble tubulin within the chamber was then decreased by dilution, causing the depolymerization of the microtubules. As shown in this 
video sequence, the shrinkage of the microtubules was accompanied by the movement of the attached chromosome. Bar equals 5 um. 


Source: From Martine Coue, Vivian A. Lombillo, and J. Richard Mcintosh, J. Cell Biol. 112:1169, 1991, Fig. 3. Reproduced with permission of the Rockefeller 


University Press. 


depolymerization of the microtubules that comprise a spindle fiber 
could generate sufficient mechanical force to pull a chromosome 
forward. 

Early experimental support for the disassembly-force model 
came from studies in which chromosomes underwent considerable 
movement as the result of the depolymerization of attached micro- 
tubules. An example of one of these experiments is shown in 
FIGURE 14.29. In this case, the movement of a microtubule-bound 
chromosome (arrow) occurs in vitro following dilution of the 
medium. Dilution reduces the concentration of soluble tubulin, 
which in turn promotes the depolymerization of the microtubules. 
These types of experiments, as well as direct force-measuring stud- 
ies, indicate that microtubule depolymerization alone can generate 
sufficient force to pull chromosomes through a cell. 
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FIGURE 14.30 depicts a model of the events proposed to occur 
during chromosome movement at anaphase in a vertebrate cell. As 
indicated in Figure 14.28), the microtubules that comprise the chro- 
mosomal spindle fibers undergo depolymerization at both their 
minus and plus ends during anaphase. These combined activities 
lead to the movement of chromosomes toward the pole. 
Depolymerization at the microtubule minus ends serves to transport 
the chromosomes toward the poles due to poleward flux, reminis- 
cent of a person standing on a moving walkway in an airport. 
Depolymerization at the microtubule plus ends causes the tubule to 
roll up while still towing the chromosomes. Some cells rely more on 
poleward flux, others more on plus-end depolymerization. Studies 
of animal cells in anaphase have revealed that both the plus and 
minus ends of chromosomal fibers are sites where depolymerizing 
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FIGURE 14.30 Proposed mechanism for the movement of chromosomes during anaphase in animal cells. In the model depicted here, chromosome 
movement toward the poles is accomplished by a combination of poleward flux, which moves the body of each microtubule toward one of the poles, and 
simultaneous depolymerization of the microtubule at both ends. Depolymerizing kinesins of the kinesin-13 family have been localized at both the plus 
(kinetochore) and minus (polar) ends of chromosomal microtubules and are postulated to be responsible for depolymerization at their respective sites. In this 
model, the Ndc80 protein complex of the outer kinetochore plate acts as the device that couples microtubule depolymerization to chromosome segrega- 
tion. The force required for chromosome movement is provided by the release of strain energy as the microtubule depolymerizes. The released energy is 
utilized by the curled ends of the depolymerizing protofilaments to bias the movement of the bound heads of the Ndc80 complex (dashed black arrow) 
toward the minus end of the microtubule. Motor proteins of the kinetochore, such as cytoplasmic dynein, may also have a force-generating role in 


chromosome movement during anaphase. 
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ized. These depolymerases are indicated at the opposite ends of the 
microtubule in Figure 14.30. If either of these microtubule “depoly- 
merases” is specifically inhibited, chromosome segregation during 
anaphase is at least partially disrupted. These findings suggest that 
ATP-dependent, kinesin-mediated depolymerization forms the 
basis for chromosome segregation during mitosis. 

One of the questions of greatest interest in the field of mitosis is 
the mechanism by which kinetochores are able to hold on to the plus 
ends of microtubules that are losing tubulin subunits. During mito- 
sis, a molecular complex known as the KMN network forms at the 
outer kinetochore of each chromosome. A key structural component 
of the KMN network is a protein complex known as Ndc80; Ndc80 
complexes are present as molecular fibrils that reach out to form 
relatively weak linkages with an attached microtubule just behind its 
plus end. It is estimated that each microtubule is contacted around 
its circumference by 6 to 9 of these Ndc80 tethers. A number of stud- 
ies suggest that, as indicated in Figure 14.30, the terminal heads of 
the Ndc80 complexes are able to travel along the microtubule toward 
its minus end, pushed along by the curling protofilaments of the dis- 
assembling tip. As a result, the attached chromosome moves toward 
the spindle pole as it is towed by the shrinking chromosomal fiber. 


The Spindle Assembly Checkpoint 


As discussed in Section 14.5, cells possess checkpoint mechanisms 
that monitor the status of events during the cell cycle. One of these 
checkpoints operates at the transition between metaphase and ana- 
phase. The spindle assembly checkpoint (SAC), as it is called, is 
best revealed when a chromosome fails to become aligned properly at 
the metaphase plate. When this happens, the checkpoint mechanism 
delays the onset of anaphase until the misplaced chromosome has 
assumed its proper position along the spindle equator. If a cell were 
not able to postpone chromosome segregation, it would greatly elevate 
the risk of the daughter cells receiving an abnormal number of chro- 
mosomes (aneuploidy). This expectation has been confirmed with the 
identification of a number of children with inherited deficiencies in 
one of the spindle checkpoint proteins. These individuals exhibit a 
disorder (named MVA), which is characterized by a high percentage 
of aneuploid cells and a greatly increased risk of developing cancer. 

How does the cell determine whether all of the chromosomes 
are properly aligned at the metaphase plate? Let’s consider a chromo- 
some that is only attached to microtubules from one spindle pole, 
which is probably the circumstance of the wayward chromosome 
indicated by the arrow in Figure 14.23a. Through a mechanism that 
is not yet well understood, unnattached kinetochores assemble a 
complex of proteins that mediates the spindle assembly checkpoint. 
The presence of these proteins at an unattached kinetochore sends a 
“wait” signal to the cell cycle machinery that prevents the cell from 
continuing on into anaphase. Once the wayward chromosome 
becomes attached to spindle fibers from both spindle poles and 
becomes properly aligned at the metaphase plate, the signaling com- 
plex leaves the kinetochore, which turns off the “wait” signal and 
allows the cell to progress into anaphase. 

FIGURE 14.31 shows the mitotic spindle of a cell that is arrested 
prior to metaphase due to a single unaligned chromosome. Unlike 
all of the other kinetochores in this cell, only the unaligned chromo- 
some is seen to still contain Mad2 protein, a key component of the 
spindle assembly checkpoint complex. Studies have shown that Mad2 
can switch between two conformations: an inactive (closed) and 


FIGURE 14.31 The spindle assembly checkpoint. Fluorescence 
micrograph of a mammalian cell in late prometaphase labeled with 
antibodies against the spindle checkpoint protein Mad2 (pink) and tubulin 
of the microtubules (green). The chromosomes appear blue. Only one of 
the chromosomes of this cell is seen to contain Mad2, and this chromo- 
some has not yet become aligned at the metaphase plate. The presence of 
Mad2 on the kinetochore of this chromosome is sufficient to prevent the 
initiation of anaphase. 

Source: From Jennifer Waters, Rey-Huei Chen, Andrew W. Murray, and E. D. 
Salmon, J. Cell Biol.141, cover #5, 1998; Reproduced with permission of 
the Rockefeller University Press. 


active (open) state. During mitotic entry, inactive Mad2 is recruited 
to kinetochores through its interaction with Mad1. Binding to Mad1 
at the kinetochore causes Mad2 to switch to its active conformation, 
and to start recruiting and activating and releasing additional Mad2 
molecules. While in the active conformation, Mad2 is able to bind to 
the APC activator Cdc20. During the period that Cdc20 is bound to 
Mad2, APC complexes would be unable to ubiquitinate the anaphase 
inhibitor securin, thus keeping all of the sister chromatids attached 
to one another by their cohesin “glue.” When the kinetochore binds 
to a microtubule, the Mad1-Mad2 complex is knocked off of the 
kinetochore, ceasing continued Mad2 recruitment and activation. 

It is well established that the spindle assembly checkpoint is 
activated by the presence of an unattached kinetochore, but there are 
other chromosomal abnormalities that arise during the progression 
to metaphase that also require corrective measures. For example, on 
occasion, the two kinetochores of sister chromatids will become 
attached to microtubules from the same spindle pole, a condition 
referred to as a syntelic attachment (step 3a, Figure 14.20b). If not 
corrected, a syntelic attachment is very likely to lead to the move- 
ment of both sister chromatids to one of the daughter cells, leaving 
the other daughter devoid of this chromosome. The cell is likely 
alerted to the presence of a chromosome with a syntelic attachment 
by the lack of tension on the chromosome’ kinetochores. Tension 
normally develops when sister chromatids are being pulled by 
microtubules from opposite spindle poles (steps 3-4, Figure 14.20). 

Cells are able to correct syntelic attachments (and other types of 
abnormal microtubule connections) through the action of an 
enzyme called Aurora B kinase, which is part of a mobile protein 
complex that resides at the centromere during prometaphase and 
metaphase. Among the substrates of Aurora B kinase are several of 
the proteins thought to be involved in kinetochore-microtubule 
attachment, including members of the Ndc80 complex and the kine- 
sin depolymerase of Figure 14.30. Studies suggest that Aurora B 
kinase molecules of an incorrectly attached chromosome phospho- 
rylate these protein substrates, which destabilizes microtubule 


attachment to both kinetochores. Once freed of their bonds, the 
kinetochores of each sister chromatid have a fresh opportunity to 
become attached to microtubules from opposite spindle poles. 
Inhibition of Aurora B kinase in cells or extracts leads to misalign- 
ment and missegregation of chromosomes (see Figure 18.51). 


REVIEW | 


1. Describe the events that occur in a cell during 
prometaphase and during anaphase. 


2. Describe the similarities and differences in microtubule 
dynamics between metaphase and anaphase. How 
are the differences related to anaphase A and B 
movements? 


@ 


14.11 Telophase and Cytokinesis 


As the chromosomes near their respective poles, they tend to collect 
in a mass, which marks the beginning of the final stage of mitosis, 
or telophase (FIGURES 14.11 and 14.32). During telophase, 
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FIGURE 14.32 Telophase. Electron micrograph of a section through 
a human lymphocyte in telophase. 
Source: David M. Phillips/Photo Researchers, Inc. 


daughter cells return to the interphase condition: The mitotic spindle 567 


disassembles, the nuclear envelope reforms, and the chromosomes 
become more and more dispersed until they disappear from view 
under the microscope. The actual partitioning of the cytoplasm into 
two daughter cells occurs by a process to be discussed shortly. First, 
however, let’s look back and consider the motor proteins involved in 
several of the major chromosome movements that take place during 
mitosis. 


Motor Proteins Required 
for Mitotic Movements 


Mitosis is characterized by extensive movements of cellular struc- 
tures. Prophase is accompanied by movement of the spindle poles to 
opposite ends of the cell, prometaphase by movement of the chro- 
mosomes to the spindle equator, anaphase A by movement of the 
chromosomes from the spindle equator to its poles, and anaphase B 
by the elongation of the spindle. Over the past decade, a variety of 
different molecular motors have been identified in different loca- 
tions in mitotic cells of widely diverse species. The motors involved 
in mitotic movements are primarily microtubule motors, including a 
number of different kinesin-related proteins and cytoplasmic 
dynein. Some of the motors move toward the plus end of the micro- 
tubule, others toward the minus end. As discussed above, one group 
of kinesins does not move anywhere, but promotes microtubule 
depolymerization. Motor proteins have been localized at the spindle 
poles, along the spindle fibers, and within both the kinetochores and 
arms of the chromosomes. Although firm conclusions about the 
functions of specific motor proteins cannot yet be drawn, a general 
picture of the roles of these molecules is suggested (FIGURE 14.33): 


e Motor proteins located along the polar microtubules probably 
contribute by keeping the poles apart (Figure 14.33a,b). 


e Motor proteins residing on the chromosomes are probably 
important in the movements of the chromosomes during 
prometaphase (Figure 14.33a), in maintaining the chromosomes 
at the metaphase plate (Figure 14.33b), and in separating the 
chromosomes during anaphase (Figure 14.33c). 


e Motor proteins situated along the overlapping polar microtu- 
bules in the region of the spindle equator are probably responsi- 
ble for cross-linking antiparallel microtubules and sliding them 
over one another, thus elongating the spindle during anaphase B 
(Figure 14.33c). 


Cytokinesis 


Mitosis accomplishes the segregation of duplicated chromosomes 
into daughter nuclei, but the cell is divided into two daughter cells by 
a separate process called cytokinesis. The first hint of cytokinesis in 
most animal cells appears during anaphase as an indentation of the 
cell surface in a narrow band around the cell. As time progresses, the 
indentation deepens to form a furrow that moves inward toward 
the center of the cell. The plane of the furrow lies in the same plane 
previously occupied by the chromosomes of the metaphase plate, so 
that the two sets of chromosomes are ultimately partitioned into dif- 
ferent cells (as in Figure 14.32). As one cell becomes two cells, addi- 
tional plasma membrane is delivered to the cell surface via 
cytoplasmic vesicles that fuse with the advancing cleavage furrow. In 
its latter stages, the advancing furrow passes through the tightly 
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FIGURE 14.33 Proposed activity of motor proteins during mitosis. 

(a) Prometaphase. The two halves of the mitotic spindle are moving apart 
from one another to opposite poles, due to the action of bipolar (four- 
headed) plus-end-directed motors (members of the kinesin-5 family). These 
motors can bind by their heads to antiparallel microtubules from opposite 
poles and cause them to slide apart (step 1). (Additional motors associated 
with the centrosomes and cortex are not shown.) Meanwhile, the 
chromosomes have become attached to the chromosomal microtubules and 
can be seen oscillating back and forth along the microtubules. Ultimately, 
the chromosomes are moved to the center of the spindle, midway between 
the poles. Poleward chromosome movements are mediated by minus-end- 
directed motors (i.e., cytoplasmic dynein) residing at the kinetochore 

(step 2). Chromosome movements away from the poles are mediated by 
plus-end-directed motors (i.e., kinesins) residing at the kinetochore and 
especially along the chromosome arms (step 3) (see Figure 14.21). 

(b) Metaphase. The two halves of the spindle maintain their separation as 
the result of continued plus-end-directed motor activity associated with the 
polar microtubules (step 4). The motor activity associated with step 4 is also 
suspected of promoting the flux of subunits within these microtubules that is 
depicted in Figure 14.25. The chromosomes are thought to be maintained 
at the equatorial plane by the balanced activity of plus-end and minus-end- 
directed motor proteins residing at the kinetochore (step 5). (c) Anaphase. The 
movement of the chromosomes toward the poles is thought to require the 
activity of kinesin depolymerases that catalyze depolymerization at both the 
plus and minus ends of microtubules (step 6). The separation of the poles 
(anaphase B) is thought to result from the continuing activity of the bipolar 
plus-end-directed motors of the polar microtubules (step 7). 

Source: K. E. Sawin and J. M. Scholey, Trends Cell Biol. 1:122, 1991. Trends 
in Cell Biology by Elsevier Ltd. Reproduced with permission of Elsevier Ltd. 
in the format reuse in a book/textbook via Copyright Clearance Center. 


packed remnants of the central portion of the mitotic spindle, which 
forms a cytoplasmic bridge between the daughter cells called the 
midbody (FIGURE 14.34a). The final step of cytokinesis is called 
abscission, when the surfaces of the cleavage furrow fuse with one 
another, splitting the cell in two (Figure 14.34b). Abscission requires 
the action of ESCRT complexes—the same proteins responsible for 
severing the intraluminal vesicles that form within endosomes 
(page 301). 

Our present concept of the mechanism responsible for cytoki- 
nesis stems from a proposal made by Douglas Marsland in the 1950s 
known as the contractile ring theory (FIGURE 14.35a). Marsland pro- 
posed that the force required to cleave a cell is generated in a thin 
band of contractile cytoplasm located in the cortex, just beneath the 
plasma membrane of the furrow. Microscopic examination of the 
cortex beneath the furrow of a cleaving cell reveals the presence of 
large numbers of actin filaments (Figure 14.35b and 14.36a). These 
unbranched actin filaments are assembled in the cell cortex by the 
action of a formin protein (page 352), which may also anchor the 
filaments to the overlying plasma membrane. 

Interspersed among the actin filaments are short, bipolar myosin 
filaments. These filaments are composed of myosin II, as evidenced 


wa Midbody 


FIGURE 14.34 Cytokinesis (a) These cultured mammalian cells are 
undergoing the final step in cytokinesis, called abscission, in which the 
cleavage furrow cuts through the midbody, a thin cytoplasmic bridge that is 
packed with remnants of the central portion of the mitotic spindle. 
Microtubules are green, actin is red, and DNA is blue. (b) This fertilized sea 
urchin egg has just been split into two cells by cytokinesis. 


Source: (a) From Ahna R. Skop et al., Science 305:61, 2004, Fig. 1a. 
Reprinted with permission from AAAS. (b) Courtesy of Tryggve Gustafson. 
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FIGURE 14.35 The formation and operation of the contractile ring 
during cytokinesis. (a) Actin filaments become assembled in a ring at the 
cell equator. Contraction of the ring, which requires the action of myosin, 
causes the formation of a furrow that splits the cell in two. (b) Confocal 
fluorescence micrograph of a fly spermatocyte undergoing cytokinesis at 
the end of the first meiotic division. Actin filaments, which have been 
stained with the mushroom toxin phalloidin, are seen to be concentrated in 
a circular equatorial band within the cleavage furrow. 

Source: (b) From Daniel Saul, et al., J. Cell Science 117:3893, 2004, #17 
cover, Courtesy of Julie A. Brill, by permission of the Company of Biologists 
Ltd. http://jcs.biologists.org/content/117/17.cover-expansion 


by their binding of anti-myosin II antibodies (FIGURE 14.36b). The 569 
importance of myosin II in cytokinesis is evident from the fact that 
(1) anti-myosin II antibodies cause the rapid cessation of cytokinesis 
when injected into a dividing cell (Figure 14.36c), and (2) cells lack- 
ing a functional myosin II gene carry out nuclear division by mitosis 
but cannot divide normally into daughter cells. The assembly of the 
actin-myosin contractile machinery in the plane of the future cleav- 
age furrow is orchestrated by a G protein called RhoA. In its GTP- 
bound state, RhoA triggers a cascade of events that leads to both the 
assembly of actin filaments and the activation of myosin II’s motor 
activity. If RhoA is depleted from cells or inactivated, a cleavage fur- 
row fails to develop. 
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FIGURE 14.36 Experimental demonstration of the importance of 
myosin in cytokinesis. (a,b) Localization of actin and myosin Il in a 
Dictyostelium ameba during cytokinesis as demonstrated by double-stain 
immunofluorescence. (a) Actin filaments (red) are located at both the 
cleavage furrow and the cell periphery where they play a key role in cell 
movement (Section 9.10). (b) Myosin II (green) is localized at the cleavage 
furrow, where it is part of a contractile ring that encompasses the equator. 
(c) A starfish egg that had been microinjected with an antibody against 
starfish myosin, as observed under polarized light (which causes the mitotic 
spindles to appear either brighter or darker than the background due to 
the presence of oriented microtubules). While cytokinesis has been 
completely suppressed by the antibodies, mitosis (as revealed by the 
mitotic spindles) continues unaffected. 

Source: (a-b) Courtesy of Yoshio Fukui; (c) From Daniel P. Kiehart, Issei 
Mabuchi, and Shinya Inoué, J. Cell Biol. 94:167, 1982, Fig. 1. Reproduced 
with permission of the Rockefeller University Press. 
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The force-generating mechanism operating during cytokinesis 
is thought to be similar to the actin and myosin-based contraction of 
muscle cells. In fact, the cytokinesis furrow of a primitive single- 
celled eukaryote is the likely evolutionary ancestor of the contractile 
machinery of animal muscle cells. Whereas the sliding of actin fila- 
ments of a muscle cell brings about the shortening of the muscle 
fiber, sliding of the filaments of the contractile ring pulls the cortex 
and attached plasma membrane toward the center of the cell. As a 
result, the contractile ring constricts the equatorial region of the cell, 
much like pulling on the handles of a drawstring bag narrows its 
opening. 

It is generally agreed that the position of the cleavage furrow is 
determined by the position of the anaphase mitotic spindle, which 
induces the activation of RhoA in a narrow ring within the cortex. 
However, there has been considerable debate as to how this particu- 
lar zone within the cortex is selected. Early pioneering studies on 
marine invertebrate eggs by Ray Rappaport of Union College in New 
York demonstrated that the contractile ring forms in a plane midway 
between the spindle poles, even if one of the poles is displaced by a 
microneedle inserted into the cell. An example of the relationship 
between the position of the spindle poles and cleavage plane is 
shown in the experiment of FIGURE 14.37. These studies suggest that 
the site of actin-filament assembly, and thus the plane of cytokinesis, 
is determined by a signal emanating from the spindle poles. The sig- 
nal is thought to travel from the spindle poles to the cell cortex along 
the astral microtubules. When the distance between the poles and 
the cortex is modified experimentally, the timing of cytokinesis can 
be dramatically altered (Figure 14.37). Recent studies have suggested 
that the activity observed by Rappaport may be carried out by a pro- 
tein complex known as centralspindlin. Centralspindlin, which 
includes a kinesin-like protein and an activator of RhoA, localizes to 
the plus ends of antiparallel microtubules at the spindle midbody, 
where it is then thought to activate RhoA to initiate cleavage furrow 
formation. 

A second signaling complex, known as the chromosomal pas- 
senger complex (CPC), has also been observed to accumulate at the 
spindle midbody and is also thought to regulate the formation of the 
cleavage furrow. One key member of the CPC is Aurora B kinase, 
which we previously discussed as playing an important regulatory 
role in kinetochore-microtubule attachments and other mitotic 
events (page 566). Aurora B kinase activity at the midbody appears 
to regulate the timing of abscission, ensuring that membrane scis- 
sion does not occur if chromosomes are present at the midbody, and 
also has been implicated in regulating chromatin decondensation 
following mitosis. 


Cytokinesis in Plant Cells: Formation 
of the Cell Plate 


Plant cells, which are enclosed by a relatively inextensible cell wall, 
undergo cytokinesis by a very different mechanism. Unlike animal 
cells, which are constricted by a furrow that advances inward from 
the outer cell surface, plant cells must construct an extracellular wall 
inside a living cell. Wall formation starts in the center of the cell and 
grows outward to meet the existing lateral walls. The formation of a 
new cell wall begins with the construction of a simpler precursor, 
which is called the cell plate. 

The plane in which the cell plate forms is perpendicular to the 
axis of the mitotic spindle but, unlike the case for animal cells, 


(a) 


FIGURE 14.37 The site of formation of the cleavage plane and the 
time at which cleavage occurs depends on the position of the mitotic 
spindle. (a) This echinoderm egg was allowed to divide once to forma 
two-cell embryo. Then, once the mitotic spindle appeared in each of the 
two cells, one of the cells was drawn into a micropipette, causing it to 
assume a cylindrical shape. The two dark spots in each cell are the spindle 


poles that have formed prior to the second division of each cell. (b) Nine 
minutes later the cylindrical cell has completed cleavage, while the 
spherical cell has not yet begun to cleave. These photos indicate that 

(1) the cleavage plane forms between the spindle poles, regardless of 

their position, and (2) cleavage occurs more rapidly in the cylindrical cell. 
Bar equals 80 mm. (c) These results can be explained by assuming that 

(1) the cleavage plane (brown bar) forms where the astral microtubules 
overlap, and (2) cleavage occurs earlier in the cylindrical cell because the 
distance from the poles (blue spheres) to the site of cleavage is reduced, 
thereby shortening the time it takes the cleavage signal to reach the surface. 


Source: (a-b) From Charles B. Shuster and David R. Burgess, J. Cell Biol. 
146:987, 1999, Fig. 5. Reproduced with permission of the Rockefeller 
University Press. 


the plane is not determined by the position of the spindle nor is 
it determined late in mitosis. Rather, the orientation of both the 
mitotic spindle and cell plate are determined by a belt of cortical 


microtubules—the preprophase band—that forms in late G, (see 
Figure 9.15). Even though the preprophase band has disassembled 
by prometaphase, it leaves an invisible imprint that determines the 
future division site. The first sign of cell plate formation is seen in 
late anaphase with the appearance of the phragmoplast in the center 
of the dividing cell. The phragmoplast consists of clusters of inter- 
digitating microtubules oriented perpendicular to the future plate 
(Figure 9.15), together with actin filaments, membranous vesicles, 
and electron-dense material. The microtubules of the phragmoplast, 
which arise from remnants of the mitotic spindle, serve as tracks for 
the movement of small Golgi-derived secretory vesicles into the 
region. The vesicles become aligned along a plane between the 
daughter nuclei (FIGURE 14.38a). Electron micrographs of rapidly 
frozen tobacco cells have revealed the steps by which the Golgi- 
derived vesicles become reorganized into the cell plate. To begin the 
process (step 1, Figure 14.38b), the vesicles send out finger-like 
tubules that contact and fuse with neighboring vesicles to form an 
interwoven tubular network in the center of the cell (step 2). 
Additional vesicles are then directed along microtubules to the lat- 
eral edges of the network. The newly arrived vesicles continue the 
process of tubule formation and fusion, which extends the network 
in an outward direction (step 2). Eventually, the leading edge of the 
growing network contacts the parent plasma membrane at the 
boundary of the cell (step 3). Ultimately, the tubular network loses 
its cytoplasmic gaps and matures into a continuous, flattened partition. 


(a) 


FIGURE 14.38 The formation of a cell plate between two daughter plant nuclei during 
cytokinesis. (a) A low-magnification electron micrograph showing the formation of the cell 
plate between the future daughter cells. Secretory vesicles derived from nearby Golgi 
complexes have become aligned along the equatorial plane (arrow) and are beginning to 
fuse with one another. The membrane of the vesicles forms the plasma membranes of the 
two daughter cells, and the contents of the vesicles will provide the material that forms the 
cell plate separating the cells. (6) Steps in the formation of the cell plate as described in 


the text. 


Source: (a) David Phillips/Photo Researchers, Inc.; (b) A. L. Samuels, T. H. Giddings, Jr., & L. A. 
Staehelin, Journal Cell Biology 130:1354, 1995. The Journal of Cell Biology by Rockefeller 


The membranes of the tubular network become the plasma mem- 571 


branes of the two adjacent daughter cells, whereas the secretory 
products that had been carried within the vesicles contribute to the 
intervening cell plate. Once the cell plate is completed, cellulose and 
other materials are added to produce the mature cell wall. 


REVIEW | 


1. What types of force-generating mechanisms might 
be responsible for chromosome movement during 
anaphase? 


2. Contrast the events that occur during cytokinesis in 
typical plant and animal cells. 


e ——___- 


14.12 Overview of Meiosis 


The production of offspring by sexual reproduction includes the 
union of two cells, each with a haploid set of chromosomes. As dis- 
cussed in Chapter 10, the doubling of the chromosome number at 
fertilization is compensated by an equivalent reduction in chromo- 
some number at a stage prior to formation of the gametes. This is 
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word meaning “reduction.” Meiosis ensures production of a haploid 
phase in the life cycle, and fertilization ensures a diploid phase. 
Without meiosis, the chromosome number would double with each 
generation, and sexual reproduction would not be possible. 

To compare the events of mitosis and meiosis, we need to exam- 
ine the fate of the chromatids. Prior to both mitosis and meiosis, 
diploid G, cells contain pairs of homologous chromosomes, with 
each chromosome consisting of two chromatids. During mitosis, the 
chromatids of each chromosome are split apart and separate into 
two daughter nuclei in a single division. As a result, cells produced by 
mitosis contain pairs of homologous chromosomes and are geneti- 
cally identical to their parents. During meiosis, in contrast, the four 
chromatids of a pair of replicated homologous chromosomes are dis- 
tributed among four daughter nuclei. Meiosis accomplishes this feat 
by incorporating two sequential divisions without an intervening 
round of DNA replication (FIGURE 14.39). In the first meiotic divi- 
sion, each chromosome (consisting of two chromatids) is separated 
from its homologue. Asa result, each daughter cell contains only one 
member of each pair of homologous chromosomes. For this to 
occur, homologous chromosomes are paired during prophase of the 
first meiotic division (prophase I, Figure 14.39) by an elaborate pro- 
cess that has no counterpart in mitosis. As they are paired, homolo- 
gous chromosomes engage in a process of genetic recombination 
that produces chromosomes with new combinations of maternal 
and paternal alleles (see metaphase I, Figure 14.39). In the second 
meiotic division, the two chromatids of each chromosome are sepa- 
rated from one another (anaphase II, Figure 14.39). 

A survey of various eukaryotes reveals marked differences with 
respect to the stage within the life cycle at which meiosis occurs and 
the duration of the haploid phase. The following three groups (Figure 
14.40) can be identified on these bases: 


1. Gametic or terminal meiosis. In this group, which includes all 
multicellular animals and many protists, the meiotic divisions are 
closely linked to the formation of the gametes (FIGURE 14.40, left). 
In male vertebrates (FIGURE 14.41a), for example, meiosis occurs 
just prior to the differentiation of the spermatozoa. Spermatogonia 
that are committed to undergo meiosis become primary spermat- 
ocytes, which then undergo the two divisions of meiosis to pro- 
duce four relatively undifferentiated spermatids. Each spermatid 
undergoes a complex differentiation to become the highly specialized 
sperm cell (spermatozoon). In female vertebrates (Figure 14.41b), 
oogonia become primary oocytes, which then enter a greatly 
extended meiotic prophase. During this prophase, the primary 
oocyte grows and becomes filled with yolk and other materials. It 
is only after differentiation of the oocyte is complete (i.e., the 
oocyte has reached essentially the same state as when it is ferti- 
lized) that the meiotic divisions occur. Vertebrate eggs are typi- 
cally fertilized at a stage before the completion of meiosis (usually 
at metaphase II). Meiosis is completed after fertilization, while 
the sperm resides in the egg cytoplasm. 


2. Zygotic or initial meiosis. In this group, which includes only 
protists and fungi, the meiotic divisions occur just after fertiliza- 
tion (Figure 14.40, right) to produce haploid spores. The spores 
divide by mitosis to produce a haploid adult generation. 
Consequently, the diploid stage of the life cycle is restricted to a 
brief period after fertilization when the individual is still a zygote. 

3. Sporic or intermediate meiosis. In this group, which includes 
plants and some algae, the meiotic divisions take place at a 
stage unrelated to either gamete formation or fertilization 
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FIGURE 14.39 The stages of meiosis. 


(Figure (14.40, center). If we begin the life cycle with the union of 
a male gamete (the pollen grain) and a female gamete (the egg), 
the diploid zygote undergoes mitosis and develops into a diploid 
sporophyte. At some stage in the development of the sporo- 
phyte, sporogenesis (which includes meiosis) occurs, producing 
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FIGURE 14.40 A comparison of three major groups of organisms based 
on the stage within the life cycle at which meiosis occurs and the duration 
of the haploid phase. 

Source: The Cell in Development and Heredity by Wilson, Edmund B. 
Copyright 1987. Reproduced with permission of Taylor & Francis Group Llc - 
Books in the format textbook via Copyright Clearance Center. 


spores that germinate directly into a haploid gametophyte. 
The gametophyte can be either an independent stage or, as in the 
case of seed plants, a tiny structure retained within the ovules. In 
either case, the gametes are produced from the haploid gameto- 
phyte by mitosis. 


REVIEW | 


1. Contrast the overall roles of mitosis and meiosis in 
the lives of a plant or animal. How do the nuclei 
formed by these two processes differ from one 
another? 


2. Contrast the timing of meiosis in spermatogenesis 


versus oogenesis. 
oe 
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FIGURE 14.41 The stages of gametogenesis in vertebrates: a 
comparison between the formation of sperm and eggs. In both sexes, a 
relatively small population of primordial germ cells present in the embryo 
proliferates by mitosis to form a population of gonial cells (spermatogonia 
or oogonia) from which the gametes differentiate. In the male (a), meiosis 
occurs before differentiation, whereas in the female (b), both meiotic 
divisions occur after differentiation. Each primary spermatocyte generally 
gives rise to four viable gametes, whereas each primary oocyte forms only 
one fertilizable egg and two or three polar bodies. 
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As with mitosis, the prelude to meiosis includes DNA replication. 
The premeiotic S phase generally takes several times longer than a 
premitotic S phase. Prophase of the first meiotic division (i.e., pro- 
phase I) is typically lengthened in extraordinary fashion when com- 
pared to mitotic prophase. In the human female, for example, 
oocytes initiate prophase I of meiosis prior to birth and then enter a 
period of prolonged arrest. Oocytes resume meiosis just prior to the 
time they are ovulated, which occurs every 28 days or so after an 
individual reaches puberty. Consequently, many human oocytes 
remain arrested in the same approximate stage of prophase for sev- 
eral decades. The first meiotic prophase is also very complex and is 
customarily divided into several stages that are similar in all sexually 
reproducing eukaryotes (FIGURE 14.42). 

The first stage of prophase I is leptotene, during which the chro- 
mosomes become compacted and visible in the light microscope. 
Although the chromosomes have replicated at an earlier stage, there 
is no indication that each chromosome is actually composed of a 
pair of identical chromatids. In the electron microscope, however, 
the chromosomes are revealed to be composed of paired chromatids. 

The second stage of prophase I, which is called zygotene, is 
marked by the visible association of homologues with one another. 
This process of chromosome pairing is called synapsis and is an 
intriguing event with important unanswered questions: On what 
basis do the homologues recognize one another? How does the pair 
become so perfectly aligned? When does recognition between hom- 
ologues first occur? Recent studies have shed considerable light on 
these questions. It had been assumed for years that interaction 
between homologous chromosomes first begins as chromosomes 
initiate synapsis. However, studies on yeast cells by Nancy Kleckner 
and her colleagues at Harvard University demonstrated that homol- 
ogous regions of DNA from homologous chromosomes are already 
associated with one another during leptotene. Chromosome com- 
paction and synapsis during zygotene simply make this arrangement 
visible under the microscope. As will be discussed below, the first 
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step in genetic recombination is the deliberate introduction of 
double-stranded breaks in aligned DNA molecules. Studies in both 
yeast and mice suggest the DNA breaks occur in leptotene, well 
before the chromosomes are visibly paired. 

These findings are supported by studies aimed at locating par- 
ticular DNA sequences within the nuclei of premeiotic and meiotic 
cells. We saw in Section 12.8 that individual chromosomes occupy 
discrete regions within nuclei rather than being randomly dispersed 
throughout the nuclear space. When yeast cells about to enter mei- 
otic prophase are examined, each pair of homologous chromosomes 
is found to share a joint territory distinct from the territories shared 
by other pairs of homologues. This finding suggests that homolo- 
gous chromosomes are paired to some extent before meiotic pro- 
phase begins. The telomeres (terminal segments) of leptotene 
chromosomes are distributed throughout the nucleus. Then, near 
the end of leptotene, there is a dramatic reorganization of chromo- 
somes in many species so that the telomeres become localized at the 
inner surface of the nuclear envelope at one side of the nucleus. 
Clustering of telomeres at one end of the nuclear envelope occurs in 
a wide variety of eukaryotes and causes the chromosomes to resem- 
ble the clustered stems of a bouquet of flowers (FIGURE 14.43). Mice 
carrying mutations that prevent the association of chromosomes 
with the nuclear envelope exhibit defects in synapsis, genetic recom- 
bination, and gamete formation. These experimental results suggest 
that the nuclear envelope plays an important role in the interaction 
between homologous chromosomes during meiosis. 

A recent study in fission yeast has demonstrated that noncoding 
meiotic RNAs (meiRNAs) may also play a key role in the pairing of 
homologous chromosomes during meiosis. meiRNAs were observed. 
being transcribed from certain loci along a chromosome and 
appeared to pair with analogous meiRNA transcripts on the homol- 
ogous chromosome, leading to the pairing of the chromosomes. 
Further research is needed to determine the prevalence of this 
mechanism in different organisms. Electron micrographs indicate 
that chromosome synapsis is accompanied by the formation of a 
complex structure called the synaptonemal complex. The synap- 
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FIGURE 14.42 The stages of prophase I. The events at each stage are described in the text. 


FIGURE 14.43 Association of the telomeres of meiotic chromosomes 
with the nuclear envelope. The chromosomes of a stage of meiotic 
prophase of the male grasshopper in which homologous chromosomes are 
physically associated as part of a bivalent. The bivalents are arranged in a 
well-defined “bouquet” with their terminal regions clustered near the inner 
surface of the nuclear envelope at the base of the photo. 

Source: From B. John, Meiosis, © 1990. Cambridge University Press. 
Reprinted with permission. Courtesy of Bernard John. 


tonemal complex (SC) is a ladder-like structure with transverse 
protein filaments connecting the two lateral elements (FIGURE 14.44). 
The chromatin of each homologue is organized into loops that 
extend from one of the lateral elements of the SC (Figure 14.440). 
The lateral elements are composed primarily of cohesin (Figure 
14.14), which presumably binds together the chromatin of the sister 
chromatids. For many years, the SC was thought to hold each pair of 
homologous chromosomes in the proper position to initiate genetic 
recombination between strands of homologous DNA. It is now evi- 
dent that the SC is not required for genetic recombination. Not only 
does the SC form after genetic recombination has been initiated, but 
mutant yeast cells unable to assemble an SC can still engage in the 
exchange of genetic information between homologues. It is currently 
thought that the SC functions primarily as a scaffold to allow inter- 
acting chromatids to complete their crossover activities, as described. 
below. 

The complex formed by a pair of synapsed homologous chro- 
mosomes is called a bivalent or a tetrad. The former term reflects 
the fact that the complex contains two homologues, whereas the lat- 
ter term calls attention to the presence of four chromatids. The end 
of synapsis marks the end of zygotene and the beginning of the next 
stage of prophase I, called pachytene (Figure 14.44a), which is char- 
acterized by a fully formed synaptonemal complex. During pachy- 
tene, the homologues are held closely together along their length by 
the SC. The DNA of sister chromatids is extended into parallel loops 
(Figure 14.44b). Under the electron microscope, a number of elec- 
tron-dense bodies about 100 nm in diameter are seen within the 
center of the SC. These structures have been named recombination 
nodules because they correspond to the sites where crossing-over is 
taking place, as evidenced by the associated synthesis of DNA that 
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FIGURE 14.44 The synaptonemal complex. (a) Electron micrograph of a 
human pachytene bivalent showing a pair of homologous chromosomes 
held in a tightly ordered parallel array. K, kinetochore. (b) Schematic 
diagram of the synaptonemal complex and its associated chromosomal 
fibers. The dense granules (recombination nodules) seen in the center of 
the SC (indicated by the arrowhead in part a) contain the enzymatic 
machinery required to complete genetic recombination, which is thought to 
begin at a much earlier stage in prophase |. Closely paired loops of DNA 
from the two sister chromatids of each chromosome are depicted. The 
loops are likely maintained in a paired configuration by cohesin (not 
shown). Genetic recombination (crossing-over) is presumed to occur 
between the DNA loops from nonsister chromatids, as shown. 

Source: Courtesy of Alberto J. Solari, Chromosoma 81:330, 1980. With kind 
permission from Springer Science1 Business Media. 


occurs during intermediate steps of recombination (Section 14.15). 
Recombination nodules contain the enzymatic machinery that facil- 
itates genetic recombination, which is completed by the end of 
pachytene. 

The beginning of diplotene, the next stage of meiotic prophase I 
(Figure 14.42), is recognized by the dissolution of the SC, which 
leaves the chromosomes attached to one another at specific points by 
X-shaped structures, termed chiasmata (singular chiasma) 
(FIGURE 14.45). Chiasmata are located at sites on the chromosomes 
where crossing-over between DNA molecules from the two chromo- 
somes had previously occurred. Chiasmata are formed by covalent 
junctions between a chromatid from one homologue and a nonsister 
chromatid from the other homologue. These points of attachment 
provide a striking visual portrayal of the extent of genetic recombi- 
nation. The chiasmata are made more visible by a tendency for the 
homologues to separate from one another at the diplotene stage. 
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FIGURE 14.45 Visible evidence of crossing-over. (a,b) Diplotene 
bivalents from the grasshopper showing the chiasmata formed between 
chromatids of each homologous chromosome. The accompanying inset 
indicates the crossovers that have presumably occurred within the bivalent 
in a. The chromatids of each diplotene chromosome are closely apposed 
except at the chiasmata. (c) Scanning electron micrograph of a bivalent 
from the desert locust with three chiasmata (arrows). 

Source: (a—b) From Bernard John, Meiosis, © 1990 Cambridge University 
Press. Reprinted with permission. (c) From Klaus Werner Wolf, 

BioEss. 16:108, 1994. Reprinted with permission of John Wiley & Sons. 


In vertebrates, diplotene can be an extremely extended phase of 
oogenesis during which the bulk of oocyte growth occurs. Thus 
diplotene can be a period of intense metabolic activity. Transcrip- 
tion during diplotene in the oocyte provides the RNA utilized for 
protein synthesis during both oogenesis and early embryonic devel- 
opment following fertilization. 

During the final stage of meiotic prophase I, called diakinesis, 
the meiotic spindle is assembled and the chromosomes are prepared 
for separation. In those species in which the chromosomes become 
highly dispersed during diplotene, the chromosomes become re- 
compacted during diakinesis. Diakinesis ends with the disappear- 
ance of the nucleolus, the breakdown of the nuclear envelope, and 
the movement of the tetrads to the metaphase plate. In vertebrate 
oocytes, these events are triggered by an increase in the level of the 
protein kinase activity of MPF (maturation-promoting factor). As 
discussed in the Experimental Pathways in Section 14.3, MPF was 
first identified by its ability to initiate these events, which represent 
the maturation of the oocyte (Section 14.3). 

In most eukaryotic species, chiasmata can still be seen in 
homologous chromosomes aligned at the metaphase plate of meiosis 
I. In fact, chiasmata are required to hold the homologues together as 
a bivalent during this stage. In humans and other vertebrates, every 


pair of homologues typically contains at least one chiasma, and the 
longer chromosomes tend to have two or three of them. It is thought 
that some mechanism exists to ensure that even the smallest chro- 
mosomes form a chiasma. If a chiasma does not occur between a 
pair of homologous chromosomes, the chromosomes of that bivalent 
tend to separate from one another after dissolution of the SC. This 
premature separation of homologues often results in the formation 
of nuclei with an abnormal number of chromosomes. The conse- 
quences of such an event are discussed in the Human Perspective 
(Section 14.14). 

At metaphase I, the two homologous chromosomes of each 
bivalent are connected to the spindle fibers from opposite poles 
(FIGURE 14.46a). In contrast, sister chromatids are connected to 
microtubules from the same spindle pole, which is made possible by 
the side-by-side arrangement of their kinetochores as seen in the 
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FIGURE 14.46 Separation of homologous chromosomes during 
meiosis | and separation of chromatids during meiosis II. (a) Schematic 
diagram of a pair of homologous chromosomes at metaphase |. The 
chromatids are held together along both their arms and centromeres by 
cohesin. The pair of homologues is maintained as a bivalent by the 
chiasmata. Inset micrograph shows that the kinetochores (arrowheads) of 
sister chromatids are situated on one side of the chromosome, facing the 
same pole. The black dots are gold particles bound to the motor protein 
CENP-E of the kinetochores (see Figure 14.16c). (b) At anaphase |, the 
cohesin holding the arms of the chromatids is cleaved, allowing the 
homologues to separate from one another. Cohesin remains at the 
centromere, holding the chromatids together. (c) At metaphase II, the 
chromatids are held together at the centromere, with microtubules from 
opposite poles attached to the two kinetochores. Inset micrograph shows 
the kinetochores of the sister chromatids are now on opposite sides of the 
chromosome, facing opposite poles. (d) At anaphase II, the cohesin 
holding the chromatids together has been cleaved, allowing the 
chromosomes to move to opposite poles. 


Source: Insets: From Jibak Lee et al., Mol. Reprod. Develop. 56:51, 2000. 
Reprinted with permission of John Wiley & Sons. 


inset of Figure 14.46a. The orientation of the maternal and paternal 
chromosomes of each bivalent on the metaphase I plate is random; 
the maternal member ofa particular bivalent has an equal likelihood 
of facing either pole. Consequently, when homologous chromo- 
somes separate during anaphase I, each pole receives a random 
assortment of maternal and paternal chromosomes (see Figure 
14.39). Thus, anaphase I is the cytological event that corresponds to 
Mendel’s law of independent assortment (page 368). As a result of 
independent assortment, organisms are capable of generating a 
nearly unlimited variety of gametes. 

Separation of homologous chromosomes at anaphase I requires 
the dissolution of the chiasmata that hold the bivalents together. The 
chiasmata are maintained by cohesion between sister chromatids in 
regions that flank these sites of recombination (Figure 14.46a). The 
chiasmata disappear at the metaphase I-anaphase I transition, as the 
arms of the chromatids of each bivalent lose cohesion (Figure 
14.46b). Loss of cohesion between the arms is accomplished by pro- 
teolytic cleavage of the cohesin molecules in those regions of the 
chromosome. In contrast, cohesion between the joined centromeres 
of sister chromatids remains strong, because the cohesin situated 
there is protected from proteolytic attack (Figure 14.46)). Asa result, 
sister chromatids remain firmly attached to one another as they 
move together toward a spindle pole during anaphase I. 

Telophase I of meiosis I produces less dramatic changes than 
telophase of mitosis. Although chromosomes often undergo some 
dispersion, they do not reach the extremely extended state of the 
interphase nucleus. The nuclear envelope may or may not reform 
during telophase I. The stage between the two meiotic divisions is 
called interkinesis and is generally short-lived. In animals, cells in 
this fleeting stage are referred to as secondary spermatocytes or sec- 
ondary oocytes. These cells are characterized as being haploid 
because they contain only one member of each pair of homologous 
chromosomes. Even though they are haploid, they have twice as 


much DNA as a haploid gamete because each chromosome is still 577 


represented by a pair of attached chromatids. Secondary spermato- 
cytes are said to have a 2C amount of DNA, half as much as a pri- 
mary spermatocyte, which has a 4C DNA content, and twice as 
much as a sperm cell, which has a 1C DNA content. 

Interkinesis is followed by prophase II, a much simpler pro- 
phase than its predecessor. If the nuclear envelope had reformed in 
telophase I, it is broken down again. The chromosomes become re- 
compacted and line up at the metaphase plate. Unlike metaphase I, 
the kinetochores of sister chromatids of metaphase II face opposite 
poles and become attached to opposing sets of chromosomal spindle 
fibers (Figure 14.46c). The progression of meiosis in vertebrate 
oocytes stops at metaphase II. The arrest of meiosis at metaphase II 
is brought about by factors that inhibit APC°*° activation, thereby 
preventing cyclin B degradation. As long as cyclin B levels remain 
high within the oocyte, Cdk activity is maintained, and the cells can- 
not progress to the next meiotic stage. Metaphase II arrest is released 
only when the oocyte (now called an egg) is fertilized. Fertilization 
leads to a rapid influx of Ca?" ions, the activation of APC? (page 
562), and the destruction of cyclin B. The fertilized egg responds to 
these changes by completing the second meiotic division. Anaphase 
II begins with the synchronous splitting of the centromeres, which 
had held the sister chromatids together, allowing them to move 
toward opposite poles of the cell (Figure 14.46d). Meiosis II ends 
with telophase II, in which the chromosomes are once again enclosed 
by a nuclear envelope. The products of meiosis are haploid cells with 
a 1C amount of nuclear DNA. 


REVIEW | 


1. Contrast the events that take place during prophase | 
and prophase II of meiosis. 


eo | 


14.14 THE HUMAN PERSPECTIVE 


Meiotic Nondisjunction and Its Consequences 


Meiosis is a complex process, and meiotic mistakes in humans 
appear to be surprisingly common. Homologous chromosomes 
may fail to separate from each other during meiosis |, or sister 
chromatids may fail to come apart during meiosis Il. When either 
of these situations occurs, gametes are formed that contain an 
abnormal number of chromosomes—either an extra chromosome 
or a missing chromosome (FIGURE 1). If one of these gametes 
happens to fuse with a normal gamete, a zygote with an abnormal 
number of chromosomes forms, and serious consequences arise. 
In most cases, the zygote develops into an abnormal embryo 
that dies at some stage between conception and birth. In a few 
cases, however, the zygote develops into an infant whose cells 
have an abnormal chromosome number, a condition known as 
aneuploidy. The consequences of aneuploidy depend on which 
chromosome or chromosomes are affected. 

The normal human chromosome complement is 46: 22 
pairs of autosomes and one pair of sex chromosomes. An extra 
chromosome (producing a total of 47 chromosomes) creates a 
condition referred to as a trisomy (FIGURE 2). A person whose cells 
contain an extra chromosome 21, for example, has trisomy 21. A 


missing chromosome (producing a total of 45 chromosomes) pro- 
duces a monosomy. We will begin by considering the effects of 
an abnormal number of autosomes. 

The absence of one autosomal chromosome, regardless 
of which chromosome is affected, invariably proves to be 
lethal at some stage during embryonic or fetal development. 
Consequently, a zygote containing an autosomal monosomy 
does not give rise to a fetus that is carried to term. Although one 
might not expect that possession of an extra chromosome would 
create a life-threatening condition, the fact is that trisomies do 
not fare much better than monosomic zygotes. Of the 22 differ- 
ent autosomes in the human chromosome complement, only per- 
sons with trisomy 21 can survive beyond the first few weeks or 
months of life. Most of the other possible trisomies are lethal dur- 
ing development, whereas trisomies for chromosomes 13 and 18 
are often born alive but have such severe abnormalities that they 
succumb soon after birth. More than one-quarter of fetuses that 
spontaneously abort carry a chromosomal trisomy. It is thought 
that many more zygotes carrying abnormal chromosome num- 
bers produce embryos that die at an early stage of development 


continued 


aAlpedsiag UCWNH OYUL e HL'PL 


578 


UOISINIG |]2D e¢ VL YALdVHOD 


i 


First 
meiotic 
division 
Nondisjunction Normal 
Second 
meiotic 
division 
Normal Normal Normal Nondisjunction 


IL /\ J\ 1\ 
©0000 0000 


cee sere ee Normal h h 
with extra missing a gametes CMO" =a cnro- 
chromosome chromosome mosome, meson? 


FIGURE 1. Meiotic nondisjunction occurs when chromosomes fail to 
separate from each other during meiosis. If the failure to separate occurs 
during the first meiotic division, which is called primary nondisjunction, 
all of the haploid cells have an abnormal number of chromosomes. If 
nondisjunction occurs during the second meiotic division, which is called 
secondary nondisjunction, only two of the four haploid cells are affected. 
(More complex types of nondisjunction than shown here can also occur.) 


before the pregnancy is recognized. For example, for every tri- 
somic zygote formed at fertilization, there is presumably an equal 
number of monosomic zygotes that fare even less well. It is esti- 
mated that approximately 20 to 25 percent of human oocytes are 
aneuploid, which is much higher than any other species that has 
been studied. Meiosis in males occurs with a much lower level of 
chromosomal abnormalities than in females. 


FIGURE 2 The karyotype of a person with Down syndrome. The 
karyotype shows an extra chromosome 21 (trisomy 21). 


Source: PHANIE/Photo Researchers, Inc. 


Not all of the aneuploidy that occurs during human 
development necessarily begins with the zygote. One recent 
study of early embryos formed by in vitro fertilization (IVF) found 
that most of these embryos contained some cells (called blasto- 
meres) with an aneuploid karyotype. Aneuploid cells were present 
along with normal cells in the same embryo suggesting (1) that 
chromosome nondisjunction occurs frequently during mitotic divi- 
sions of early embryonic cells and (2) that these abnormal cells 
are somehow destroyed during early development, owing to the 
fact that babies born as a result of in vitro fertilization do not 
exhibit an abnormally high level of chromosome abnormalities. 
Because it is impossible to study the chromosome complement 
of cells from human embryos conceived in vivo, we don’t know 
whether the occurrence of mitotic nondisjunction seen in this 
study is a phenomenon restricted to embryos generated by IVF. 

Even though chromosome 21 is the smallest human 
chromosome with fewer than 400 genes, the presence of an extra 
copy of this genetic material leads to Down syndrome. Persons 
with Down syndrome exhibit varying degrees of mental impair- 
ment, alteration in certain body features, circulatory problems, 
increased susceptibility to infectious diseases, a greatly increased 
risk of developing leukemia, and the early onset of Alzheimer’s 
disease. All of these medical problems are thought to result 
from higher-than-normal level of expression of genes located 
on chromosome 21. Moreover, the expression of genes on other 
chromosomes can also be affected by the excess levels of certain 
transcription factors encoded by genes on chromosome 21. 

The presence of an abnormal number of sex chromosomes is 
much less disruptive to human development. A zygote with only 
one X chromosome and no second sex chromosome (denoted 
as XO) develops into a female with Turner syndrome, in which 
genital development is arrested in the juvenile state, the ovaries 
fail to develop, and body structure is slightly abnormal. Because 
a Y chromosome is male determining, persons with at least one Y 
chromosome develop as males. A male with an extra X chromosome 
(XXY) develops Klinefelter syndrome, which is characterized by 
mental retardation, underdevelopment of genitalia, and the pres- 
ence of feminine physical characteristics (such as breast enlarge- 
ment). A zygote with an extra Y (XYY) develops into a physically 
normal male who is likely to be taller than average. Considerable 
controversy developed surrounding claims that XYY males tend to 
exhibit more aggressive, antisocial, and criminal behavior than do 
XY males, but this hypothesis has never been substantiated. 

The likelihood of having a child with Down syndrome rises 
dramatically with the age of the mother—from 0.05 percent for 
mothers 19 years of age to greater than 3 percent for women 
over the age of 45. Most studies show that no such correlation 
is found between the age of the father and the likelihood of 
bearing a child with trisomy 21. Estimates based on comparisons 
of DNA sequences between the offspring and the parents indi- 
cate that approximately 95 percent of trisomies 21 can be traced 
to nondisjunction having occurred in the mother. Note, however, 
that although chromosome nondisjunction does not appear to 
increase during spermatogenesis in relation to the age of the 
father, the number of new mutations that are passed on to a child 
by the father does apparently increase as aman ages. This finding 
is linked to an apparent increase in the risk of autism in children 
born to older fathers. 

It was noted above that an abnormal chromosome number 
can result from nondisjunction at either of the two meiotic divi- 
sions (Figure 1). Although these different nondisjunction events 
produce the same effect in terms of chromosome numbers in the 
zygote, they can be distinguished by genetic analysis. Primary 
nondisjunction transmits two homologous chromosomes to the 
zygote, whereas secondary nondisjunction transmits two sister 


chromatids (most likely altered by crossing over) to the zygote. 
Studies indicate that most of the mistakes occur during meiosis |. 
For example, in one study of 433 cases of trisomy 21 that resulted 
from maternal nondisjunction, 373 were the result of errors that 
had occurred during meiosis | and 60 were the result of errors 
during meiosis Il. 

Why should meiosis | be more susceptible to nondisjunction 
than meiosis II? We don't know the precise answer to that question, 
but it almost certainly reflects the fact that oocytes of older women 
have remained arrested in meiosis | for a very long period within 
the ovary. It was noted in the chapter that chiasmata—which are 


visual indicators of genetic recombination—play an important 
role in holding a bivalent together during metaphase |. According 
to one hypothesis, meiotic spindles of older oocytes are less able 
to hold together weakly constructed bivalents (e.g., bivalents with 
only one chiasma located near the tip of the chromosome) than 
those of younger oocytes, increasing the likelihood that homol- 
ogous chromosomes will missegregate at anaphase |. Another 
possibility is that sister chromatid cohesion, which prevents the 
chiasmata from “sliding off” the end of the chromosome, is not 
fully maintained over an extended period, allowing homologues 
to separate prematurely. 


14.15 Genetic Recombination 
during Meiosis 


In addition to reducing the chromosome number as required for 
sexual reproduction, meiosis increases the genetic variability in a 
population of organisms from one generation to the next. Independent 
assortment allows maternal and paternal chromosomes to become 
shuffled during formation of the gametes, and genetic recombination 
(crossing-over) allows maternal and paternal alleles on a given chro- 
mosome to become shuffled as well (see Figure 14.39). Without 
genetic recombination, the alleles along a particular chromosome 
would remain tied together, generation after generation. By mixing 
maternal and paternal alleles between homologous chromosomes, 
meiosis generates organisms with novel genotypes and phenotypes 
on which natural selection can act (see Figure 10.7 for an example 
from the fruit fly). In humans, it has been estimated that there are 
around 25 recombination events in any given sperm cell. 

Recombination involves the physical breakage of individual 
DNA molecules and the ligation of the split ends from one DNA 
duplex with the split ends of the duplex from the homologous chro- 
mosome. Recombination is a remarkably precise process that nor- 
mally occurs without the addition or loss of a single base pair. To 
occur so faithfully, recombination depends on the complementary 
base sequences that exist between a single strand from one chromo- 
some and the homologous strand of another chromosome, as dis- 
cussed below. The precision of recombination is further ensured by 
the involvement of DNA repair enzymes that fill gaps that develop 
during the exchange process. 

A simplified model of the proposed steps that occur during 
recombination in eukaryotic cells is shown in FIGURE 14.47. In this 
model, two DNA duplexes that are about to recombine become 
aligned next to one another as the result of some type of homology 
search in which homologous DNA molecules associate with one 
another in preparation for recombination. Once they are aligned, an 
enzyme (Spol1) introduces a double-stranded break into one of the 
duplexes (step 1, Figure 14.47). The gap is subsequently widened 
(resected) as indicated in step 2. Resection may occur by the action 
of a 5’ > 3’ exonuclease or by an alternate mechanism. Regardless, 
the broken strands possess exposed single-stranded tails, each bear- 
ing a 3’ OH terminus. In the model shown in Figure 14.47, one of the 
single-stranded tails leaves its own duplex and invades the DNA 
molecule of a nonsister chromatid, hydrogen bonding with the com- 
plementary strand in the neighboring duplex (step 3). In E. coli, this 
process in which a single strand invades an intact homologous 
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FIGURE 14.47 A proposed mechanism for genetic recombination 
initiated by double-strand breaks. The steps are described in the text. 


duplex and displaces the corresponding strand in that duplex is cata- 
lyzed by a recombinase enzyme, called the RecA protein. The RecA 
recombinase polymerizes along a length of single-stranded DNA 
forming a nucleoprotein filament. RecA enables the single-stranded 
DNA to search for and invade an homologous double helix. 
Eukaryotic cells have homologues of RecA (e.g., Rad51) that are 
thought to catalyze strand invasion. Strand invasion activates a DNA 
repair activity (Section 13.8) that fills the gaps as shown in step 4. 
As a result of the reciprocal exchange of DNA strands, the two 
duplexes are covalently linked to one another to form a joint 
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or Holliday junctions, that flank the region of strand exchange (steps 
4 and 5, Figure 14.47). These junctions are named after Robin 
Holliday, who proposed their existence in 1964. This type of recom- 
bination intermediate need not be a static structure because the 
point of linkage may move in one direction or another (an event 
known as branch migration) by breaking the hydrogen bonds hold- 
ing the original pairs of strands and reforming hydrogen bonds 
between strands of the newly joined duplexes (step 5). Formation of 
Holliday junctions and branch migration occur during pachytene 
(Figure 14.42). 

To resolve the interconnected DNA molecules of the Holliday 
junctions and restore the DNA back to two separate duplexes, another 
round of DNA cleavage must occur. Depending on the particular 
DNA strands that are cleaved and ligated, two alternate products can 
be generated. In one case, the two duplexes contain only short 
stretches of genetic exchange, which represents a noncrossover 
(step 6, Figure 14.47). In the alternate pathway of breakage and liga- 
tion, the duplex of one DNA molecule is covalently joined to the 
duplex of the homologous molecule, creating a site of genetic 


Analytic Questions | 


1. In what way is cell division a link between humans and the earli- 
est eukaryotic cells? 


2. What types of synthetic events would you expect to occur in G; 
that do not occur in Gy? 


3. Suppose you are labeling a population of cells growing asyn- 
chronously with H]thymidine. G, is 6 hours, S is 6 hours, Go is 
5 hours, and M is 1 hour. What percentage of the cells would be 
labeled after a 15-minute pulse? What percentage of mitotic 
cells would be labeled after such a pulse? How long would you 
have to chase these cells before you saw any labeled mitotic 
chromosomes? What percentage of the cells would have labeled 
mitotic chromosomes if you chased the cells for 18 hours? 
(See Quantitative Tutorial Video). 


4. Suppose you take a culture of the same cells used in the previ- 
ous question, but instead of pulse-labeling them with [?H]thymi- 
dine, you labeled them continuously for 20 hours. Plot a graph 
showing the amount of radioactive DNA that would be present 
in the culture over the 20-hour period. What would be the mini- 
mum amount of time needed to ensure that all cells had some 
incorporated label? How could you determine the length of the 
cell cycle in this culture without use of a radioactive label? 


5. Fusing G,- and S-phase cells produces different results from fus- 
ing a Gp-phase cell with one in S. What would you expect this 
difference to be and how can it be explained? (Hint: Look back 
at Figure 13.20, which shows that the initiation of replication 
requires formation of a prereplication complex, which can only 
form in G;.) 


6. Figure 14.6 shows the effect on the cell cycle of mutations in the 
genes that encode Wee! and Cdc25. The kinase CAK was identi- 
fied biochemically rather than genetically (i.e., by isolation of 
mutant cells). What phenotype would you expect in a yeast cell 
carrying a temperature-sensitive CAK mutation, after raising the 


recombination (i.e., a crossover) (step 7). The decision as to whether 
a recombinational interaction will result in a crossover or a noncross- 
over is thought to occur long before the stage when the double 
Holliday junction is actually resolved. Crossovers, which represent 
the fusion of a maternal and paternal chromosome (step 7), develop 
into the chiasmata required to hold the homologues together during 
meiosis I (Figure 14.44). In mammals, the number of noncrossovers 
greatly outnumbers the number of crossovers. Moreover, as noted on 
page 402, crossovers occur in certain regions of the genome (recom- 
bination “hotspots”) with a much higher frequency than in other 
regions. These recombination hotspots are predicted to be binding 
sites for a protein called PRDM9, which then directs recombination 
using a mechanism that is not yet well understood 


REVIEW | 


1. What is the role of DNA strand breaks in genetic 
recombination? 
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temperature of the culture medium in early G,? in late Gy? Why 
might it be different depending upon the stage at which the 
temperature was raised? 


7. Give four distinct mechanisms by which a Cdk can be in activated. 


8. A syncytium is a “cell” that contains more than one nucleus; 
examples are a skeletal muscle fiber and a blastula of a fly 
embryo. These two types of syncytia arise by very different 
pathways. What two mechanisms can you envision that could 
lead to formation of syncytia? What does this tell you about the 
relationship between mitosis and cytokinesis? 


9. How could you determine experimentally if the microtubules of 
the polar microtubules were in a state of dynamic flux during 
anaphase? Knowing the events that occur at this stage, what 
would you expect to see? 


10. If you were to add [?H]thymidine to a cell as it underwent replica- 
tion (S phase) prior to beginning meiosis, what percentage of 
the chromosomes of the gametes produced would be labeled? 
f one of these gametes (a sperm) were to fertilize an unlabeled 
egg, what percentage of the chromosomes of the two-cell stage 
would be labeled? 


11. If the haploid number of chromosomes in humans is 23 and the 
amount of nuclear DNA in a sperm is 1C, how many chromo- 
somes does a human cell possess in the following stages: meta- 
phase of mitosis, prophase | of meiosis, anaphase | of meiosis |, 
prophase II of meiosis, anaphase II of meiosis? How many chro- 
matids does the cell have at each of these stages? How much 
DNA (in terms of numbers of C) does the cell have at each of 
these stages? 


12. Plot the amount of DNA in the nucleus of a spermatogonia from 
the G, stage prior to the first meiotic division through the com- 
pletion of meiosis. Label each of the major stages of the cell 
cycle and of meiosis on the graph. 


13; 
14. 
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16. 


17. 


How many centrioles does a cell have at metaphase of mitosis? 


Suppose you were told that most cases of trisomy result from 
aging of an egg in the oviduct as it awaits fertilization. What type 
of evidence could you obtain from examining spontaneously 
aborted fetuses that would confirm this suggestion? How would 
this fit with data that has already been collected? 


Suppose you incubate a meiotic cell in [’H]thymidine between 
the leptotene and zygotene stages, then you fix the cell during 
pachytene and prepare an autoradiograph. You find that the chi- 
asmata are sites of concentrations of silver grains. What does 
this tell you about the mechanism of recombination? 


What type of phenotype would you expect of a cell whose Cdc20 
polypeptide was mutated so that (1) it was unable to bind to 
Mad2, or (2) it was unable to bind to the other subunits of the 
APC, or (3) it failed to dissociate from the APC at the end of 
anaphase? 


Assume for a moment that crossing-over did not occur. Would 
you agree that you received half of your chromosomes from 
each parent? Would you agree that you received one-quarter of 


18. 


19: 


20. 


your chromosomes from each grandparent? Would the answer 
to these questions change if you allowed for crossing-over to 
have occurred? 


Fetuses whose cells are triploid, that is, contain three full sets of 
chromosomes, develop to term and die as infants, whereas 
fetuses with individual chromosome trisomies tend not to fare as 
well. How can you explain this observation? 


What type of phenotype would you expect of a fission yeast cell 
whose Cdk subunit was lacking each of the following residues as 
the result of mutation: Tyr 15, Thr 161? 


It was noted on page 549 that centrosome duplication and DNA 
synthesis are both initiated by cyclin E-Cdk2, which becomes 
active at the end of G;. A recent study found that if cyclin E-Cdk2 
is activated at an earlier stage, such as the beginning of Gy, that 
centrosome duplication begins at that point in the cell cycle, but 
that DNA replication is not initiated until S phase would normally 
begin. Provide a hypothesis to explain why DNA synthesis does 
not begin as well. You might look back at Figure 13.20 for further 
information. 
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The Hawaiian bobtail squid maintains a symbiotic relationship with 
quorum-sensing bacteria. When present at high concentrations, the 
bacteria Vibrio fischeri produce light that allows the squid to hunt for prey 
without casting a shadow. 


Source: MatthiasOrmestad. 


15.7 The Specificity of G Protein- 


QUORUM SENSING 


Florting in shallow coastal waters of the Pacific Ocean, the 
Hawaiian bobtail squid uses a clever tactic to prevent 
frightening off would-be prey. As it hovers above the ocean 
floor on moonlit nights, the squid produces a glow so that its 
silhouette is obscured when viewed from below. The light 
produced by the bobtail squid is a form of bioluminescence, 
where light is produced and emitted by a living organism. 
The bluish glow of the Hawaiian bobtail is actually produced 
by symbiotic bacteria, Aliivibrio fischeri, which are captured 
and maintained in a highly specialized light organ in the 
squid’s mantle. Interestingly, A. fischeri only fluoresces in 
crowded conditions, such as in the squid's light organ. When 
widely dispersed, the bacteria cease to produce light. 


The fluorescence of A. fischeri is an example of a behavior 
known as quorum sensing, in which single-celled organisms 
communicate with one another using chemical messages. 
For years after this phenomenon was first discovered, 
quorum sensing was thought to happen only in rare, isolated 
instances. In recent years, however, numerous studies have 
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shown that nearly all bacteria appear to produce and release 
signaling molecules in order to coordinate a variety of 
activities within and across species. Secretion of virulence 
factors, formation and maintenance of biofilms, fluores- 
cence, and even programmed cell death are all behaviors 
that can be activated by an ever-growing list of bacterial 
signaling molecules. 


The mechanisms by which signaling occurs vary widely 
between species. Gram-positive bacteria, for example, 
secrete signaling molecules that are typically short, modified 
peptides. When found in high concentrations, these pep- 
tides can bind to cognate receptors on the cell surface of 


15.1 The Basic Elements 
of Cell Signaling Systems 


The English poet John Donne expressed his belief in the interde- 
pendence of humans in the phrase “No man is an island” The same 
can be said of the cells that make up a complex multicellular organ- 
ism. Most cells in a plant or animal are specialized to carry out one 
or more specific functions. Many biological processes require vari- 
ous cells to work together and to coordinate their activities. To make 
this possible, cells have to communicate with each other, which is 
accomplished by a process called cell signaling. Cell signaling 
makes it possible for cells to respond in an appropriate manner to a 
specific environmental stimulus. 

Cell signaling affects virtually every aspect of cell structure and 
function, which is one of the primary reasons that this chapter 
appears near the end of the book. On one hand, an understanding of 
cell signaling requires knowledge about other types of cellular activ- 
ity. On the other hand, insights into cell signaling can tie together a 
variety of seemingly independent cellular processes. Cell signaling is 
also intimately involved in the regulation of cell growth and division. 
This makes the study of cell signaling crucially important for under- 
standing how a cell can lose the ability to control cell division and 
develop into a malignant tumor. 

It may be helpful to begin the discussion of this complex subject 
by describing a few of the general features that are shared by most 
signaling pathways. Cells usually communicate with each other 
through extracellular messenger molecules. Extracellular 
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other bacteria, turning on a signaling pathway that leads 

to the transcription of genes and the production of pro- 
teins that alter the bacterium’s behavior. In contrast, gram- 
negative bacteria synthesize and release signaling molecules 
in the form of small molecules. These molecules are able to 
freely diffuse across the membrane and bind directly to 
cytoplasmic transcription factors. A. fischeri, which is a 
gram-negative bacterium, produces a small molecule called 
AHL (N-acyl homoserine lactone) that it releases into the 
environment. At high enough cell densities, AHL binds to a 
transcription factor, eventually leading to the production of 
luminescence genes and the production of light. 


messengers can travel a short distance and stimulate cells that are in 
close proximity to the origin of the message, or they can travel 
throughout the body, potentially stimulating cells that are far away 
from the source. In the case of autocrine signaling, the cell that is 
producing the messenger expresses receptors on its surface that can 
respond to that messenger (FIGURE 15.1a). Consequently, cells 
releasing the message will stimulate (or inhibit) themselves. During 
paracrine signaling (Figure 15.1b), messenger molecules travel only 
short distances through the extracellular space to cells that are in 
close proximity to the cell that is generating the message. Paracrine 
messenger molecules are usually limited in their ability to travel 
around the body because they are inherently unstable, or they are 
degraded by enzymes, or they bind to the extracellular matrix. 
Finally, during endocrine signaling, messenger molecules reach their 
target cells via passage through the bloodstream (Figure 15.1c). 
Endocrine messengers are also called hormones, and they typically 
act on target cells located at distant sites in the body. 

An overview of cellular signaling pathways is depicted in 
FIGURE 15.2. Cell signaling is initiated with the release of a messen- 
ger molecule by a cell that is engaged in sending messages to other 
cells in the body (step 1, Figure 15.2). The extracellular environments 
of cells contain hundreds of different informational molecules, rang- 
ing from small compounds (e.g., steroids and neurotransmitters ) to 
small, soluble protein hormones (e.g., glucagon and insulin) to huge 
glycoproteins bound to the surfaces of other cells. Cells can only 
respond to a particular extracellular message if they express receptors 
that specifically recognize and bind that messenger molecule (step 2). 


FIGURE 15.1 Autocrine (a), paracrine (b), and endocrine (c) types of intercellular signaling. 
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FIGURE 15.2 An overview of the major signaling pathways by which 
extracellular messenger molecules can elicit intracellular responses. Two 
different types of signal transduction pathways are depicted, one in which a 
signaling pathway is activated by a diffusible second messenger and another 
in which a signaling pathway is activated by recruitment of proteins to the 
plasma membrane. Most signal transduction pathways involve a combina- 
tion of these mechanisms. It should also be noted that signaling pathways 
are not typically linear tracks as depicted here, but are branched and 
interconnected to form a complex web. The steps are described in the text. 


The molecule that binds to the receptor is called a ligand. Different 
types of cells possess different complements of receptors that allow 
them to respond to different extracellular messengers. Even cells 
that share a specific receptor may respond very differently to the 
same extracellular messenger. Liver cells and smooth muscle cells 
both possess the B,-adrenergic receptor. Activation of this receptor 
by circulating adrenaline leads to glycogen breakdown in a liver cell 
and relaxation in a smooth muscle cell. These different outcomes 
following interaction with the same initial stimulus can be traced to 
different intracellular proteins that become engaged in the response 
in these two types of cells. Thus the type of activities in which a cell 
engages depends both on the stimuli that it receives and the intracel- 
lular machinery that it possesses at that particular time in its life. 

In most cases, the extracellular messenger molecule binds to a 
receptor at the outer surface of the responding cell. This interaction 
induces a conformational change in the receptor that causes the signal 
to be relayed across the membrane to the receptor’s cytoplasmic 
domain (step 3, Figure 15.2). Once it has reached the inner surface of 


the plasma membrane, there are two major routes by which the signal 
is transmitted into the cell interior, where it elicits the appropriate 
response. The particular route taken depends on the type of receptor 
that is activated. In the following discussion, we will focus on these two 
major routes of signal transduction, but keep in mind there are other 
ways that extracellular signals can have an impact on a cell. For exam- 
ple, we saw in Section 4.18 how neurotransmitters act by opening 
plasma membrane ion channels and in Figure 12.47 how steroid hor- 
mones diffuse through the plasma membrane and bind to intracellular 
receptors. In the two major routes discussed in this chapter: 


@ One type of receptor (Section 15.6) transmits a signal from its 
cytoplasmic domain to a nearby enzyme (step 4), which gener- 
ates asecond messenger (step 5). Because it brings about 
(effects) the cellular response by generating a second messenger, 
the enzyme responsible is referred to as an effector. Second 
messengers are small substances that typically activate (or 
inactivate) specific proteins. Depending on its chemical struc- 
ture, a second messenger may diffuse through the cytosol or 
remain embedded in the lipid bilayer of a membrane. 


e Another type of receptor (Section 15.10) transmits a signal by 
transforming its cytoplasmic domain into a recruiting station for 
cellular signaling proteins (step 4a). Proteins interact with one 
another, or with components of a cellular membrane, by means 
of specific types of interaction domains, such as the SH3 domain 
discussed on page 606. 


Whether the signal is transmitted by a second messenger or by 
protein recruitment, the outcome is similar; a protein that is posi- 
tioned at the top of an intracellular signaling pathway is activated 
(step 6, Figure 15.2). Signaling pathways are the information super- 
highways of the cell. Each signaling pathway consists of a series of 
distinct proteins that operate in sequence (step 7). Most “signaling 
proteins” are constructed of multiple domains, which allow them to 
interact in a dynamic way with a number of different partners, either 
simultaneously or sequentially. This type of modular construction is 
illustrated by the Grb2 and IRS-1 proteins depicted in Figures 15.20 
and 24, respectively. Unlike Grb2 and IRS-1, which function exclu- 
sively in mediating protein-protein interactions, many signaling 
proteins also contain catalytic and/or regulatory domains that give 
them a more active role in a signaling pathway. 

Each protein in a signaling pathway typically acts by altering the 
conformation of the subsequent (or downstream) protein in the 
series, an event that activates or inhibits that protein (FIGURE 15.3). 
It should come as no surprise after reading about other topics in cell 
biology that alterations in the conformation of signaling proteins are 
often accomplished by protein kinases and protein phosphatases 
that, respectively, add or remove phosphate groups from other pro- 
teins (Figure 15.3). The human genome encodes more than 500 dif- 
ferent protein kinases and approximately 150 different protein 
phosphatases. Whereas protein kinases typically work as a single 
subunit, many protein phosphatases contain a key regulatory sub- 
unit that determines substrate specificity. As a result, a single phos- 
phatase catalytic subunit can form a host of different enzymes that 
remove phosphate groups from different protein substrates. 

Most protein kinases transfer phosphate groups to serine or 
threonine residues of their protein substrates, but a very important 
group of kinases (roughly 90 in humans) phosphorylates tyrosine 
residues. Some protein kinases and phosphatases are soluble cyto- 
plasmic proteins; others are integral membrane proteins. Many 
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FIGURE 15.3 Signal transduction pathway consisting of protein 
kinases and protein phosphatases whose catalytic actions change 

the conformations, and thus the activities, of the proteins they modify. 
In the example depicted here, protein kinase 2 is activated by protein 
kinase 1. Once activated, protein kinase 2 phosphorylates protein kinase 3, 
activating the enzyme. Protein kinase 3 then phosphorylates a transcription 
factor, increasing its affinity for a site on the DNA. Binding of a transcription 
factor to the DNA affects the transcription of the gene in question. Each of 
these activation steps in the pathway is reversed by a phosphatase. 


kinases are present in the cell in a self-inhibited state. Depending on 
the particular kinase, these enzymes can be activated in the cell by 
covalent modification or by interactions with other proteins, small 
molecules, or membrane lipids. It is remarkable that, even though 
thousands of proteins in a cell contain amino acid residues with the 
potential of being phosphorylated, each protein kinase or phosphatase 
is able to recognize only its specific substrates and ignore all of 
the others. Some protein kinases and phosphatases have numerous 


FIGURE 15.4 A comparison in the frequency of tyrosine phosphorylation 
in two different types of breast cancer cells. The panels on the left side of 
the figure show the frequency of phosphotyrosine (pTyr) residues in certain 
proteins (named on the right side of the figure) in triple-negative breast 
cancer cell lines. Triple-negative cells do not express three major molecular 
signatures of many breast cancer cells—estrogen receptor, progesterone 
receptor, and the growth factor receptor HER2. The frequency of pTyr 
residues in breast cancer cells that express these three signature proteins is 
shown in the panels on the right. The frequency of pTyr residues in a given 
protein in a given cell line is indicated by the intensity of the red shading of 
the box (see the key at the bottom of the figure). The names of the cell 
lines tested are given along the top of the figure. It is evident that the 
triple-negative cells have a much greater level of tyrosine phosphorylation 
than the other breast cancer cells. This may correlate with the loss of a 
particular protein tyrosine phosphatase activity (PTPN12) in many of the 
triple-negative cancers. 

Source: From J. G. Albeck and J. S. Brugge, from data by Ting-Lei Gu of 
Cell Signaling Technology, Cell 144:639, 2011. Reprinted with permission 
from Elsevier. 
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dephosphorylate only a single amino acid residue of a single protein 
substrate. Many of the protein substrates of these enzymes are 
enzymes themselves—most often other kinases and phosphatases— 
but the substrates also include ion channels, transcription factors, 
and various types of regulatory proteins. It is thought that at least 
half of all transmembrane and cytoplasmic proteins are phosphoryl- 
ated at one or more sites. Protein phosphorylation can change pro- 
tein behavior in several different ways. Phosphorylation can activate 
or inactivate an enzyme, it can increase or decrease protein-protein 
interactions, it can induce a protein to move from one subcellular 
compartment to another, or it can act as a signal that initiates protein 
degradation. Large-scale proteomic approaches have been employed 
(Section 2.15) to identify the substrates of various protein kinases 
and the specific residues that are phosphorylated in various tissues. 
Studies on vertebrate proteomes have identified more than 20,000 
phosphoproteins harboring over 200,000 sites of phosphorylation. 
The widespread occurrence of protein phosphorylation and its pre- 
sumed importance is seen in FIGURE 15.4, which shows the marked 
difference in the frequency of tyrosine phosphorylation in certain 
proteins from two different types of cancer cells. The primary 
challenge is to understand the roles of these diverse posttranslational 
modifications in the activities of different cell types. 

Signals transmitted along such signaling pathways ultimately 
reach target proteins (step 8, Figure 15.2) involved in basic cellular 
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response initiated by the target protein may involve a change in gene 
expression, an alteration of the activity of metabolic enzymes, a 
reconfiguration of the cytoskeleton, an increase or decrease in cell 
mobility, a change in ion permeability, activation of DNA synthesis, 
or even the death of the cell. Virtually every activity in which a cell is 
engaged is regulated by signals originating at the cell surface. This 
overall process in which information carried by extracellular mes- 
senger molecules is translated into changes that occur inside a cell is 
referred to as signal transduction. 

Finally, signaling has to be terminated. This is important 
because cells have to be responsive to additional messages that they 
may receive. The first order of business is to eliminate the extracel- 
lular messenger molecule. To do this, certain cells produce extracel- 
lular enzymes that destroy specific extracellular messengers. In other 
cases, activated receptors are internalized (Section 8.17). Once inside 
the cell, the receptor may be degraded together with its ligand, 
which can leave the cell with decreased sensitivity to subsequent 
stimuli. 


REVIEW | 


1. What is meant by the term signal transduction? What 
are some of the steps by which signal transduction 
can occur? 


2. What is a second messenger? Why do you suppose it 
is called this? 


oe 


15.2 A Survey of Extracellular 
Messengers and Their Receptors 


A large variety of molecules can function as extracellular carriers of 
information. These include 


e Amino acids and amino acid derivatives. Examples include 
glutamate, glycine, acetylcholine, epinephrine, dopamine, and 
thyroid hormone. These molecules act as neurotransmitters and 
hormones. 


e@ Gases, such as NO and CO. 


e Steroids, which are derived from cholesterol. Steroid hormones 
regulate sexual differentiation, pregnancy, carbohydrate 
metabolism, and excretion of sodium and potassium ions. 


e Eicosanoids, which are nonpolar molecules containing 20 
carbons that are derived from a fatty acid named arachidonic 
acid. Eicosanoids regulate a variety of processes including pain, 
inflammation, blood pressure, and blood clotting. Several 
over-the-counter drugs that are used to treat headaches and 
inflammation inhibit eicosanoid synthesis. 


e A wide variety of polypeptides and proteins. Some of these are 
present as transmembrane proteins on the surface of an 
interacting cell (Section 7.7). Others are part of, or associate 
with, the extracellular matrix. Finally, a large number of proteins 
are excreted into the extracellular environment where they are 
involved in regulating processes such as cell division, differentia- 
tion, the immune response, or cell death and cell survival. 


Extracellular signaling molecules are usually, but not always, 
recognized by specific receptors that are present on the surface of the 
responding cell. As illustrated in Figure 15.2, receptors bind their 
signaling molecules with high affinity and translate this interaction 
at the outer surface of the cell into changes that take place on the 
inside of the cell. The receptors that have evolved to mediate signal 
transduction are indicated below. 


e G protein-coupled receptors (GPCRs) are a huge family of 
receptors that contain seven transmembrane «@ helices. These 
receptors translate the binding of extracellular signaling molecules 
into the activation of GTP-binding proteins. GTP-binding 
proteins (or G proteins) were discussed in connection with 
vesicle budding and fusion in Chapter 8, microtubule dynamics 
in Chapter 9, protein synthesis in Chapters 8 and 11, and 
nucleocytoplasmic transport in Chapter 12. In the present 
chapter, we will explore their role in transmitting messages along 
“cellular information circuits.” 


e Receptor protein-tyrosine kinases (RTKs) represent a second 
class of receptors that have evolved to translate the presence of 
extracellular messenger molecules into changes inside the cell. 
Binding of a specific extracellular ligand to an RTK usually 
results in receptor dimerization followed by activation of the 
receptor’s protein-kinase domain, which is present within its 
cytoplasmic region. Upon activation, these protein kinases 
phosphorylate specific tyrosine residues of cytoplasmic substrate 
proteins, thereby altering their activity, their localization, or their 
ability to interact with other proteins within the cell. 


e Ligand-gated channels represent a third class of cell-surface 
receptors that bind to extracellular ligands. The ability of these 
proteins to conduct a flow of ions across the plasma membrane is 
regulated directly by ligand binding. A flow of ions across the 
membrane can result in a temporary change in membrane 
potential, which will affect the activity of other membrane 
proteins, for instance, voltage-gated channels. This sequence of 
events is the basis for formation of a nerve impulse (Section 4.16). 
In addition, the influx of certain ions, such as Ca”*, can change 
the activity of particular cytoplasmic enzymes. As discussed in 
Section 4.18, one large group of ligand-gated channels functions 
as receptors for neurotransmitters. 


e Steroid hormone receptors function as ligand-regulated transcrip- 
tion factors. Steroid hormones diffuse across the plasma membrane 
and bind to their receptors, which are present in the cytoplasm. 
Hormone binding results in a conformational change that causes 
the hormone-receptor complex to move into the nucleus and bind 
to elements present in the promoters or enhancers of hormone- 
responsive genes (see Figure 12.47). This interaction gives rise to 
an increase or decrease in the rate of gene transcription. 


e Finally, there are a number of other types of receptors that act by 
unique mechanisms. Some of these receptors, for example, the 
B- and T-cell receptors that are involved in the response to 
foreign antigens, associate with known signaling molecules such 
as cytoplasmic protein-tyrosine kinases. We will concentrate in 
this chapter on the GPCRs and RTKs. 


REVIEW | 


1. Provide examples of four different types of molecules 
that can act as extracellular messengers. 


Effector 


GDP G protein 


(a) or GTP (b) 


Rhodopsin (inactive) 


Rhodopsin (active) 


11-cis 
retinal 


G protein G protein 


FIGURE 15.5 The membrane-bound machinery for transducing signals by means of a seven-transmembrane receptor and a heterotrimeric 

G protein. (a) Receptors of this type, including those that bind epinephrine and glucagon, contain seven membrane-spanning helices. When bound to their 
ligand, the receptor interacts with a trimeric G protein, which activates an effector, such as adenylyl cyclase. As indicated in the figure, the a and Y subunits 
of the G protein are linked to the membrane by lipid groups that are embedded in the lipid bilayer. (Note: Many GPCRs may be active as complexes of two 
or more receptor molecules.) (b) A model depicting the activation of the GPCR rhodopsin based on X-ray crystallographic structures. On the left, rhodopsin 
is shown in its inactive (dark-adapted) conformation together with an unbound heterotrimeric G protein (called transducin). When the retinal cofactor (shown 
in in pink embedded in the rhodopsin proteins) absorbs a photon, it undergoes an isomerization reaction (from a cis to a trans form), which leads to the 
disruption of an ionic linkage between residues on the third and sixth transmembrane helix of the protein. This event in turn leads to a change in conforma- 
tion of the protein, including the outward tilt and rotation of the sixth transmembrane helix (red curved arrow), which exposes a binding site for the Gy 
subunit of the G protein. The rhodopsin molecule on the right is shown in the active conformation with a portion of the G, subunit bound to the receptor's 


cytoplasmic face. 


Source: (b) From Thue W. Schwartz and Wayne L. Hubbell, Nature 455:473, 2008. Reprinted by permission from Macmillan Publishers Ltd. 


15.3 Signal Transduction 
by G Protein-Coupled Receptors 


G protein-coupled receptors (GPCRs) are so named because 
they interact with G proteins, as discussed below. Members of the 
GPCR superfamily are also referred to as seven-transmembrane 
(7TM) receptors because they contain seven transmembrane helices 
(FIGURE 15.5b). Thousands of different GPCRs have been identified 
in organisms ranging from yeast to flowering plants and mammals 
that together regulate an extraordinary spectrum of cellular pro- 
cesses. In fact, GPCRs constitute the single largest superfamily of 
proteins encoded by animal genomes. Included among the natural 
ligands that bind to GPCRs are a diverse array of hormones (both 
plant and animal), neurotransmitters, opium derivatives, chemoat- 
tractants (e.g., molecules that attract phagocytic cells of the immune 


system), odorants and tastants (molecules detected by olfactory and 
gustatory receptors eliciting the senses of smell and taste), and 
photons. A list of some of the ligands that operate by means of this 
pathway and the effectors through which they act is provided in 
Table 15.1. 


Receptors 


G protein-coupled receptors normally have the following topology. 
Their amino-terminus is present on the outside of the cell, the seven 
a helices that traverse the plasma membrane are connected by loops 
of varying length, and the carboxyl-terminus is present on the inside 
of the cell (Figure 15.5b). There are three loops present on the out- 
side of the cell that, together, form the ligand-binding pocket, whose 
structure varies among different GPCRs. There are also three loops 


TABLE 15.1 Examples of Physiologic Processes Mediated by GPCRs and Heterotrimeric G Proteins 
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Stimulus Receptor Effector Physiologic response 


Glycogen breakdown 

Behavioral sensitization and learning in 
Aplysia 

Visual excitation 

Secretion 

Chemotaxis 

Slowing of pacemaker activity 


Adenylyl cyclase 
Adenylyl cyclase 


Epinephrine 
Serotonin 


B-Adrenergic receptor 
Serotonin receptor 


Light 

IgE-antigen complexes 
f-Met Peptide 
Acetylcholine 


cGMP phosphodiesterase 
Phospholipase C 
Phospholipase C 
Potassium channel 


Rhodopsin 

Mast cell IgE receptor 
Chemotactic receptor 
Muscarinic receptor 


Adapted from L. Stryer and H. R. Bourne, reproduced with permission from the Annual Review of Cell Biology, Vol 2, copyright 1986, by Annual Reviews Inc. Annual Review 
of Cell Biology by Annual Reviews, Inc. Reproduced with permission of Annual Reviews, in the format republish in a book via Copyright Clearance Center. 
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provide binding sites for intracellular signaling proteins. It is very 
difficult, for a number of technical reasons, to prepare crystals of 
unmodified GPCRs that are suitable for X-ray crystallographic anal- 
ysis. For a number of years, rhodopsin was the only member of 
the superfamily to have its X-ray crystal structure determined. 
Rhodopsin has an unusually stable structure for a GPCR, owing to 
the fact that its ligand (a retinal group) is permanently bound to the 
protein and the protein molecule can only exist in a single inactive 
conformation in the absence of a stimulus (i.e., in the dark). 
Beginning in 2007, as a result of years of effort by a number of 
research groups, a flurry of GPCR crystal structures appeared in the 
literature. For the most part, these structures revealed the GPCR in 
the inactive state, but more recent reports have described the struc- 
tures of several GPCRs in active and intermediate states 

The first X-ray crystal structure of an active GPCR with its 
bound G protein was solved by Brian Kobilka and colleagues at 
Stanford University. The inactive conformation of the GPCR is sta- 
bilized by noncovalent interactions between specific residues in the 
transmembrane « helices. Ligand binding disturbs these interac- 
tions, thereby causing the receptor to assume an active conforma- 
tion. This requires rotations and shifts of the transmembrane 
oa helices relative to each other. Because they are attached to the cyto- 
plasmic loops, rotation or movement of these transmembrane 
a helices causes changes in the conformation of the cytoplasmic 
loops. This in turn leads to an increase in the affinity of the receptor 
for a G protein that is present on the cytoplasmic surface of the plasma 
membrane (Figure 15.5b). As a consequence, the ligand-bound 
receptor forms a receptor-G protein complex (FIGURE 15.6, step 1). 
The interaction with the receptor induces a conformational change in 
the a subunit of a G protein, causing the release of GDP, which is fol- 
lowed by binding of GTP (step 2). While in the activated state, a single 
receptor can activate a number of G protein molecules, providing a 
means of signal amplification (discussed further on page 600). 


G Proteins 


Heterotrimeric G proteins were discovered, purified, and character- 
ized by Martin Rodbell and his colleagues at the National Institutes 
of Health and Alfred Gilman and colleagues at the University of 
Virginia. Their studies are discussed in the Experimental Pathways. 


FIGURE 15.6 The mechanism of receptor-mediated activation (or 
inhibition) of effectors by means of heterotrimeric G proteins. In step 1, 
the ligand binds to the receptor, altering its conformation and increasing its 
affinity for the G protein to which it binds. In step 2, the G, subunit releases 
its GDP, which is replaced by GTP. In step 3, the G, subunit dissociates 
from the Gy, complex and binds to an effector (in this case adenylyl 
cyclase), activating the effector. The Gg, dimer may also bind to an effector 
(not shown), such as an ion channel or an enzyme. In step 4, activated 
adenylyl cyclase produces cAMP. In step 5, the GTPase activity of G, 
hydrolyzes the bound GTP, deactivating G,. In step 6, G,, reassociates with 
Gp, reforming the trimeric G protein, and the effector ceases its activity. In 
step 7, the receptor has been phosphorylated by a GRK, and in step 8 the 
phosphorylated receptor has been bound by an arrestin molecule, which 
inhibits the ligand-bound receptor from activating additional G proteins. 
The receptor bound to arrestin is likely to be taken up by endocytosis. 


These proteins are referred to as G proteins because they bind 
guanine nucleotides, either GDP or GTP. They are described as 
heterotrimeric because all of them consist of three different polypeptide 
subunits, called a, B, and y. This property distinguishes them from 
small, monomeric G proteins, such as Ras, which are discussed later 
in this chapter. Heterotrimeric G proteins are held at the plasma 
membrane by lipid chains that are covalently attached to the a and y 
subunits (Figure 15.5a). 

The guanine nucleotide-binding site is present on the G, subunit. 
Replacement of GDP by GTP, following interaction with an acti- 
vated GPCR, results in a conformational change in the G, subunit. 
In its GTP-bound conformation, the G, subunit has a low affinity 
for Gay, leading to dissociation of the trimeric complex. Each disso- 
ciated G, subunit (with GTP attached) is free to activate an effector 
protein, such as adenylyl cyclase (Figure 15.6, step 3). In this case, 
activation of the effector leads to the production of the second mes- 
senger cAMP (step 4). Other effectors include phospholipase C-B 
and cyclic GMP phosphodiesterase (see below). Second messengers, 
in turn, activate one or more cellular signaling proteins. 

A G protein is said to be “on” when its a subunit is bound to 
GTP. G, subunits can turn themselves off by hydrolysis of GTP to 
GDP and inorganic phosphate (Pi) (Figure 15.6, step 5). This results 
in a conformational change causing a decrease in the affinity for the 
effector and an increase in the affinity for the By subunit. Thus, 
following hydrolysis of GTP, the G, subunit will dissociate from 
the effector and reassociate with the By subunit to reform the inac- 
tive heterotrimeric G protein (step 6). In a sense, heterotrimeric 
G proteins function as molecular timers. They are turned on by the 
interaction with an activated receptor and turn themselves off by 
hydrolysis of bound GTP after a certain amount of time has passed. 
While they are active, G, subunits can turn on downstream 
effectors. 

Heterotrimeric G proteins come in four flavors, G,, Gg, G;, and 
G42/13. This classification is based on the G, subunits and the effec- 
tors to which they couple. The particular response elicited by an 
activated GPCR depends on the type of G protein with which it 
interacts, although some GPCRs can interact with different G pro- 
teins and trigger more than one physiologic response. G, family 
members couple receptors to adenylyl cyclase. Adenylyl cyclase is 
activated by GTP-bound G, subunits. G, family members contain 
G,, subunits that activate PLCB. PLCB hydrolyzes phosphatidylinositol 
bisphosphate, producing inositol trisphosphate and diacylglycerol 
(page 598). Activated G, subunits function by inhibiting adenylyl 
cyclase. Gj/;3 members are less well characterized than the other 
G protein families although their inappropriate activation has been 
associated with excessive cell proliferation and malignant transfor- 
mation. Following its dissociation from the G, subunit, the By 
complex also has a signaling function and it can couple to a number 
of different types of effectors, including PLCB, K* and Ca”* ion 
channels, and adenylyl cyclase. 


Termination of the Response 


We have seen that ligand binding results in receptor activation. The 
activated receptors turn on G proteins, and G proteins turn on effec- 
tors. To prevent overstimulation, receptors have to be blocked from 
continuing to activate G proteins. To regain sensitivity to future 
stimuli, the receptor, the G protein, and the effector must all be 
returned to their inactive state. Desensitization, the process that 


blocks active receptors from turning on additional G proteins, takes 
place in two steps. In the first step, the cytoplasmic domain of the 
activated GPCR is phosphorylated by a specific type of kinase, called 
G protein-coupled receptor kinase (GRK ) (Figure 15.6, step 7). GRKs 
form a small family of serine-threonine protein kinases that specifi- 
cally recognize activated GPCRs. 

Phosphorylation of the GPCR sets the stage for the second step, 
which is the binding of proteins, called arrestins (Figure 15.6, step 8). 
Arrestins form a small family of proteins that bind to GPCRs and 
compete for binding with heterotrimeric G proteins. As a conse- 
quence, arrestin binding prevents the further activation of additional 
G proteins. This action is termed desensitization because the cell 
stops responding to the stimulus despite the continued presence of 
the stimulus on the outer surface of the cell. Desensitization is one of 
the mechanisms that allow a cell to respond to a change in its envi- 
ronment, rather than continuing to “fire” endlessly in the presence of 
an unchanging environment. The importance of desensitization is 
illustrated by the observation that mutations that interfere with 
phosphorylation of rhodopsin by a GRK lead to the death of the 
photoreceptor cells in the retina. This type of retinal cell death is 
thought to be one of the causes of blindness resulting from the dis- 
ease retinitis pigmentosa. 

Arrestins can be described as protein hubs in that they are capa- 
ble of binding to a variety of different proteins involved in different 
intracellular processes. While they are bound to phosphorylated 
GPCRs (step 1, FIGURE 15.7), arrestin molecules are also capable of 
binding to AP2 adaptor molecules that are situated in clathrin- 
coated pits (Section 8.17). The interaction between bound arrestin 
and clathrin-coated pits (step 2) promotes the uptake of phosphoryl- 
ated GPCRs into the cell by endocytosis. Depending on the circum- 
stances, receptors that have been removed from the surface by 
endocytosis can participate in several alternative outcomes. In some 
cases, receptors travel along the endocytic pathway into endosomes 
(Section 8.17), where the associated arrestin molecules serve as a 
scaffold for the assembly of various cytoplasmic signaling com- 
plexes. The MAPK pathway, which is discussed at length later in the 
chapter, is thought to be activated by arrestin-bound GPCRs local- 
ized within endosomes (step 3). The discovery of “signaling 
endosomes” came as a surprise to researchers in the field of cell sign- 
aling who had been working under the assumption that GPCRs (and 
RTKs as well) were only capable of signal transduction when they 
resided at the cell surface. Now it appears that the signals transmit- 
ted from endosomes have different properties and physiological 
roles from those that arise from the plasma membrane. In a second 
outcome, internalized receptors may traffic from endosomes to lys- 
osomes where they are degraded (step 4). If receptors are degraded, 
the cells lose, at least temporarily, sensitivity for the ligand in ques- 
tion. Finally, according to a third scheme, the arrestin-bound GPCRs 
may be dephosphorylated and returned to the plasma membrane 
(step 5). If receptors are returned to the cell surface, the cells remain 
sensitive to the ligand (they are said to be resensitized). 

Signaling by the activated G,, subunit is terminated by a less 
complex mechanism: The bound GTP molecule is simply hydro- 
lyzed to GDP (step 5, Figure 15.6). Thus, the strength and duration 
of the signal are determined in part by the rate of GTP hydrolysis by 
the G, subunit. G, subunits possess a weak GTPase activity, which 
allows them to slowly hydrolyze the bound GTP and inactivate 
themselves. Termination of the response is accelerated by regulators 
of G protein signaling (RGSs). The interaction with an RGS protein 
increases the rate of GTP hydrolysis by the G, subunit. Once the 
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ERK Other pathways 


GTP is hydrolyzed, the G,.-GDP reassociates with the G, subunits 
to reform the inactive trimeric complex (step 6) as discussed above. 
This returns the system to the resting state. 

The mechanism for transmitting signals across the plasma 
membrane by G proteins is of ancient evolutionary origin and is 
highly conserved. This is illustrated by an experiment in which yeast 
cells were genetically engineered to express a receptor for the mam- 
malian hormone somatostatin. When these yeast cells were treated 
with somatostatin, the mammalian receptors at the cell surface 
interacted with the yeast heterotrimeric G proteins at the inner 
surface of the membrane and triggered a response leading to prolif- 
eration of the yeast cells. 

The effects of certain mutations on the function of G protein- 
coupled receptors are discussed in the Human Perspective 
(Section 15.5). 


Bacterial Toxins 


Because G proteins are so important to the normal physiology of 
multicellular organisms, they provide excellent targets for bacterial 
pathogens. For example, cholera toxin (produced by the bacterium 
Vibrio cholerae) exerts its effect by modifying G, subunits and 


FIGURE 15.7 Arrestin-mediated internalization of GPCRs. 
Arrestin-bound GPCRs (step 1) are internalized when they are trapped 
in clathrin-coated pits that bud into the cytoplasm (step 2). As 
discussed in Section 8.8, clathrin-coated buds are transformed into 
clathrin-coated vesicles that deliver their contents, including the 
GPCRs, to endosomes. When present in the endosomes, arrestins can 
serve as scaffolds for the assembly of signaling complexes, including 
those that activate the MAPK cascade and the transcription factor ERK 
(step 3). Alternatively, the GPCRs can be delivered to lysosomes, 
where they are degraded (step 4), or they can be returned to the 
plasma membrane in a recycling endosome (step 5), where they can 
then interact with new extracellular ligands (step 6). 

Source: From S. L. Ritter and R. A. Hall, Nature Reviews MCB 10:820, 
2009, Box 1b. Nature Reviews Molecular Cell Biology by Nature 
Publishing Group. Reproduced with permission of Nature Publishing 
Group in the format reuse in a book/textbook via Copyright 
Clearance Center. 


inhibiting their GTPase activity in the cells of the intestinal 
epithelium. As a result, adenylyl cyclase molecules remain in an 
activated mode, churning out cAMP, which causes the epithelial 
cells to secrete large volumes of fluid into the intestinal lumen. The 
loss of water associated with this inappropriate response often leads 
to death due to dehydration. 

Pertussis toxin is one of several virulence factors produced by 
Bordetella pertussis, a microorganism that causes whooping cough. 
Whooping cough is a debilitating respiratory tract infection seen in 
50 million people worldwide each year, causing death in about 350,000 
of these cases annually. Pertussis toxin also inactivates G, subunits, 
thereby interfering with the signaling pathway that leads the host to 
mount a defensive response against the bacterial infection. 


REVIEW | 


1. What is the role of G proteins in a signaling pathway? 


2. Describe the steps between the binding of a ligand 
such as glucagon to a seven transmembrane receptor 
and the activation of an effector, such as adenylyl 


cyclase. How is the response normally attenuated? 
e—_—__—_—_————? 


15.4 EXPERIMENTAL PATHWAYS 


The Discovery and Characterization of GTP-Binding Proteins 


During the late 1950s and 1960s, Earl Sutherland and his col- 
leagues at Case Western Reserve University discovered that 
certain hormones, such as epinephrine, act by binding to a 
specific receptor at the cell surface, which activates the enzyme 
adenylyl cyclase on the inner side of the membrane. Activation of 
adenylyl cyclase leads to the production of a second messenger, 
cyclic AMP, which diffuses into the cytoplasm and initiates the 


cell's response."*“"°"e4 in! By the end of the 1960s, the concept of 
the second messenger was well established, but little was known 
about the precise molecular mechanism that allowed a hormone 
(or other ligand) bound at the outside of the membrane to acti- 
vate adenylyl cyclase activity. One possibility considered likely 
was that the active site of the adenylyl cyclase was part of the 
hormone receptor itself. 


To learn more about the relationship between hormone 
receptors and adenylyl cyclase, Martin Rodbell and Lutz 
Birnbaumer at the National Institutes of Health began a series of 
experiments on the isolated plasma membranes of fat cells and 
liver cells.? One of the first questions they sought to answer was 
whether these cells, which respond to several different hormones 
by producing cAMP, possess a single adenylyl cyclase that is 
activated by all the various hormones or have separate adenylyl 
cyclases for each of the hormones to which they respond. Fat 
cells were chosen for these initial studies because they are 
readily isolated free of other cell types, they respond to several 
hormones that cause a rapid increase in intracellular levels of 
cAMP (stimulating enzymes involved in lipid degradation), and 
their plasma membranes can be isolated as “ghosts” by osmotic 
lysis. Six different hormones (ACTH, epinephrine, glucagon, TSH, 
LH, and prolactin) were found to stimulate adenylyl cyclase activity 
in isolated fat-cell ghosts. The dose response curves of the first 
three of these hormones acting separately are shown in FIGURE 1. 
As shown in Table 1, when these hormones were combined in 
sets of two or three, their effects on adenylyl cyclase activity were 
not additive, suggesting that each of these hormones stimu- 
lates the same population of adenylyl cyclase molecules. It was 
shown, however, that the different hormones interact with spa- 
tially distinct receptors. Together, these findings led Rodbell and 
Birnbaumer to propose that, even though the various hormone 
receptors stimulate a common population of adenylyl cyclase 
molecules, the receptors and the adenylyl cyclase effectors 
exist separately from one another in the plasma membranes of 
fat cells. 

In 1971, Rodbell and his colleagues published a series of 
papers on the stimulation of adenylyl cyclase in isolated plasma 
membranes of liver cells in response to glucagon and epineph- 
rine. First, they measured the binding of radioactively labeled 
glucagon molecules to receptors on isolated plasma membranes. 
Because ATP is a substrate of the adenylyl cyclase reaction, the 
effects of ATP on the binding of labeled glucagon were moni- 
tored, as were the effects of the three other common nucleoside 
triphosphates, UTP, CTP, and GTP. It was during these investiga- 
tions that the special properties of GTP were first noted.? One 
of the observed effects of the nucleoside triphosphates was to 
cause the dissociation of labeled glucagon that was bound to the 
isolated plasma membranes. Whereas ATP, UTP, and CTP did not 
affect dissociation of glucagon from the membranes at concen- 
trations less than 100mM, GTP stimulated dissociation at concen- 
trations as low as 0.05mM. A comparison of the effects of ATP 
and GTP in causing glucagon dissociation from the membrane 
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FIGURE 1 The effects of varying concentrations of ACTH, 
epinephrine, and glucagon on adenylyl cyclase activity of fat cell 
ghosts. The dashed line represents the activity in the absence of added 
hormones. (Adenylyl cyclase was initially named adenyl cyclase and 
later called adenylate cyclase.) 

Source: From L. Birnbaumer and M. Rodbell, J. Biol. Chem. 244:3478, 
1969. 


is shown in FIGURE 2. (Note that the concentration is plotted on 
a log scale, indicating the very large differences in effectiveness 
of the two triphosphates.) These results suggested to the authors 
that guanyl nucleotides induce a change in the state or properties 
of the glucagon receptor that decreases its affinity for glucagon. 
In addition, they found that liver membranes can hydrolyze GTP. 
In the last paper of the series, Rodbell and his colleagues 
addressed the question as to whether the effects of the guanyl 
nucleotides on the binding of glucagon bear any relationship to 
the actions of glucagon on the adenylyl cyclase system of the liver 
membranes.* As shown in FIGURE 3, GTP stimulated the basal 
activity of adenylyl cyclase and did so at concentrations as low as 
0.01mM. None of the other nucleotides possessed this activity. 
The authors concluded that guanyl nucleotides are required 
for the activation of adenylyl cyclase by glucagon. Subsequent 


TABLE 1 Effects of Combinations of Hormones, at Supramaximal Concentrations, on Adenylyl Cyclase Activity in Fat Cell Ghosts* 


Change in adenylyl cyclase activity due to hormones 


Experiment 1 (37°) 


Additions Found 


Calculated if additive 


Experiment 2 (30°) 


Found Calculated if additive 


ACTH 

Epinephrine 

Glucagon 

ACTH + epinephrine 
Epinephrine + glucagon 

ACTH + glucagon 

ACTH + epinephrine + glucagon 


0.57 + 0.02 
1.00 + 0.06 
0.32 + 0.01 
0.80 + 0.04 
0.99 + 0.05 
0.64 + 0.02 
0.85 + 0.04 


1.19 + 0.07 
1.79 + 0.11 
0.57 + 0.04 
2.04 + 0.12 
2.13 + 0.10 
1.33 + 0.06 
2.30 + 0.10 


*Adenylyl cyclase activity was measured at either 37° in absence of, or at 30° in presence of, ATP-regenerating system. The following concentrations of the hormones 
used, either individualy or when combine: ACTH, 400 yg per ml; epinephrine, 400 yg per ml; glucagon, 60 jg per ml. Values are the mean of triplicate determination 


+ standard deviation. 
Source: L.Birnbaumer and M. Rodbell, J. Biol. Chem. 244:3479, 1969. 


continued 
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FIGURE 2 Effects of varying concentrations of GTP and ATP 

on dissociation of [1251] glucagon from plasma membranes of 
liver cells. In this experiment, the membranes were preincubated for 
15 minutes in a medium containing labeled glucagon. Then, either 
GTP or ATP was added in the presence of a large excess of 
unlabeled glucagon. After 15 minutes of incubation in one or the other 
nucleoside triphosphate, samples were taken to measure the labeled 
glucagon remaining bound. The percentage of bound, labeled 
glucagon that dissociated during the incubation with ATP or GTP was 
calculated by comparing the bound radioactivity present at the 
beginning and at the end of the 15-minute incubation with the 
nucleoside triphosphate. 


Source: From M. Rodbell et al., J. Biol. Chem. 246:1873, 1971. 
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FIGURE 3 The effect of fluoride ion, glucagon, and a combination 
of glucagon and GTP on the formation of cyclic AMP by a 
preparation of isolated liver cell plasma membranes. The marked 
stimulation by GTP shown here was achieved with a GTP concentration 
of 0.01 mM. Fluoride ion was also found to be an effective activator of 
adenylyl cyclase and was used in numerous subsequent assays. 


Source: From M. Rodbell et al., J. Biol. Chem. 246:1879, 1971. 


studies in other laboratories showed that GTP enhances the 
response of adenylyl cyclase systems to other hormones, 
including peptide hormones, catecholamines (e.g., epinephrine), 
and prostaglandins. 

Nonhydrolyzable analogues of nucleoside triphosphates are 
useful because they allow researchers to distinguish between the 
effects of nucleotide binding from those of nucleotide hydro- 
lysis. When a nonhydrolyzable analogue of GTP, 5’-guanylyli- 
mododiphosphate, Gpp(NH)p, was incubated with liver plasma 
membranes, glucagon, ATP, and other ingredients necessary for 
adenylyl cyclase activation, Gpp(NH)o mimicked the stimula- 
tory effects of GTP. In fact, the nonhydrolyzable GTP analogue 
activated adenylyl cyclase even in the absence of hormones.° 
These results indicated that the binding of GTP, rather than its 
hydrolysis, played an important role in activating adenylyl cyclase. 
The primary difference between the effects of GTP and a nonhy- 
drolyzable GTP analogue is that the former stimulates adenylyl 
cyclase only transiently, whereas the latter stimulates the enzyme 
for a prolonged period. These findings led investigators to focus 
on the mechanism by which GTP is normally hydrolyzed by the 
adenylyl cyclase system. 

Inthe mid-1970s, a number of laboratories, including Rodbell’s, 
identified a GTPase located within the plasma membranes of var- 
ious cells. In one key study, Don Cassel and Zvi Selinger of the 
Hebrew University of Jerusalem found that addition of isopro- 
terenol (a catecholamine hormone similar to epinephrine) to a 
plasma membrane GTPase assay system caused a 30-70 percent 
increase in the hydrolysis of [°7P]GTP as measured by the release 
of °2Pi (FIGURE 4).° Cassel and Selinger proposed that the hydro- 
lysis of GTP by the GTPase serves to turn off the activated adenyly| 
cyclase, returning the enzyme to the basal inactive state. 

In a subsequent study, these same researchers made an 
important connection between the GTP-binding protein, as it 
came to be called, and the infamous cholera toxin. Cholera is 
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FIGURE 4 The time course of GTP hydrolysis in the presence and 
absence of the catecholamine isoproterenol by isolated turkey 
erythrocyte membranes. The effect of the hormone is seen in the 
lower curve, which plots the difference in GTP hydrolysis between the 
hormone-treated and control membrane preparations. 

Source: Reprinted from D. Cassel and Z. Selinger, Biochim. Biophys. 
Acta 452:543, 1976. Catecholamin estimulated GTPase activity in turkey 
erythrocyte membranes. Copyright 1976, with permission from Elsevier 
Science. 


a disease caused by a bacterial toxin that produces excessive 
diarrhea and accompanying water loss in infected individuals. 
During the early 1970s, a number of laboratories found that 
cholera toxin acts by stimulating adenylyl cyclase in the cells of 
the intestinal epithelium. It became apparent that the cholera 
toxin mimics the action of hormones by acting on one of the 
components commonly present in the adenylyl cyclase system. 
In 1977, Cassel and Selinger demonstrated that the toxin acted 
by inhibiting the GTPase activity of the membrane, which had the 
effect of maintaining the adenylyl cyclase in a prolonged stimu- 
lated state (FIGURE 5).’ 

The results of these and other experiments demonstrated the 
existence of a regulatory component of the hormone-adenylyl 
cyclase system that was activated by GTP binding and deacti- 
vated by GTP hydrolysis. Attention turned to several other ques- 
tions. Is the GTP-binding protein an integral part of the adenylyl 
cyclase or a separate component? If the GTP-binding protein is a 
separate component, what is its structure and how does it interact 
with the other major components of the system, the receptor and 
the adenylyl cyclase? 

The purification of the GTP-binding protein was accom- 
plished in 1980 by Alfred Gilman and his co-workers at the 
University of Virginia. Purification was achieved by extracting 
liver plasma membrane proteins in detergent and subjecting the 
mixture to six successive chromatographic procedures (Table 2).® 
As indicated in the table, the specific activity of the detergent 
extract increased approximately 2,000-fold during the purifica- 
tion procedure, which corresponds to a purification of 5,000- to 
10,000-fold from plasma membranes, or nearly 100,000-fold from 
total cellular protein. When the purified GTP-binding protein was 
subjected to SDS-PAGE, three distinct polypeptides were found 
to be present (relative molecular masses of 52, 45, and 35 kDa), 
indicating that the regulatory protein was a multisubunit com- 
plex. Of the three polypeptides identified in the initial study, the 
52 kDa polypeptide was present in much smaller and more var- 
iable amounts and ultimately was found not to be a member of 


Catecholamine stimulated GTPase, 
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Adenylate cyclase, 
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FIGURE 5 Effect of the cholera toxin on the catecholamine-stimulated 
GTPase (left) and adenylyl cyclase (right) activities at various GTP 
concentrations. Membranes pretreated in the absence (open circles) or 
presence (closed circles) of toxin were assayed for GTPase and adenylyl 
cyclase activities at the indicated GTP concentrations. In the presence of 
the toxin, the lower GTPase activity (left) leads to an elevated level of 
adenylyl cyclase activity (right). 

Source: From D. Cassel and Z. Selinger, Proc. Nat’! Acad. Sci. U.S.A. 
74:3309, 1977. 


TABLE 2 Purification of the Regulatory Component of Adenylyl 
Cyclase from Rabbit Liver* 


Total Specific 
units activity 
Protein (nmol/ Recovery (nmol/ 
Step (mg) min) (%) min mg) 


Chelate extract of 2020 4380 100 DD 
membranes 


DEAE-Sephacel 114 2930 67 26 
Ultrogel AcA 34 9.8 1820 42 
Heptylamine- 0.70 740 17 

Sepharose 
Hydroxyapatite 0.03 350 8.0 
GTP-Sepharose 0.18 290 6.6 
DEAE-Sephacel 0.039 150 35 


*Rabbit liver plasma membranes (8.6 g of protein) were extracted and purified. 
Activity was measured by the reconstitution of the fluorideactivated adenylyl 
cyclase activity. Specific activity is defined as the amount of reconstituted 
activity per amount of protein added to the reconstitution. Recoveries are 
cumulative through the preparation. 

Source: J. K. Northup et al., Proc. Natl. Acad. Sci. U.S.A. 77, 6518, 1980. 


the GTP-binding protein. The 45 kDa polypeptide became known 
as the a subunit and the 35 kDa polypeptide as the B subunit. 
A third polypeptide of 9 kDa (the Y subunit) was subsequently dis- 
covered. Of the three subunits, the 45 kDa a subunit was shown 
to be the polypeptide that contains the GTP-binding site. 

The purification of the GTP-binding protein set the stage 
for the analysis of the molecular and biochemical properties of 
the protein. As a result of these studies, Gilman proposed that 
the activation of adenylyl cyclase follows the dissociation of the 
GTP-binding protein into its subunits, and that the dissociated 
a subunit with a bound GTP is responsible for activating the 
cAMP-producing enzyme.”"? 
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15.5 THE HUMAN PERSPECTIVE 


Disorders Associated with G Protein-Coupled Receptors 


GPCRs represent the largest family of genes encoded by the 
human genome. Their importance in human biology is reflected 
by the fact that over one-third of all prescription drugs act as 
ligands that bind to this huge superfamily of receptors. A number 
of inherited disorders have been traced to defects in both GPCRs 
(FIGURE 1) and heterotrimeric G proteins (Table 1). Retinitis pig- 
mentosa (RP) is an inherited disease characterized by progres- 
sive degeneration of the retina and eventual blindness. RP can 
be caused by mutations in the gene that encodes rhodopsin, 
the visual pigment of the rods. Many of these mutations lead to 
premature termination or improper folding of the rhodopsin pro- 
tein and its elimination from the cell before it reaches the plasma 
membrane (Section 8.7). Other mutations may lead to the syn- 
thesis of a rhodopsin molecule that cannot activate its G protein 
and thus cannot pass the signal downstream to the effector. 

RP results from a mutation that leads to a loss of function of 
the encoded receptor. Many mutations that alter the structure of 
signaling proteins can have an opposite effect, leading to what 
is described as a “gain of function.” In one such case, mutations 
have been found to cause a type of benign thyroid tumor, called an 
adenoma. Unlike normal thyroid cells that secrete thyroid hormone 
only in response to stimulation by the pituitary hormone TSH, the 
cells of these thyroid adenomas secrete large quantities of thyroid 
hormone without having to be stimulated by TSH (the receptor 
is said to act constitutively). The TSH receptor in these cells con- 
tains an amino acid substitution that affects the structure of the 
third intracellular loop of the protein (FIGURE 1, mutations at sites 
5 or 6). As a result of the mutation, the TSH receptor constitutively 
activates a G protein on its inner surface, sending a continual 
signal through the pathway that leads not only to excessive thyroid 


Intracellular 


FIGURE 1 Two-dimensional representation of a “composite” 
transmembrane receptor showing the approximate sites of a number 
of mutations responsible for causing human diseases. Most of the 
mutations (numbers 1, 2, 5, 6, 7, and 8) result in constitutive stimulation 
of the effector, but others (3 and 4) result in blockage of the receptor's 
ability to stimulate the effector. Mutations at sites 1 and 2 are found in 
the MSH (melanocyte-stimulating hormone) receptor; 3 in the ACTH 
(adrenocorticotrophic hormone) receptor; 4 in the vasopressin receptor; 
5 and 6 in the TSH (thyroid-stimulating hormone) receptor; 7 in the LH 
(luteinizing hormone) receptor; and 8 in rhodopsin, the light-sensitive 
pigment of the retina. 


TABLE 1 Human Diseases Linked to the G Protein Pathway 


Disease Defective G protein* 


Albright’s hereditary osteodystrophy and 
pseudohypoparathyroidisms 

McCune-Albright syndrome 

Pituitary, thyroid tumors (gsp oncogene) 

Adrenocortical, ovarian tumors (gip 
oncogene) 

Combined precocious puberty and 
peudohypoparathyroidism 


Defective G protein- 


Disease coupled receptor 


Familial hypocalciuric hypercalcemia Human analogue of 
BoPCARI receptor 
Human analogue of 
BoPCARI receptor 
(homozygous) 
Thyrotropin receptor 
Luteinizing hormone 
receptor 
V2 vasopressin receptor 
Rhodopsin receptor 
Cone opsin receptor 


Neonatal severe 
hyperparathyroidism 


Hyperthyroidism (thyroid adenomas) 
Familial male precocious puberty 


X-linked nephrogenic diabetes insipidus 

Retinitis pigmentosa 

Color blindness, spectral sensitivity 
variations 

Familial glucocorticoid deficiency and 
isolated glucocorticoid deficiency 

Short stature, obesity 

Early-onset severe obesity 


Adrenocorticoptropic 
hormone (ACTH) receptor 

Ghrelin receptor 

melanocortin-4 receptor 
(hetererozygote) 

Follicle stimulating hormone 
receptor 


Decreased fertility 


*As described in the text, a G protein with a G,, acts to stimulate the effector, 
whereas a G protein with a Gj, inhibits the effector. 

Source: D, E. Clapham, reprinted with permission from Nature, Vol 371, p. 109, 
1994, copyright 1994. Nature by Nature Publishing Group. Reproduced with 
permission of Nature Publishing Group in the format reuse in a book/textbook 
via Copyright Clearance Center. 


hormone secretion but to the excessive cell proliferation that 
causes the tumor. This conclusion was verified by introducing the 
mutant gene into cultured cells that normally lack this receptor 
and demonstrating that the synthesis of the mutant protein and 
its incorporation into the plasma membrane led to the continuous 
production of cAMP in the genetically engineered cells. 

The mutation that causes thyroid adenomas is not found in 
the normal portion of a patient's thyroid but only in the tumor 
tissue, indicating that the mutation was not inherited but arose 
in one of the cells of the thyroid, which then proliferated to give 
rise to the tumor. A mutation in a cell of the body, such as a thy- 
roid cell, is called a somatic mutation to distinguish it from an 
inherited mutation that would be present in all of the individual’s 
cells. As will be evident in the following chapter, somatic muta- 
tions are a primary cause of human cancer. At least one cancer- 
causing virus has been shown to encode a protein that acts as 
a constitutively active GPCR. The virus is a type of herpes virus 
that is responsible for Kaposi's sarcoma, which causes purplish 
skin lesions and is prevalent in AIDS patients. The virus genome 
encodes a constitutively active receptor for interleukin-8, which 
stimulates signaling pathways that control cell proliferation. 


As noted in Table 1, mutations in genes that encode the 
subunits of heterotrimeric G proteins can also lead to inherited 
disorders. This is illustrated by a report on two male patients 
suffering from a rare combination of endocrine disorders: preco- 
cious puberty and hypoparathyroidism. Both patients were found 
to contain a single amino acid substitution in one of the G, iso- 
forms. The alteration in amino acid sequence caused two effects 
on the mutant G protein. At temperatures below normal body 
temperature, the mutant G protein remained in the active state, 
even in the absence of a bound ligand. In contrast, at normal 
body temperatures, the mutant G protein was inactive, both in 
the presence and absence of bound ligand. The testes, which 
are housed outside of the body’s core, have a lower temperature 
than the body's visceral organs (33°C vs. 37°C ). Normally, the 
endocrine cells of the testes initiate testosterone production at 
the time of puberty in response to the pituitary hormone LH, which 
begins to be produced at that time. The circulating LH binds to 
LH receptors on the surface of the testicular cells, inducing the 
synthesis of cAMP and subsequent production of the male sex 
hormone. The testicular cells of the patients bearing the G pro- 
tein mutation were stimulated to synthesize cAMP in the absence 
of the LH ligand, leading to premature synthesis of testosterone 
and precocious puberty. In contrast, the mutation in this same G, 
subunit in the cells of the parathyroid glands, which function at 


a temperature of 37°C, caused the G protein to remain inactive. 
As a result, the cells of the parathyroid gland could not respond 
to stimuli that would normally cause them to secrete parathyroid 
hormone, leading to the condition of hypoparathyroidism. The 
fact that most of the bodily organs functioned in a normal manner 
in these patients suggests that this particular G, isoform is not 
essential in the activities of most other cells. 

Mutations are thought of as rare and disabling changes in 
the nucleotide sequence of a gene. Genetic polymorphisms, in 
contrast, are thought of as common, “normal” variations within 
the population (Section 10.15). Yet it has become clear in recent 
years that genetic polymorphisms may have considerable impact 
on human disease, causing certain individuals to be more or less 
susceptible to particular disorders than other individuals. This 
has been well documented in the case of GPCRs. For example, 
certain alleles of the gene encoding the Bb. adrenergic receptor 
have been associated with an increased likelihood of developing 
asthma or high blood pressure; certain alleles of a dopamine 
receptor are associated with increased risk of substance abuse or 
schizophrenia; and certain alleles of a chemokine receptor (CCR5) 
are associated with prolonged survival in HIV-infected individuals. 
As discussed in Chapter 10, identifying associations between dis- 
ease susceptibility and genetic polymorphisms is a current focus 
of clinical research. 


15.6 Second Messengers 


Cyclic AMP is a second messenger that is capable of diffusing to other 
sites within the cell. The synthesis of cyclic AMP follows the binding 
of a first messenger—a hormone or other ligand—to a receptor at 
the outer surface of the cell. FIGURE 15.8 shows the diffusion of cyclic 
AMP within the cytoplasm of a neuron following stimulation by an 
extracellular messenger molecule. Whereas the first messenger 
binds exclusively to a single receptor species, the second messenger 
often stimulates a variety of cellular activities. As a result, second 
messengers enable cells to mount a large-scale, coordinated response 
following stimulation by a single extracellular ligand. Other second 
messengers include Ca**, phosphoinositides, inositol trisphos- 
phate, diacylglycerol, cGMP, and nitric oxide. 


Before 5-HT 


19s after 50 uM S-HT 


109s after 5-HT 


The Discovery of Cyclic AMP 


How does the binding of a hormone to the plasma membrane change 
the activity of cytoplasmic enzymes, such as glycogen phosphorylase, 
an enzyme involved in glycogen metabolism? The answer to this 
question was provided by studies that began in the mid-1950s in the 
laboratories of Earl Sutherland and his colleagues at Case Western 
Reserve University. Sutherland’s goal was to develop an in vitro sys- 
tem to study the physiologic responses to hormones. After consider- 
able effort, he was able to activate glycogen phosphorylase in a 
preparation of broken cells that had been incubated with glucagon or 
epinephrine. This broken-cell preparation could be divided by cen- 
trifugation into a particulate fraction consisting primarily of cell 
membranes and a soluble supernatant fraction. Even though glycogen 


10 min after 5-HT 


FIGURE 15.8 The localized formation of cAMP in a live cell in response to the addition of an extracellular messenger molecule. This series of 
photographs shows a sensory nerve cell from the sea hare Aplysia. The concentration of free cAMP is indicated by color: blue represents a low cAMP 
concentration, yellow an intermediate concentration, and red a high concentration. The left image shows the intracellular CAMP level in the unstimulated 
neuron, and the next three images show the effects of stimulation by the neurotransmitter serotonin (5-hydroxytryptamine) at the times indicated. Notice 
that the cAMP levels drop by 109s despite the continued presence of the neurotransmitter. (The cAMP level was determined indirectly in this experiment by 
microinjection of a fluorescently labeled cAMP-dependent protein kinase labeled with both fluorescein and rhodamine on different subunits. Energy transfer 
between the subunits (see page 697) provides a measure of cAMP concentration.) 


Source: From Brian J. Bacskai et al., Science 260:223, 1993. Reprinted with permission from AAAS. 
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particulate material was required to obtain the hormone response. 
Subsequent experiments indicated that the response occurred in at 
least two distinct steps. If the particulate fraction of a liver homogen- 
ate was isolated and incubated with the hormone, some substance was 
released that, when added to the supernatant fraction, activated the 
soluble glycogen phosphorylase molecules. Sutherland identified the 
substance released by the membranes of the particulate fraction as 
cyclic adenosine monophosphate (cyclic AMP, or simply cAMP). This 
discovery is heralded as the beginning of the study of signal transduc- 
tion. As will be discussed below, cAMP stimulates glucose mobiliza- 
tion by activating a protein kinase that adds a phosphate group onto a 
specific serine residue of the glycogen phosphorylase polypeptide. 


Phosphatidylinositol-Derived 
Second Messengers 


It wasnt very long ago that the phospholipids of cell membranes were 
considered strictly as structural molecules that made membranes 
cohesive and impermeable to aqueous solutes. Our appreciation of 
phospholipids has increased with the realization that these molecules 
form the precursors of a number of second messengers. Phospholipids 
of cell membranes are converted into second messengers by a variety of 
enzymes that are regulated in response to extracellular signals. These 
enzymes include phospholipases (lipid-splitting enzymes), phospho- 
lipid kinases (lipid-phosphorylating enzymes), and phospholipid 
phosphatases (lipid-dephosphorylating enzymes). Phospholipases are 
enzymes that hydrolyze specific ester bonds that connect the different 
building blocks that make up a phospholipid molecule. FIGURE 15.94 
shows the cleavage sites within a generalized phospholipid that are 
attacked by the main classes of phospholipases. All four of the enzyme 
classes depicted in Figure 15.9a can be activated in response to extra- 
cellular signals and the products they produce function as second mes- 
sengers. In this section, we will focus on the best-studied lipid second 
messengers, which are derived from phosphatidylinositol and are gen- 
erated following the transmission of signals by G protein-coupled 
receptors and receptor protein-tyrosine kinases. Another group of lipid 
second messengers derived from sphingomyelin is not discussed. 
When the neurotransmitter acetylcholine binds to the surface of 
a smooth muscle cell within the wall of the stomach, the muscle cell 
is stimulated to contract. When a foreign antigen binds to the surface 
of a mast cell, the cell is stimulated to secrete histamine, a substance 


FIGURE 15.9 Phospholipid-based second messengers. (a) The structure 
of a generalized phospholipid (see Figure 2.22). Phospholipids are subject 
to attack by four types of phospholipases that cleave the molecule at the 
indicated sites. Of these enzymes, we will focus on PLC, which splits the 
phosphorylated head group from the diacylglycerol (see Figure 15.10). 

(b) An illustration showing the interaction between a PH domain that binds 
to the phosphorylated inositol ring of a phosphoinositide. This interaction 
tethers the protein to the inner surface of the plasma membrane and may 
alter the activity of the parent protein. (c) Fluorescence micrograph of a cell 
that has been stimulated to move toward a chemoattractant (i.e., a 
chemical to which the cell is attracted). This cell has been stained with an 
antibody that binds specifically to P! 3,4,5-trisphosphate (PIP3), which is 
seen to be localized at the leading edge of the migrating cell (arrows). Bar 
equals 15 ym. 

Source: (6) From J. Hurley and J. Grobler, Curr. Opin. Struct. Biol. 7:559, 
1997 with permission from Elsevier. (c) From Paula Rickert et al., Courtesy 
of Henry R. Bourne, University of California, San Francesco, Trends Cell 
Biol. 10:470, 2000. Both images. 


that can trigger the symptoms of an allergy attack. Both of these 
responses, one leading to contraction and the other to secretion, are 
triggered by the same second messenger, a substance derived from 
the compound phosphatidyl, a minor component of most cellular 
membranes (see Figure 4.10). 
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The first indication that phospholipids might be involved in cel- 
lular responses to extracellular signals emerged from studies carried 
out in the early 1950s by Lowell and Mabel Hokin of Montreal 
General Hospital and McGill University. These investigators had set 
out to study the effects of acetylcholine on RNA synthesis in the pan- 
creas. To carry out these studies, they incubated slices of pigeon pan- 
creas in [**P orthophosphate. The idea was that [*’P] orthophosphate 
would be incorporated into nucleoside triphosphates, which are 
used as precursors during the synthesis of RNA. Interestingly, they 
found that treatment of the tissue with acetylcholine led to the incor- 
poration of radioactivity into the phospholipid fraction of the cell. 
Further analysis revealed that the isotope was incorporated primar- 
ily into phosphatidylinositol (PI), which was rapidly converted to 
other phosphorylated derivatives, which are collectively referred to 
as phosphoinositides. This suggested that inositol-containing 
lipids can be phosphorylated by specific lipid kinases that are acti- 
vated in response to extracellular messenger molecules, such as ace- 
tylcholine. It is now well established that lipid kinases are activated 
in response to a large variety of extracellular signals. 

Several of the reactions of phosphoinositide metabolism are 
shown in FIGURE 15.10. As indicated on the left side of this figure, the 
inositol ring, which resides at the cytoplasmic surface of the bilayer, 
has six carbon atoms. Carbon number 1 is involved in the linkage 
between inositol and diacylglycerol. The 3, 4, or 5 carbons can be 
phosphorylated by specific phosphoinositide kinases present in cells 
to generate 7 distinct phosphoinositides. For example, transfer of a 
single phosphate group to the 4-position of the inositol sugar of PI by 
PI 4-kinase (PI4K) generates PI 4-phosphate (PI(4)P), which can be 
phosphorylated by PIP 5-kinase (PIP5K) to form PI 4,5-bisphosphate 
(PI(4,5)P5); Figure 15.10, steps 1 and 2). PI(4,5)P, can be phosphoryl- 
ated by PI 3-kinase (PI3K) to form PI(3,4,5)P3 (PIP3) (shown in Figure 
15.25c). The phosphorylation of PI(4,5)P, to form PIP; is of particular 
interest because the PI3K enzymes involved in this process can be 
controlled by a large variety of extracellular molecules and PI3K over- 
activity has been associated with human cancers. The formation of 
PIP; during the response to insulin is discussed in Section 15.12. 
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FIGURE 15.10 The generation of second messengers as a result of 
ligand-induced breakdown of phosphoinositides (PI) in the lipid bilayer. 
In steps 1 and 2, phosphate groups are added by lipid kinases to phos- 
phatidylinositol (Pl) to form PIP;. When a stimulus is received by a receptor, 
the ligand-bound receptor activates a heterotrimeric G protein containing a 
Gag subunit (step 3), which activates the enzyme Pl-specific phospholipase 
C-B (step 4), which catalyzes the reaction in which PI(4, 5)Pz is split into 
diacylglycerol (DAG) and inositol 1,4,5-trisohosphate (IP3) (step 5). DAG 
recruits the protein kinase PKC to the membrane and activates the enzyme 
(step 6). IP3 diffuses into the cytosol (step 7), where it binds to an IP3 
receptor and Ca** channel in the membrane of the SER (step 8). Binding of 
IP3 to its receptor causes release of calcium ions into the cytosol (step 9). 


All of the phospholipid species discussed above remain in the 597 


cytoplasmic leaflet of the plasma membrane; they are membrane- 
bound second messengers. Just as there are lipid kinases to add 
phosphate groups to phosphoinositides, there are lipid phosphatases 
(e.g., PTEN) to remove them. The activity of these kinases and phos- 
phatases are coordinated so that specific phosphoinositides appear 
at specific membrane compartments at specific times after a signal 
has been received. The role of specific phosphoinositides in mem- 
brane trafficking was discussed on page 296. 

The phosphorylated inositol rings of phosphoinositides form 
binding sites for several lipid-binding domains (PH, PX, and FYVE) 
found in proteins. Best known is the PH domain (Figure 15.9b), 
which has been identified in over 150 different proteins. Binding of a 
protein by its PH domain to PI(3, 4)P, or PIP; typically recruits the 
protein to the cytoplasmic face of the plasma membrane where it can 
interact with other membrane-bound proteins, including activators, 
inhibitors, or substrates. Figure 15.9c shows an example where PIP; is 
specifically localized to a particular portion of the plasma membrane 
of a cell. PIP; is produced at the front of the cell by a localized lipid 
kinase and is subsequently degraded at the rear and sides of the cell by 
a localized lipid phosphatase. The cell shown in Figure 15.9c is engaged 
in chemotaxis, which is to say that it is moving toward an increasing 
concentration of a particular chemical in the medium that serves as the 
chemoattractant. This is the mechanism that causes phagocytic cells 
such as macrophages to move toward bacteria or other targets that they 
engulf. Chemotaxis depends on the localized production of phospho- 
inositide messengers, which bind to certain actin-binding proteins 
(Section 9.13) to influence the formation of actin filaments and lamel- 
lipodia that are required to move the cell in the direction of the target. 


Phospholipase C 


Not all inositol-containing second messengers remain in the lipid 
bilayer of a membrane. When acetylcholine binds to a smooth mus- 
cle cell, or an antigen binds to a mast cell, the bound receptor acti- 
vates a heterotrimeric G protein (Figure 15.10, step 3), which, in 
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C-B (PLCB) (step 4). Like the protein depicted in Figure 15.9b, PLCB 
is situated at the inner surface of the membrane (Figure 15.10), 
bound there by the interaction between its PH domain and a phos- 
phoinositide embedded in the bilayer. PLC catalyzes a reaction that 
splits PI(4, 5)P, into two molecules, inositol 1,4,5-trisphosphate (IP3) 
and diacylglycerol (DAG) (step 5, Figure 15.10), both of which play 
important roles as second messengers in cell signaling. We will 
examine each of these second messengers separately. 


DIACYLGLYCEROL Diacylglycerol (DAG, Figure 15.10) is a 
lipid molecule that remains in the plasma membrane following its 
formation by PLC. There it recruits and activates effector proteins 
that bear a DAG-binding C1 domain. The best-studied of these 
effectors is a family of proteins called protein kinase C (PKC) (step 6, 
Figure 15.10), which phosphorylate serine and threonine residues 
on a wide variety of target proteins. 

Protein kinase C isoforms have a number of important roles in 
cellular growth and differentiation, cellular metabolism, cell death, 
and immune responses. The apparent importance of protein kinase 
C in growth control is seen in studies with a group of powerful plant 
compounds, called phorbol esters, that resemble DAG. These com- 
pounds activate protein kinase C isoforms in a variety of cultured 
cells, causing them to lose growth control and behave temporarily as 
malignant cells. When the phorbol ester is removed from the 
medium, the cells recover their normal growth properties. In con- 
trast, cells that have been genetically engineered to constitutively 
express protein kinase C exhibit a permanent malignant phenotype 
in cell culture and can cause tumors in susceptible mice. The impor- 
tance of PKC in the development of immune responses is seen in 
studies of a specific PKC inhibitor called AEBO71, which is being 
tested in clinical trials as an immunosuppressant to prevent rejection 
of transplanted kidneys and to treat psoriasis, an autoimmune skin 
disease. 


INOSITOL 1,4,5-TRISPHOSPHATE (IP3) Inositol 1,4,5- 
trisphosphate (IP3) is a sugar phosphate—a small, water-soluble 
molecule capable of rapid diffusion throughout the interior of the 
cell. IP; molecules formed at the membrane diffuse into the cytosol 
(step 7, Figure 15.10) and bind to a specific IP; receptor located at 
the surface of the smooth endoplasmic reticulum (step 8). It was 
noted in Section 8.3 that the smooth endoplasmic reticulum is a site 
of calcium storage in a variety of cells. The IP; receptor also func- 
tions as a tetrameric Ca** channel. Binding of IP; opens the channel, 
allowing Ca”* ions to diffuse into the cytoplasm (step 9). Calcium 
ions can also be considered as intracellular or second messengers 
because they bind to various target molecules, triggering specific 
responses. In the two examples used above, contraction of a smooth 
muscle cell and exocytosis of histamine-containing secretory gran- 
ules in a mast cell, both are triggered by elevated calcium levels. So, 
too, is the response of a liver cell to the hormone vasopressin (the 
same hormone that has antidiuretic activity in the kidney, page 142). 
Vasopressin binds to its receptor at the liver cell surface and causes a 
series of IP;-mediated bursts of Ca”* release, which appear as oscil- 
lations of free cytosolic calcium in the recording shown in 
FIGURE 15.11. The frequency and intensity of such oscillations may 
encode information that governs the cell’s specific response. A list of 
some of the responses mediated by IP; is indicated in Table 15.2. We 
will have more discussion about Ca”* ions in Section 15.14. 
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FIGURE 15.11 Experimental demonstration of changes in free calcium 
concentration in response to hormone stimulation. A single liver cell was 
injected with aequorin, a protein extracted from certain jellyfish that 
luminesces when it binds calcium ions. The intensity of the luminescence is 
proportional to the concentration of free calcium ions. Exposure of the cell 
to vasopressin leads to controlled spikes in the concentration of free calcium 
at periodic intervals. Higher concentrations of hormone do not increase the 
height (amplitude) of the spikes, but they do increase their frequency. 


Source: Reprinted with permission from N. M. Woods, K. S. Cuthbertson, 
and P. H. Cobbold, Nature 319:601, 1986; copyright 1986. Nature by 
Nature Publishing Group. Reproduced with permission of Nature Publishing 
Group in the format reuse in a book/textbook via Copyright Clearance Center. 


TABLE 15.2 Summary of Cellular Responses Elicited by Adding IP3 
to Either Permeabilized or Intact Cells 


Cell type 


Response 


Contraction 

Contraction 

Cyclic GMP formation, actin polymerization 
Shape change, aggregation 

Modulation of light response 

Calcium mobilization, membrane depolarization 
Membrane depolarization, cortical reaction 
Increased potassium current 


Vascular smooth muscle 
Stomach smooth muscle 
Slime mold 

Blood platelets 
Salamander rods 
Xenopus oocytes 

Sea urchin eggs 
Lacrimal gland 


Adapted from M. J. Berridge, reproduced with permission from the Annual 
Review of Biochemistry, Vol. 56, copyright 1987 Annual Review of Biochemistry. 
Volume 56, 1987 by Richardson, Charles C. Reproduced with permission of 
Annual Reviews, Incorporated in the format republish in a book via Copyright 
Clearance Center. 
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1. Describe Sutherland's experiment that led to the 
concept of the second messenger. 


2. What is the mechanism of formation of the second 
messenger IP3? What is the relationship between the 
formation of IP3 and an elevation of intracellular [Ca2*]? 


3. Describe the relationship between 
phosphatidylinositol, diacylglycerol, calcium ions, and 
protein kinase C. How do phorbol esters interfere 
with signal pathways involving DAG? 


© 


15.7 The Specificity of G 
Protein-Coupled Responses 


A wide variety of agents, including hormones, neurotransmitters, 
and sensory stimuli, act by way of GPCRs and heterotrimeric G pro- 
teins to transmit information across the plasma membrane, trigger- 
ing a wide variety of cellular responses. Thus as a group, GPCRs are 
capable of binding a diverse array of ligands. In addition, the recep- 
tor for a given ligand can exist in several different versions (iso- 
forms). For example, researchers have identified nine different 
isoforms of the adrenergic receptor, which binds epinephrine, and 
15 different isoforms of the receptor for serotonin, a powerful neu- 
rotransmitter released by nerve cells in parts of the brain. Different 
isoforms can have different affinities for the ligand or may interact 
with different types of G proteins. Different isoforms of a receptor 
may coexist in the same plasma membrane, or they may occur in the 
membranes of different types of target cells. The heterotrimeric G 
proteins that transmit signals from receptor to effector can also exist 
in multiple forms, as can many of the effectors. The human genome 
encodes at least 16 different Gy subunits, five different G, subunits, 
and 11 different G subunits, along with nine isoforms of the effec- 
tor adenylyl cyclase. Different combinations of specific subunits 
construct G proteins having different capabilities of reacting with 
specific isoforms of both receptors and effectors. 

As mentioned on page 589, some G proteins act by inhibiting 
their effectors. The same stimulus can activate a stimulatory G pro- 
tein (one with a G,, subunit) in one cell and an inhibitory G protein 
(one with aG qi Subunit) in a different cell. For example, when epi- 
nephrine binds to a B-adrenergic receptor on a cardiac muscle cell, a 
G protein with a G,, subunit is activated, which stimulates cAMP 
production, leading to an increase in the rate and force of contraction. 
In contrast, when epinephrine binds to an a-adrenergic receptor on a 
smooth muscle cell in the intestine, a G protein with a G,, subunit is 
activated, which inhibits cAMP production, producing muscle relax- 
ation. Finally, some adrenergic receptors turn on G proteins withG gi 
subunits, leading to activation of PLC. Clearly, the same extracellu- 
lar messenger can activate a variety of pathways in different cells. 


REVIEW | 


1. Describe ways in which a single GPCR ligand can 
effect a variation of responses in different cells. 


15.8 Regulation of Blood 
Glucose Levels 


Glucose can be utilized as a source of energy by nearly all cell types 
present in the body. It is oxidized to CO, and H,O by glycolysis and 
the TCA cycle, providing cells with ATP that can be used to drive 
energy-requiring reactions. The body maintains glucose levels in the 
bloodstream within a narrow range. As discussed in Chapter 3, 
excess glucose is stored in animal cells as glycogen, a large branched 
polymer composed of glucose monomers that are linked through 
glycosidic bonds. The hormone glucagon is produced by the alpha 
cells of the pancreas in response to low blood glucose levels. 
Glucagon stimulates breakdown of glycogen and release of glucose 


into the bloodstream, thereby causing glucose levels to rise. The 
hormone insulin is produced by the beta cells of the pancreas in 
response to high glucose levels and stimulates glucose uptake and 
storage as glycogen. Finally, epinephrine—which is sometimes called 
the “fight-or-flight” hormone—is produced by the adrenal gland in 
stressful situations. Epinephrine causes an increase in blood glucose 
levels to provide the body with the extra energy resources needed to 
deal with the stressful situation at hand. 

Insulin acts through a receptor protein-tyrosine kinase and its 
signal transduction is discussed in Section 15.12. In contrast, both 
glucagon and epinephrine act by binding to GPCRs. Glucagon is a 
small protein that is composed of 29 amino acids, whereas epineph- 
rine is a small molecule that is derived from the amino acid tyrosine. 
Structurally speaking, these two molecules have nothing in common, 
yet both of them bind to GPCRs and stimulate the breakdown of gly- 
cogen into glucose 1-phosphate (FIGURE 15.12). In addition, the 
binding of either of these hormones leads to the inhibition of the 
enzyme glycogen synthase, which catalyzes the opposing reaction in 
which glucose units are added to growing glycogen molecules. Thus 
two different stimuli (glucagon and epinephrine), recognized by dif- 
ferent receptors, induce the same response in a single target cell. The 
two receptors differ from one another primarily in the structure of 
the ligand-binding pocket on the extracellular surface of the cell, 
which is specific for one or the other hormone. Following activation 
by their respective ligands, both receptors activate the same type of 
heterotrimeric G proteins that cause an increase in the levels of cAMP. 
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FIGURE 15.12 The reactions that lead to glucose storage or 
mobilization. The activities of two of the key enzymes in these reactions, 
glycogen phosphorylase and glycogen synthase, are controlled by hormones 
that act through signal transduction pathways. Glycogen phosphorylase is 
activated in response to glucagon and epinephrine, whereas glycogen 
synthase is activated in response to insulin (Section 15.12). 
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of a Response Induced by cAMP 


cAMP is synthesized by adenylyl cyclase, an integral membrane pro- 
tein whose catalytic domain resides at the inner surface of the plasma 
membrane (FIGURE 15.13). cAMP evokes a response that leads to 
glucose mobilization by initiating a chain of reactions, as illustrated 
in FIGURE 15.14. The first step in this reaction cascade occurs as the 
hormone binds to its receptor, activating a G,, subunit, which 
activates an adenylyl cyclase effector. The activated enzyme catalyzes 
the formation of cAMP (steps 1 and 2, Figure 15.14). 

Once formed, cAMP molecules diffuse into the cytoplasm 
where they bind to an allosteric site on a regulatory subunit of a 
cAMP-dependent protein kinase (protein kinase A, PKA) (step 3, 
Figure 15.14). In its inactive form, PKA is a heterotetramer com- 
posed of two regulatory (R) and two catalytic (C) subunits. The reg- 
ulatory subunits normally inhibit the catalytic activity of the enzyme. 
cAMP binding causes the dissociation of the regulatory subunits, 
thereby releasing the active catalytic subunits of PKA. The target 
substrates of PKA in a liver cell include two enzymes that play a piv- 
otal role in glucose metabolism, glycogen synthase and phosphory- 
lase kinase (steps 4 and 5). Phosphorylation of glycogen synthase 
inhibits its catalytic activity and thus prevents the conversion of glu- 
cose to glycogen. In contrast, phosphorylation of phosphorylase 
kinase activates the enzyme to catalyze the transfer of phosphate 
groups to glycogen phosphorylase molecules. As discovered by 
Krebs and Fischer (page 173), the addition of a single phosphate 
group to a specific serine residue in the glycogen phosphorylase 
polypeptide activates this enzyme (step 6), stimulating the break- 
down of glycogen (step 7). The glucose 1-phosphate formed in the 
reaction is converted to glucose, which diffuses into the bloodstream 
and so reaches the other tissues of the body (step 8). 

As one might expect, a mechanism must exist to reverse the 
steps discussed above; otherwise the cell would remain in the acti- 
vated state indefinitely. Liver cells contain phosphatases that remove 
the phosphate groups added by the kinases. A particular member of 
this family of enzymes, protein phosphatase-1, can remove phos- 
phates from all of the phosphorylated enzymes of Figure 15.14: 
phosphorylase kinase, glycogen synthase, and glycogen phosphory- 
lase. The destruction of cAMP molecules present in the cell is 
accomplished by the enzyme cAMP phosphodiesterase, which helps 
terminate the response. 


Signal Amplification 


The binding of a single hormone molecule at the cell surface can 
activate a number of G proteins, each of which can activate an adenyl 
cyclase effector, each of which can produce a large number of cAMP 
messengers in a short period of time. Thus, the production of a sec- 
ond messenger provides a mechanism to greatly amplify the signal 
generated from the original message. Many of the steps in the reac- 
tion cascade illustrated in Figure 15.14 result in amplification of the 
signal (these steps are indicated by the blue arrows). cAMP mole- 
cules activate PKA. Each PKA catalytic subunit phosphorylates a 
large number of phosphorylase kinase molecules, which in turn 
phosphorylate an even larger number of glycogen phosphorylase 
molecules, which in turn can catalyze the formation of a much larger 
number of glucose phosphates. Thus, what begins as a barely percep- 
tible stimulus at the cell surface is rapidly transformed into a major 
mobilization of glucose within the cell. 
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FIGURE 15.13 Formation of cyclic AMP from ATP is catalyzed by 
adenylyl cyclase, an integral membrane protein that consists of two 
parts, each containing six transmembrane helices (shown here in two 
dimensions). The enzyme’s active site is located on the inner surface of the 
membrane in a cleft situated between two similar cytoplasmic domains. 
The breakdown of cAMP (not shown) is accomplished by a phosphodiester- 
ase that converts the cyclic nucleotide to a 5’ monophosphate. 


Other Aspects of cAMP Signal 
Transduction Pathways 


Although the most rapid and best-studied effects of cAMP are pro- 
duced in the cytoplasm, the nucleus and its genes also participate in 
the response. A fraction of the activated PKA molecules translocate 
into the nucleus where they phosphorylate key nuclear proteins 
(step 9, Figure 15.14), most notably a transcription factor called 
CREB (cAMP response element-binding protein). The phosphorylated 
version of CREB binds as a dimer to sites on the DNA (Figure 15.14, 
step 10) containing a particular nucleotide sequence (TGACGTCA), 
known as the cAMP response element (CRE). Recall from Section 
12.13 that response elements are sites in the DNA where transcrip- 
tion factors bind and increase the rate of initiation of transcription. 
CREs are located in the regulatory regions of genes that play a role in 
the response to cAMP. In liver cells, for example, several of the 
enzymes involved in gluconeogenesis, a pathway by which glucose is 
formed from the intermediates of glycolysis (see Figure 3.31), are 
encoded by genes that contain nearby CREs. Thus, epinephrine and 
glucagon not only activate catabolic enzymes involved in glycogen 
breakdown, they promote the synthesis of anabolic enzymes required 
to synthesize glucose from smaller precursors. 

cAMP is produced in many different cells in response to a wide 
variety of different ligands. Several of the hormonal responses medi- 
ated by cAMP in mammalian cells are listed in Table 15.3. Cyclic 
AMP pathways have also been implicated in processes occurring in 
the nervous system, including learning, memory, and drug addiction. 
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Chronic use of opiates, for example, leads to elevated levels of adenylyl 
cyclase and PKA, which may be partially responsible for the physio- 
logic responses that occur during drug withdrawal. Another cyclic 


TABLE 15.3 Examples of Hormone-Induced Responses Mediated 
by cAMP 


Tissue Hormone Response 


Liver Epinephrine and 


glucagon 


Glycogen breakdown, 
glucose synthesis 
(gluconeogenesis), 
inhibition of glycogen 
synthesis 

Glycogen breakdown, 
inhibition of glycogen 
synthesis 

Increased contractility 

Triacylglycerol catabolism 


Skeletal muscle Epinephrine 


Cardiac muscle 
Adipose 


Epinephrine 

Epinephrine, ACTH, 
and glucagon 

Vasopressin (ADH) 


Kidney 


Increased permeability of 
epithelial cells to water 

Secretion of thyroid 
hormones 

Increased calcium 
resorption 

Increased secretion of 
steroid hormones 

Increased secretion of 
glucocorticoids 


Thyroid TSH 


Bone Parathyroid hormone 
Ovary LH 


Adrenal cortex 


cascade are accompanied by a dramatic amplification of the signal. Steps 
leading to amplification are indicated by clusters of blue arrows. Activation of 


transcription by CREB occurs in conjunction with the usual array of coactiva- 
tors (e.g., P300 and CBP) and chromatin-modifying complexes (not shown). 


nucleotide, cyclic GMP, also acts as a second messenger in certain 
cells as illustrated by the induced relaxation of smooth muscle cells 
discussed in Section 15.16. As discussed in the next section, cyclic 
GMP also plays a key role in the signaling pathway involved in vision. 

Because cAMP exerts most of its effects by activating PKA, the 
response of a given cell to cAMP is typically determined by the spe- 
cific proteins phosphorylated by this kinase (FIGURE 15.15). 
Although activation of PKA in a liver cell in response to epinephrine 
leads to the breakdown of glycogen, activation of the same enzyme 
in a kidney tubule cell in response to vasopressin causes an increase 
in the permeability of the membrane to water, and activation of the 


enzyme in a thyroid cell in response to TSH leads to the secretion of 


thyroid hormone. Clearly, PKA must phosphorylate different sub- 


strates in each of these cell types, thereby linking the increase in 


cAMP levels induced by epinephrine, vasopressin, and TSH to dif- 
ferent physiological responses. 


Over one hundred PKA substrates have been described. Most of 


these carry out different functions, which brings up the question as 


to how PKA phosphorylates the appropriate substrates in response 


to a particular stimulus, in a particular cell type. This question was 
answered in part by the observation that different cells express 
different PKA substrates and in part by the discovery of PKA- 


anchoring proteins or AKAPs that function as signaling hubs. The 


first AKAPs were discovered as proteins that co-purified with PKA. 


At least 50 different AKAPs have been discovered since, several of 
which are shown in FIGURE 15.16. As indicated in this figure, AKAPs 
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FIGURE 15.15 Schematic illustration of the variety of processes that 
can be affected by changes in cAMP concentration. All of these effects are 
thought to be mediated by activation of the same enzyme, protein kinase A. 
In fact, the same hormone can elicit very different responses in different 
cells, even when it binds to the same receptor. Epinephrine, for example, 
binds to a similar B-adrenergic receptor in liver cells, fat cells, and smooth 
muscle cells of the intestine, causing the production of cAMP in all three cell 
types. The responses, however, are quite different: Glycogen is broken down 
in the liver cell, triacylglycerols are broken down in the fat cell, and the 
smooth muscle cells undergo relaxation. In addition to PKA, cAMP is known 
to interact with ion channels, phosphodiesterases, and GEFs (page 608). 


provide a structural framework or scaffold for coordinating protein— 
protein interactions by sequestering PKA to specific locations within 


FIGURE 15.16 schematic representation 
of AKAP signaling complexes operating in 
different subcellular compartments. The 
AKAP in each of these protein complexes is 
represented by the purple bar. In each case, 
the AKAP forms a scaffold that brings 
together a PKA molecule with potential 
substrates and other proteins involved in the 
signaling pathway, including phosphatases 
(green triangles) that can remove the added 
phosphate groups and phosphodiesterases 
that can terminate continued signaling. The 
AKAPs shown here target PKA to a number 
of different compartments, including the 
plasma membrane, mitochondrion, 
cytoskeleton, centrosome, and nucleus. 


AMPA 


NMDA 
receptor 


Source: Reprinted with permission from 

W. Wong and J. D. Scott, Nature Reviews 
Mol. Cell. Biol. 5:961, 2004; copyright 2004. 
Nature Reviews Molecular Cell Biology by 
Nature Publishing Group. Reproduced with 
permission of Nature Publishing Group in 
the format reuse in a book/textbook via 
Copyright Clearance Center. 


receptor 


the cell. As a consequence, PKA accumulates in close proximity to 
one or more substrates. When cAMP levels rise and PKA is acti- 
vated, the relevant substrates are present close by and they are the 
first ones to become phosphorylated. Substrate selection thus is 
partly a consequence of the subcellular localization of PKA in the 
presence of particular substrates. Different cells express different 
AKAPs, resulting in localization of PKA in the presence of different 
substrates and consequently phosphorylation of different substrates 
following an increase in cAMP levels. It is interesting to note that, 
unlike most proteins with a similar function, AKAPs have a diverse 
structure, suggesting that evolution has co-opted a variety of differ- 
ent types of proteins to carry out a similar role in cell signaling. 


REVIEW | 


1. What is meant by the term amplification in regard to 
signal transduction? How does the use of a reaction 
cascade result in amplification of a signal? How does 
it increase the possibilities for metabolic regulation? 


2. How is it possible that the same first messenger, such 
as epinephrine, can evoke different responses in 
different target cells? that the same second 
messenger, such as cAMP, can also evoke different 
responses in different target cells? that the same 
response, such as glycogen breakdown, can be 
initiated by different stimuli? 


3. Describe the steps that lead from the synthesis of 
cAMP at the inner surface of the plasma membrane 
of a liver cell to the release of glucose into the 
bloodstream. How is this process controlled by GRKs 
and arrestin? By protein phosphatases? By cAMP 
phosphodiesterase? 


——— 


L-type Ca2+ 
channel 


Plasma 
membrane 


15.9 The Role of GPCRs 
in Sensory Perception 


Our ability to see, taste, and smell depends largely on GPCRs. It was 
mentioned above that rhodopsin, whose structure and activation is 
depicted in Figure 15.5), is a GPCR. Rhodopsin is the light-sensitive 
protein present in the rods of our retina, which are the photoreceptor 
cells that respond to low light intensity and provide us with a black- 
and-white picture of our environment at night or in a darkened room. 
Several closely related GPCRs are present in the cones of the retina, 
which provide us with color vision under conditions of brighter light. 
Absorption of a single photon of light induces a conformational 
change in the rhodopsin molecule, which transmits a signal to a het- 
erotrimeric G protein (called transducin), which activates a coupled 
effector. The effector in this case is the enzyme cGMP phosphodies- 
terase, which hydrolyzes the cyclic nucleotide cGMP, a second mes- 
senger similar in structure to cAMP (Figure 15.13). cGMP plays an 
important role in visual excitation in the rod cells of the retina. In the 
dark, cGMP levels remain high and thus capable of binding to cGMP- 
gated sodium channels in the plasma membrane, keeping the chan- 
nels in an open configuration, leading to a continued inward ionic 
current (a “dark current”). Activation of cGMP phosphodiesterase 
results in lowered cGMP levels, leading to the closure of the sodium 
channels. This response, which is unusual in that it is triggered by a 
decrease in the concentration of a second messenger, may lead to the 
generation of action potentials along the optic nerve. 

Our sense of smell depends on nerve impulses transmitted along 
olfactory neurons that extend from the epithelium that lines our 
upper nasal cavity to the olfactory bulb that is located in our brain 
stem. The distal tips of these neurons, which are located in the nasal 
epithelium, contain odorant receptors, which are GPCRs capable of 
binding various chemicals that enter our nose. Mammalian odorant 
receptors were first identified in 1991 by Linda Buck and Richard Axel 
of Columbia University. It is estimated that humans express roughly 
400 different odorant receptors that, taken together, are able to com- 
bine with a large variety of different chemical structures (odorants).' 
Each olfactory receptor neuron expresses only one allele of one of the 
hundreds of different odorant receptor genes. Consequently, each of 
these sensory neurons contains only one specific odorant receptor 
and is only capable of responding to one or a few related chemicals. As 
a result, activation of different neurons containing different odorant 
receptors provides us with the perception of different aromas. That 
does not mean that a single chemical cannot interact with more than 
one olfactory receptor, but rather that the specific combination of 
receptors that are activated by that compound may play a key role in 
producing a particular smell. Mutations in a specific gene encoding a 
particular odorant receptor can leave a person with the inability to 
detect a particular chemical in their environment that most other 
members of the population can perceive. When activated by bound 
ligands, odorant receptors signal through heterotrimeric G proteins 
to adenylyl cyclase, resulting in the synthesis of cAMP and the open- 
ing of a cAMP-gated cation channel. This response leads to the 
generation of action potentials that are transmitted to the brain. 

Our perception of taste is much less discriminating than our 
perception of smell. Each taste receptor cell in the tongue transmits a 


'The human genome contains roughly 1000 genes that encode odorant receptors but 
the majority are present as nonfunctional pseudogenes (page 391). Mice, which 
depend more heavily than humans on their sense of smell, have more than 1000 of 
these genes in their genome, and 95 percent of them encode functional receptors. 


sense of one of only five basic taste qualities, namely: salty, sour, 603 


sweet, bitter, or umami (from the Japanese word meaning “flavorful”). 
Taste receptor cells that elicit the taste of umami respond to the 
amino acids aspartate and glutamate and to purine nucleotides, gen- 
erating a perception that a food is “savory.” This is the reason that 
monosodium glutamate and disodium guanylate are commonly 
added to processed foods to enhance flavor. The pleasurable umami 
taste is thought to have evolved as a mechanism to drive mammals to 
seek high-protein foods. The perception that a food or beverage is 
salty or sour is elicited directly by sodium ions or protons in the food. 
These ions pass through the plasma membrane of receptor cells via 
Na* or H" channels, respectively, eventually leading to a depolariza- 
tion of the cell’s plasma membrane (Section 4.16). In contrast, the 
perception that a food is bitter, sweet, or savory depends on a com- 
pound interacting with a GPCR at the surface of the receptor cell. 
Humans encode a family of about 30 bitter-taste receptors called 
T2Rs, which are coupled to the same heterotrimeric G protein. As a 
group, these taste receptors bind a diverse array of different com- 
pounds, including plant alkaloids or cyanides, that evoke a bitter 
taste in our mouths. For the most part, substances that evoke this 
perception are toxic compounds that elicit a distasteful, protective 
response that causes us to expel the food matter from our mouth. 
Unlike olfactory cells that contain a single receptor protein, a single 
taste-bud cell that evokes a bitter sensation contains a variety of dif- 
ferent T2R receptors that respond to unrelated noxious substances. 
As a result, many diverse substances evoke the same basic taste, 
which is simply that the food we have eaten is bitter and disagreeable. 
In contrast, a food that elicits a sweet taste is likely to be one that 
contains energy-rich carbohydrates. Humans possess only one high 
affinity sweet-taste receptor (a T1R2-T1R3 heterodimer) and it 
responds to sugars, certain sweet-tasting peptides and proteins (e.g., 
monellin), and artificial sweeteners. Umami receptors consist of a 
TR1-TR3 heterodimer. Fortunately, food that is chewed releases 
odorants that travel via the throat to olfactory neurons in our nasal 
mucosa, allowing the brain to learn much more about the food we 
have eaten than the relatively simple messages provided by taste 
receptors. It is this merged input from both olfactory and taste (gus- 
tatory) receptors that provides us with our rich sense of taste. The 
importance of olfactory neurons in our perception of taste becomes 
more evident when we have a cold that causes us to lose some of our 
appreciation for the taste of food. 


REVIEW | 


1. Describe the role of GPCRs in vision, odor detection, 
and flavor sensing. 


15.10 Protein-Tyrosine 
Phosphorylation as a Mechanism 
for Signal Transduction 


Protein-tyrosine kinases are enzymes that phosphorylate specific 
tyrosine residues on protein substrates. Protein-tyrosine phospho- 
rylation is a mechanism for signal transduction that appeared with 
the evolution of multicellular organisms. Over 90 different protein- 
tyrosine kinases are encoded by the human genome. These kinases 
are involved in the regulation of growth, division, differentiation, 
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cells. Expression of mutant protein-tyrosine kinases that cannot be 
regulated and are continually active can lead to uncontrolled cell 
division and the development of cancer. One type of leukemia, for 
example, occurs in cells that contain an unregulated version of the 
protein-tyrosine kinase ABL. 

Protein-tyrosine kinases can be divided in two groups: recep- 
tor protein-tyrosine kinases (RTKs), which are integral mem- 
brane proteins that contain a single transmembrane helix and an 
extracellular ligand binding domain, and non-receptor (or cyto- 
plasmic) protein-tyrosine kinases. The human genome encodes 
nearly 60 RTKs and 32 non-receptor TKs. The first RTK to be stud- 
ied, EGFR, was identified in 1978 by Stanley Cohen of Vanderbilt 
University. The discovery of the first non-receptor TK is discussed in 
Section 16.3. RTKs are activated directly by extracellular growth and 
differentiation factors such as epidermal growth factor (EGF) and 
platelet-derived growth factor (PDGF) or by metabolic regulators 
such as insulin. Non-receptor protein-tyrosine kinases are regulated 
indirectly by extracellular signals, and they control processes as 
diverse as the immune response, cell adhesion, and neuronal cell 
migration. This section of the chapter is focused on signal transduc- 
tion by RTKs. 


Receptor Dimerization 


An obvious question comes to mind when considering the mechan- 
ics of signal transduction: How is the presence of a growth factor on 
the outside of the cell translated into biochemical changes inside the 
cell? Numerous structural and biochemical studies have demon- 
strated that ligand binding results in the dimerization of the 
extracellular ligand-binding domains of a pair of receptors. Two 
mechanisms for receptor dimerization have been recognized: 
ligand-mediated dimerization and receptor-mediated dimerization 
(FIGURE 15.17). Early work suggested that ligands of RTKs contain 
two receptor-binding sites. This made it possible for a single growth 
or differentiation factor molecule to bind to two receptors at the 
same time, thereby causing ligand-mediated receptor dimerization 
(Figure 15.17a). This model was supported by the observation that 
growth and differentiation factors such as platelet-derived growth 
factor (PDGF) or colony-stimulating factor-1 (CSF-1) are com- 
posed of two similar or identical disulfide-linked subunits, in which 
each subunit contains a receptor-binding site. However, not all 
growth factors were found to conform to this model. Some growth 
factors (e.g., EGF or TGFa) contain only a single receptor-binding 
site. Structural work now supports a second mechanism (Figure 
15.17b) in which ligand binding induces a conformational change 
in the extracellular domain of a receptor, leading to the formation 
or exposure of a receptor dimerization interface. With this mecha- 
nism, ligands act as allosteric regulators that turn on the ability of 
their receptors to form dimers. A small number of RTKs, including 
the insulin and IGF-1 receptors, are present as inactive dimers in 
the absence of ligand (see Figure 15.24). For most RTKs, receptor 
dimerization results in the juxtapositioning of two protein-tyrosine 
kinase domains on the cytoplasmic side of the plasma membrane. 
Bringing two kinase domains in close contact allows for trans- 
autophosphorylation, in which the protein kinase activity of one 
receptor of the dimer phosphorylates tyrosine residues in the cyto- 
plasmic domain of the other receptor of the dimer, and vice versa 
(Figure 15.17a,b). 


Protein Kinase Activation 


Autophosphorylation sites on RTKs can carry out two different 
functions: They can regulate the receptor’s kinase activity or serve as 
binding sites for cytoplasmic signaling molecules. Kinase activity is 
usually controlled by autophosphorylation on tyrosine residues that 
are present in the activation loop of the kinase domain. The activa- 
tion loop, when unphosphorylated, obstructs the substrate-binding 
site, thereby preventing ATP from entering. Following its phospho- 
rylation, the activation loop is stabilized in a position away from the 
substrate-binding site, resulting in activation of the kinase domain. 
Once their kinase domain has been activated, the receptor subunits 
proceed to phosphorylate each other on tyrosine residues that are 
present in regions adjacent to the kinase domain. It is these autophos- 
phorylation sites that act as binding sites for cellular signaling proteins. 


Phosphotyrosine-Dependent 
Protein-Protein Interactions 


Signaling pathways consist of a chain of signaling proteins that inter- 
act with one another in a sequential manner (see Figure 15.3). 
Signaling proteins are able to associate with activated protein-tyrosine 
kinase receptors, because such proteins contain domains that bind 
specifically to phosphorylated tyrosine residues (as in Figure 15.17). 
The best-studied pTyr-binding domains are the Src-homology 2 (SH2) 
domain and the phosphotyrosine-binding (PTB) domain. SH2 domains 
were initially identified as part of proteins encoded by the genome of 
tumor-causing (oncogenic) viruses. They are composed of approxi- 
mately 100 amino acids and contain a conserved binding-pocket that 
accommodates a phosphorylated tyrosine residue (FIGURE 15.18). 
More than 110 SH2 domains are encoded by the human genome. 
They mediate a large number of phosphorylation-dependent protein- 
protein interactions. These interactions occur following phosphorylation 
of specific tyrosine residues. The specificity of the interactions is 
determined by the amino acid sequence immediately adjacent to the 
phosphorylated tyrosine residues. For example, the SH2 domain of 
the Src protein-tyrosine kinase recognizes pTyr-Glu-Glu-Ile, whereas 
the SH2 domains of PI 3-kinase bind to pTyr-Met-X-Met (in which X 
can be any residue). It is interesting to note that the budding-yeast 
genome encodes only one SH2-domain-containing protein, which 
correlates with the overall lack of tyrosine-kinase signaling activity in 
these lower single-celled eukaryotes. 

PTB domains were discovered more recently. They can bind to 
phosphorylated tyrosine residues that are usually present as part of 
an asparagine-proline-X-tyrosine (Asn-Pro-X-Tyr) motif. The story 
is more complicated, however, because some PTB domains appear to 
bind specifically to an unphosphorylated Asn-Pro-X-Tyr motif, 
whereas others bind specifically to the phosphorylated motif. PTB 
domains are poorly conserved and different PTB domains possess 
different residues that interact with their ligands. 


Activation of Downstream 
Signaling Pathways 


We have seen that receptor protein-tyrosine kinases (RTKs) are 
autophosphorylated on one or more tyrosine residues. A variety of 
signaling proteins with SH2 or PTB domains are present in the cyto- 
plasm. Receptor activation therefore results in formation of signaling 
complexes, in which SH2- or PTB-containing proteins bind to 


Ligand Ligand 


Inactive 
monomers 


Inactive 
monomers 


Ligand induces 
dimerization 
interface 


Inactive 
monomers 


Ligand-mediated 
dimerization 


Receptor-mediated 
dimerization 


Active dimers Active dimers 


Trans-autophos- 
phorylation 


Trans-autophos- 
phorylation 


Signal 
transmission 


Signal 
transmission 


SH2 or PTB SH2 or PTB 
domain domain 
(a) (b) 


FIGURE 15.17 Steps in the activation of a receptor protein-tyrosine kinase (RTK). (a) Ligand-mediated dimerization. In the nonactivated state, the 
receptors are present in the membrane as monomers. Binding of a bivalent ligand leads directly to dimerization of the receptor and activation of its kinase 
activity, causing it to add phosphate groups to the cytoplasmic domain of the other receptor subunit. The newly formed phosphotyrosine residues of the 
receptor serve as binding sites for target proteins containing either SH2 or PTB domains. The target proteins become activated as a result of their interaction 
with the receptor. (b) Receptor-mediated dimerization. The sequence of events are similar to those in part a, except that the ligand is monovalent and, 
consequently, a separate ligand molecule binds to each of the inactive monomers. Binding of each ligand induces a conformational change in the receptor 
that creates a dimerization interface (red arrows). The ligand-bound monomers interact through this interface to become an active dimer. 

Source: (a-b) Based on a drawing by J. Schlessinger and A. Ullrich, Neuron 9:384, 1992; by Permission of Cell Press. Neuron by Cell Press. Reproduced with 
permission of Cell Press in the format reuse in a book/textbook via Copyright Clearance Center. 
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-1 ASN +1 GLU +2 GLU 


FIGURE 15.18 The interaction between an SH2 domain of a protein 
and a peptide containing a phosphotyrosine residue. The SH2 

domain of the protein is shown in a cutaway view with the accessible 
surface area represented by red dots and the polypeptide backbone as a 
purple ribbon. The phosphotyrosine-containing heptapeptide (Pro-Asn- 
pTyr-Glu-Glu-lle-Pro) is shown as a space-filling model whose side chains 
are colored green and backbone is colored yellow. The phosphate group is 
shown in light blue. The phosphorylated tyrosine residue and the isoleucine 
residue (+3) are seen to project into pockets on the surface of the SH2 
domain, creating a tightly fitting interaction, but only when the key tyrosine 
residue is phosphorylated. 


Source: From Gabriel Waksman et al., courtesy of John Kuriyan, Cell 
72:783, 1993. Reprinted with permission from Elsevier. 


specific autophosphorylation sites present on the receptor (as in 
Figure 15.17). We can distinguish several groups of signaling proteins 
that can interact with activated RTKs, including adaptor and scaf- 
folding proteins, transcription factors, and enzymes (FIGURE 15.19). 


e Adaptor proteins and scaffolding proteins function as linkers 
that enable two or more signaling proteins to become joined 
together as part of a signaling complex (Figure 15.19a). Adaptor 
proteins contain an SH2 domain and one or more additional 
protein-protein interaction domains. For instance, the adaptor 
protein Grb2 contains one SH2 and two SH3 (Sre-homology 3) 


FIGURE 15.19 A diversity of signaling proteins. Cells contain numerous 
proteins with SH2 or PTB domains that bind to phosphorylated tyrosine 
residues. (a) Adaptor proteins, such as Grb2, function as a link between 
other proteins. As shown here, Grb2 can serve as a link between an 
activated growth factor RTK and Sos, an activator of a downstream protein 
named Ras. The function of Ras is discussed later. (b) The docking protein 
IRS contains a PTB domain that allows it to bind to the activated receptor. 
Once bound, tyrosine residues on the docking protein are phosphorylated 
by the receptor. These phosphorylated residues function as binding sites 
for other signaling proteins. (c) Certain transcription factors bind to 
activated RTKs, an event that leads to the phosphorylation and activation 
of the transcription factor and its translocation to the nucleus. Members of 
the STAT family of transcription factors become activated in this manner. 
(a) A wide array of signaling enzymes are activated following binding to an 
activated RTK. In the case depicted here, a phospholipase (PLC-Y), a lipid 
kinase (PI3K), and a protein-tyrosine phosphatase (Shp2) have all bound to 
phosphotyrosine sites on the receptor. 


domains (FIGURE 15.20). As shown in Figure 2.42, SH3 domains 
bind to proline-rich sequence motifs. The SH3 domains of Grb2 
bind constitutively to other proteins, including Sos and Gab. The 
SH2 domain binds to phosphorylated tyrosine residues within a 
Tyr-X-Asn motif. Consequently, tyrosine phosphorylation of the 
Tyr-X-Asn motif on an RTK results in translocation of Grb2-Sos 
or Grb2-Gab from the cytosol to a receptor, which is present at 
the plasma membrane (Figure 15.19a). 
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FIGURE 15.20 Tertiary structure of an adaptor protein, Grb2. Grb2 
consists of three parts: two SH3 domains and one SH2 domain. SH2 
domains bind to a protein (e.g., the activated EGF receptor) containing a 
particular motif that includes a phosphotyrosine residue. SH3 domains bind 
to a protein (e.g., Sos) that contains a particular motif that is rich in proline 
residues. Dozens of proteins that bear these domains have been identified. 
Interactions involving SH3 and SH2 domains are shown in Figures 2.42 and 
15.18, respectively. Other adaptor proteins include Nck, Shc, and Crk. 
Source: From Sébastien Maignan et al., Science 268:291, 1995. Reprinted 
with permission from AAAS. Image provided courtesy of Arnaud Ducruix. 


» Transcription factors were discussed at length in Chapter 12. 
Transcription factors that belong to the STAT family play an 
important role in the function of the immune system. STATs 
contain an SH2 domain together with a tyrosine phosphoryla- 
tion site that can act as a binding site for the SH2 domain of 
another STAT molecule (Figure 15.19c). Tyrosine phosphoryla- 
tion of STAT SH2 binding sites situated within a dimerized 
receptor leads to the recruitment of STAT proteins (Figure 
15.19c). Upon association with the receptor complex, tyrosine 
residues in these STAT proteins are phosphorylated. As a result 
of the interaction between the phosphorylated tyrosine residue 
on one STAT protein and the SH2 domain on a second STAT 
protein, and vice versa, these transcription factors will form 
dimers. Dimers, but not monomers, move to the nucleus where 
they stimulate the transcription of specific genes involved in an 
immune response. The role of STATs in signaling an immune 
response is discussed in Section 17.13. 


» Signaling enzymes include protein kinases, protein phosphatases, 
lipid kinases, phospholipases, and GTPase activating proteins. 
When equipped with SH2 domains, these enzymes associate with 
activated RTKs and are turned on directly or indirectly as a 
consequence of this association (Figure 15.19d ). Three general 
mechanisms have been identified by which these enzymes are 
activated following their association with a receptor. Enzymes 


can be activated simply as a result of translocation to the 
membrane, which places them in close proximity to their 
substrates. Enzymes can also be activated through an allosteric 
mechanism (page 110), in which binding to phosphotyrosine 
results in a conformational change in the SH2 domain that 
causes a conformational change in the catalytic domain, resulting 
in a change in catalytic activity. Finally, enzymes can be regulated 
directly by phosphorylation. As will be described below, signal- 
ing proteins that associate with activated RTKs initiate cascades 
of events that lead to the biochemical changes required to 
respond to the presence of extracellular messenger molecules. 


Ending the Response 


Signal transduction by RTKs is usually terminated by internalization 
of the receptor, primarily through clathrin-mediated endocytosis 
(Section 8.17). Some RTKs possess a short sequence motif on their 
cytoplasmic domain that bind to the clathrin adaptor protein AP-2, 
which is thought to mediate its internalization. Numerous recent 
reports also point to the ubiquitination of RTKs as playing a key role 
in internalization. Ubiquitin ligases (most notably, those of the Cbl 
family) associate with the RTK signalling complex through SH2 
domains or adaptor proteins, and subsequently covalently attach one 
or more ubiquitins to the RTK, marking it for internalization. As in 
the case of GPCRs (Figure 15.7), internalized RTKs can have several 
alternate fates; they can be degraded in lysosomes, returned to the 
plasma membrane, or become part of endosomal signaling com- 
plexes and engage in continued intracellular signaling. 

Now that we have discussed some of the general mechanisms by 
which RTKs are able to activate signaling pathways, we can look 
more closely at several important pathways that are activated down- 
stream of RTKs. First we will discuss the Ras-MAP kinase pathway, 
which is probably the best characterized signaling cascade that is 
turned on by activated protein-tyrosine kinases. A different cascade 
will be described in the context of the insulin receptor. 


REVIEW | 


1. What is an SH2 domain, and what role does it play 


in signaling pathways? 
° 


15.11 The Ras-MAP Kinase 
Pathway 


Retroviruses are small viruses that carry their genetic information in 
the form of RNA. Some of these viruses contain genes, called onco- 
genes, that enable them to transform normal cells into tumor cells. 
Ras was originally described as the product of a retroviral oncogene 
and, only later, determined to be derived from its mammalian host. 
It was subsequently discovered that approximately 30 percent of all 
human cancers contain mutant versions of RAS genes. At this point 
it is important to note that Ras proteins are part of a superfamily of 
more than 150 small (monomeric) G proteins including the Rabs 
(page 288), Sarl (page 283), and Ran (page 464). These proteins are 
involved in the regulation of numerous processes, including cell 
division, differentiation, gene expression, cytoskeletal organization, 
vesicle trafficking, and nucleocytoplasmic transport. The principles 
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small G-protein superfamily. 

Ras is a small GTPase that is anchored at the inner surface of the 
plasma membrane by a covalently attached lipid group that is embed- 
ded in the inner leaflet of the bilayer (Figure 15.19a). Ras is function- 
ally similar to the heterotrimeric G proteins that were discussed earlier 
and, like those proteins, Ras also acts as both a switch and a molecular 
timer. Unlike heterotrimeric G proteins, however, Ras consists of only 
a single small subunit. Ras proteins are present in two different forms: 
an active GTP-bound form and an inactive GDP-bound form 
(FIGURE 15.21a). Ras-GTP binds and activates downstream signaling 
proteins. Ras is turned off by hydrolysis of its bound GTP to GDP. 
Mutations in one of the human RAS genes that lead to tumor forma- 
tion prevent the protein from hydrolyzing the bound GTP back to the 
GDP form. As a result, the mutant version of Ras remains in the “on” 
position, sending a continuous message downstream along the signal- 
ing pathway, keeping the cell in the proliferative mode. 


Accessory Proteins 


The cycling of monomeric G proteins, such as Ras, between active 
and inactive states is aided by accessory proteins that bind to the G 
protein and regulate its activity (Figure 15.21b). These accessory 
proteins include: 


1. GTPase-activating proteins (GAPs). Most monomeric G pro- 
teins possess some capability to hydrolyze a bound GTP, but this 
capability is greatly accelerated by interaction with specific GAPs. 


Switch Il 


(a) 


Because they stimulate hydrolysis of the bound GTP, which inac- 
tivates the G protein, GAPs dramatically shorten the duration of 
a G protein-mediated response. Mutations in one of the Ras- 
GAP genes (NF1) cause neurofibromatosis 1, a disease in which 
patients develop large numbers of benign tumors (neurofibro- 
mas) along the sheaths that line the nerve trunks. 


2. Guanine nucleotide-exchange factors (GEFs). An inactive 
G protein is converted to the active form when the bound GDP is 
replaced with a GTP. GEFs are proteins that bind to an inactive 
monomeric G protein and stimulate dissociation of the bound 
GDP. Once the GDP is released, the G protein rapidly binds a 
GTP, which is present at relatively high concentration in the cell, 
thereby activating the G protein. 


3. Guanine nucleotide-dissociation inhibitors (GDIs). GDIs are 
proteins that inhibit the release of a bound GDP from a mono- 
meric G protein, thus maintaining the protein in the inactive, 
GDP-bound state. The activity and localization of these various 
accessory proteins are tightly regulated by other proteins, which 
thus regulate the state of the G protein. 


Ras-GTP can be thought of as a signaling hub because it can 
interact directly with several downstream targets. Here we will dis- 
cuss Ras as an element of the Ras-MAP kinase cascade. The Ras- 
MAP kinase cascade is turned on in response to a wide variety of 
extracellular signals and plays a key role in regulating vital activities 
such as cell proliferation and differentiation. The pathway relays 
extracellular signals from the plasma membrane through the 
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FIGURE 15.21 The structure of a G protein and the G protein cycle. (a) Comparison of the tertiary structure of the active GTP-bound state (red) 

and inactive GDP-bound state (green) of the small G protein Ras. A bound guanine nucleotide is depicted in the ball-and-stick form. The differences in 
conformation occur in two flexible regions of the molecule known as switch | and switch Il. The difference in conformation shown here affects the molecule’s 
ability to bind to other proteins. (b) The G protein cycle. G proteins are in their inactive state when they are bound by a molecule of GDP. If the inactive G 
protein interacts with a guanine nucleotide dissociation inhibitor (GDI), release of the GDP is inhibited and the protein remains in the inactive state (step 1a). 
If the inactive G protein interacts with a guanine nucleotide exchange factor (GEF; step 1b), the G protein exchanges its GDP for a GTP (step 2), which 
activates the G protein so that it can bind to a downstream target protein (step 3). Binding to the GTP-bound G protein activates the target protein, which 
is typically an enzyme such as a protein kinase or a protein phosphatase. This has the effect of transmitting the signal farther downstream along the 
signaling pathway. G proteins have a weak intrinsic GTPase activity that is stimulated by interaction with a GTPase-activating protein (GAP) (step 4). 

The degree of GTPase stimulation by a GAP determines the length of time that the G protein is active. Consequently, the GAP serves as a type of clock 
that regulates the duration of the response (step 5). Once the GTP has been hydrolyzed, the complex dissociates, and the inactive G protein is ready to 


begin a new cycle (step 6). 


Source: (a) From Steven J. Gamblin and Stephen J. Smerdon, Struct. 7:R200, 1999. Reprinted with permission from Elsevier. 


cytoplasm and into the nucleus. The overall outline of the pathway is 
depicted in FIGURE 15.22. This pathway is activated when a growth 
factor, such as EGF or PDGF, binds to the extracellular domain of its 
RTK. Many activated RTKs possess phosphorylated tyrosine resi- 
dues that act as docking sites for the adaptor protein Grb2. Grb2, in 
turn, binds to Sos, which is a guanine nucleotide exchange factor 
(a GEF) for Ras. Creation of a Grb2-binding site on an activated 
receptor promotes the translocation of Grb2-Sos from the cytoplasm 
to the cytoplasmic surface of the plasma membrane, placing Sos in 
close proximity to Ras (as in Figure 15.194). 

Simply bringing Sos to the plasma membrane is sufficient to 
cause Ras activation. This was illustrated by an experiment with 
a mutant version of Sos that is permanently tethered to the inner 
surface of the plasma membrane. Expression of this membrane- 
bound Sos mutant results in constitutive activation of Ras and trans- 
formation of the cell to a malignant phenotype. Interaction with Sos 
opens the Ras nucleotide-binding site. As a result, GDP is released 
and is replaced by GTP. Exchange of GDP for GTP in the nucleotide- 
binding site of Ras results in a conformational change and the crea- 
tion of a binding interface for a number of proteins, including an 
important signaling protein called Raf. Raf is then recruited to the 
inner surface of the plasma membrane where it is activated by a 
combination of phosphorylation and dephosphorylation reactions. 

Raf is a serine-threonine protein kinase. One of its substrates is 
the protein kinase MEK (Figure 15.22). MEK, which is activated as a 
consequence of phosphorylation by Raf, goes on to phosphorylate 
and activate two MAP kinases named ERK1 and ERK2. Over 160 
proteins that can be phosphorylated by these kinases have been 
identified, including transcription factors, protein kinases, cytoskel- 
etal proteins, apoptotic regulators, receptors, and other signaling 
proteins. Once activated, the MAP kinase is able to move into the 


FIGURE 15.22 The steps of a generalized MAP kinase cascade. Binding of 
growth factor to its receptor (step 1) leads to the autophosphorylation of tyrosine 
residues of the receptor (step 2) and the subsequent recruitment of the Grb2-Sos 
proteins (step 3). This complex causes the GTP-GDP exchange of Ras (step 4), 
which recruits the protein Raf to the membrane, where it is phosphorylated and 
thus activated (step 5). In the pathway depicted here, Raf phosphorylates and 
activates another kinase named MEK (step 6), which in turn phosphorylates and 
activates still another kinase termed ERK (step 7). This three-step phosphoryla- 


tion scheme shown in steps 5-7 is characteristic of all MAP kinase cascades. 
Because of their sequential kinase activity, Raf is known as a MAPKKK (MAP 


kinase kinase kinase), MEK as a MAPKK (MAP kinase kinase), and ERK as a MAPK 
(MAP kinase). MAPKKs are dual-specificity kinases, a term denoting that they can 
phosphorylate tyrosine as well as serine and threonine residues. All MAPKs have 


a tripeptide near their catalytic site with the sequence Thr-X-Tyr. MAPKK 
phosphorylates MAPK on both the threonine and tyrosine residue of this 
sequence, thereby activating the enzyme (step 7). Once activated, MAPK 


translocates into the nucleus where it phosphorylates transcription factors (TF, 
step 8), such as Elk-1. Phosphorylation of the transcription factors increases their 
affinity for regulatory sites on the DNA (step 9), leading to an increase in the 


transcription of specific genes (e.g., Fos and Jun) involved in the growth 


response. One of the genes whose expression is stimulated encodes a MAPK 


phosphatase (MKP-1; step 10). Members of the MKP family can remove 


phosphate groups from both tyrosine and threonine residues of MAPK (step 11), 
which inactivates MAPK and stops further signaling activity along the pathway. 
Source: H. Sun and N. K. Tonks, Trends Biochem Science 19:484, 1994. Trends in 
Biochemical Sciences by International Union of Biochemistry reproduced with 
permission of Elsevier Ltd. in the format reuse in a book/textbook via Copyright 


Clearance Center. 


nucleus where it phosphorylates and activates a number of tran- 609 
scription factors and other nuclear proteins. Eventually, the pathway 
leads to the activation of genes involved in cell proliferation, includ- 
ing cyclin D1, which plays a key role in driving a cell from G1 into S 
phase (Figure 14.8). 

As discussed in the following chapter, oncogenes are identified 
by their ability to cause cells to become cancerous. Oncogenes are 
derived from normal cellular genes that have either become mutated. 
or are overexpressed. Many of the proteins that play a part in the Ras 
signaling pathway were discovered because they were encoded by 
cancer-causing oncogenes. This includes the genes for Ras, Raf, and 
a number of the transcriptional factors generated at the end of the 
pathway (e.g., Fos and Jun). Genes for several of the RTKs situated at 
the beginning of the pathway, including the receptors for both EGF 
and PDGF, have also been identified among the many known onco- 
genes. The fact that so many proteins in this pathway are encoded by 
genes that can cause cancer when mutated emphasizes the impor- 
tance of the pathway in the control of cell growth and proliferation. 
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Different Types of Information 


The same basic pathway from RTKs through Ras to the activation of 
transcription factors, as illustrated in Figure 15.22, is found in all 
eukaryotes investigated, from yeast through flies and nematodes to 
mammals. Evolution has adapted the pathway to meet many differ- 
ent ends. In yeast, for example, the MAP kinase cascade is required 
for cells to respond to mating factors; in fruit flies, the pathway is 
utilized during the differentiation of the photoreceptors in the com- 
pound eye; and in flowering plants, the pathway transmits signals 
that initiate a defense against pathogens. In each case, the core of the 
pathway contains a trio of enzymes that act sequentially: a MAP 
kinase kinase kinase (MAPKKK), a MAP kinase kinase (MAPKK), 
and a MAP kinase (MAPK) (Figure 15.22). Each of these compo- 
nents is represented in a particular organism by a small family of 
proteins. To date, 14 different MAPKKKs, seven different MAPKKs, 
and 13 different MAPKs have been identified in mammals. By utiliz- 
ing different members of these protein families, mammals are able to 
assemble a number of different MAP kinase pathways that transmit 
different types of extracellular signals. We have already described 
how mitogenic stimuli are transmitted along one type of MAP kinase 
pathway that leads to cell proliferation. In contrast, when cells are 
exposed to stressful stimuli, such as X-rays or damaging chemicals, 
signals are transmitted along different MAP kinase pathways that 
cause the cell to withdraw from the cell cycle, rather than progress- 
ing through it as indicated in Figure 15.22. Withdrawal from the cell 
cycle gives the cell time to repair the damage resulting from the 
adverse conditions. 

Recent studies have focused on the signaling specificity of MAP 
kinase cascades in an attempt to understand how cells are able to 
utilize similar proteins as components of pathways that elicit differ- 
ent cellular responses. Studies of amino acid sequences and protein 
structures suggest that part of the answer lies in selective interac- 
tions between enzymes and substrates. For example, certain mem- 
bers of the MAPKKK family phosphorylate specific members of the 
MAPKK family, which in turn phosphorylate specific members of 
the MAPK family. But many members of these families can partici- 
pate in more than one MAPK signaling pathway. 

Specificity in MAP kinase pathways is also achieved by spatial 
localization of the component proteins. Spatial localization is accom- 
plished by scaffolding proteins, whose apparent function is to tether 
the appropriate members of a signaling pathway in a specific spatial 
orientation that enhances their mutual interactions. The AKAPs 
depicted in Figure 15.16 are examples of scaffolding proteins 
involved in cAMP-driven pathways. Another group of scaffolding 
proteins, such as the yeast proteins shown in FIGURE 15.23a,b, play a 
role in routing signals through one of various MAP kinase pathways. 
In some cases, scaffolding proteins can take an active role in signal- 
ing events. For example, they can induce a change in conformation 
of bound signaling proteins, leading to their activation or inhibition. 
A few scaffolding proteins are known to have an enzymatic role, as 
illustrated by the MAPKK activity of the yeast Pbs2 scaffold shown 
in Figure 15.230. In addition to facilitating a particular series of reac- 
tions, scaffolding proteins may prevent proteins involved in one 
signaling pathway from participating in other pathways. As a result, 
several pathways can share the same limited set of signaling proteins 
without compromising specificity. This is the case for the yeast 
MAPKKK protein Stell shown in Figure 15.23a,b, which partici- 
pates in both the mating and osmoregulatory response depending 
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FIGURE 15.23 The roles of scaffolding proteins in mediating two 
yeast MAPK pathways. (a) The MAPK pathway that regulates mating in 
these cells is elicited by a mating factor that binds to a GPCR, Ste2, leading 
to the activation of a Gp, which binds to the scaffolding protein Ste5, which 
in turn binds the MAPKKK, MAPKK, and MAPK proteins of the pathway. 

(b) The MAPK pathway that regulates the yeast osmoregulatory response in 
cells exposed to high salt. The activated receptor (Sho1) binds to the Pbs2 
scaffolding protein by its SH3 domain. The MAPKKK Ste11 is shared in 
these two pathways but is recruited into one or the other response by 
virtue of its interaction with the appropriate protein scaffold. The scaffold 
Pbs2 does not recruit a separate MAPKK, but has its own MAPKK enzymatic 
activity. (c) When cells are genetically engineered to express a chimeric 
Ste5-Pbs2 scaffold, they respond to a mating factor by exhibiting the 
osmoregulatory response. (See Science 332:680, 2011, for a discussion of 
scaffold proteins and this experiment.) 


upon which scaffolding protein it has interacted with. Recent 
research has shown that scaffold proteins can associate with multiple 
distinct clusters of proteins over time, allowing for a receptor to trig- 
ger different downstream events at specific timepoints after its acti- 
vation. The importance of scaffolding proteins is well illustrated by 
an experiment in which parts of two different yeast MAPK-cascade 
scaffolding proteins (Ste5 and Pbs2) were genetically combined to 
form a chimeric protein (Ste5-Pbs2) (Figure 15.23c). Normally these 
two scaffolds mediate two different MAPK signaling pathways 
(Figure 15.23a,b). When yeast cells containing the chimeric protein 
were exposed to a mating factor that normally stimulates the mating 
response, the cells responded by displaying the osmoregulatory 
response. 
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1. What is the role of Ras in signaling pathways? How is 
this affected by the activity of a Ras-GAP? How does 
Ras differ from a heterotrimeric G protein? 

2. How does the MAP kinase cascade alter the 
transcriptional activity of a cell? 


o—___! 


15.12 Signaling by the 
Insulin Receptor 


Our bodies spend considerable effort maintaining blood glucose lev- 
els within a narrow range. A decrease in blood glucose levels can lead 
to loss of consciousness and coma, as the central nervous system 
depends largely on glucose for its energy metabolism. A persistent 
elevation in blood glucose levels results in a loss of glucose, fluids, 
and electrolytes in the urine and serious health problems. The levels 
of glucose in the circulation are monitored by the pancreas. When 
blood glucose levels fall below a certain level, the alpha cells of the 
pancreas secrete glucagon. As discussed earlier, glucagon acts through 
GPCRs and stimulates the breakdown of glycogen resulting in an 
increase in blood glucose levels. When glucose levels rise, as occurs 
after a carbohydrate-rich meal, the beta cells of the pancreas respond 
by secreting insulin. Insulin functions as an extracellular messenger 
molecule, informing cells that glucose levels are high. Cells that 
express insulin receptors on their surface, such as cells in the liver, 
respond to this message by increasing glucose uptake, increasing gly- 
cogen and triglyceride synthesis, and/or decreasing gluconeogenesis. 


The Insulin Receptor Is a 
Protein-Tyrosine Kinase 


Each insulin receptor is composed of an a and a 8 chain, which are 
derived from a single precursor protein by proteolytic processing. The 
o chain is entirely extracellular and contains the insulin-binding site. 
The f chain is composed of an extracellular region, a single transmem- 
brane region, and a cytoplasmic region (FIGURE 15.24). The a and B 
chains are linked together by disulfide bonds. Two of these af heter- 
odimers are held together by disulfide bonds between the a chains. 
Thus, while most RTKs are thought to be present on the cell surface as 
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monomers, insulin receptors are present as stable dimers. Like other 
RTKs, insulin receptors are inactive in the absence of ligand (Figure 
15.24a). Recent work suggests that the insulin receptor dimer binds 
tightly to a single insulin molecule. This causes repositioning of the 
ligand-binding domains on the outside of the cell, which causes the 
tyrosine kinase domains on the inside of the cell to come into close 
physical proximity. Juxtaposition of the kinase domains leads to trans- 
autophosphorylation and receptor activation (Figure 15.245). 

Several tyrosine phosphorylation sites have been identified in 
the cytoplasmic region of the insulin receptor. Three of these phos- 
phorylation sites are present in the activation loop. In the unphos- 
phorylated state, the activation loop assumes a conformation in 
which it occupies the active site. Upon phosphorylation of the three 
tyrosine residues, the activation loop assumes a new conformation 
away from the catalytic cleft. This new conformation requires a rota- 
tion of the small and large lobes of the kinase domain with respect to 
each other, thereby bringing residues that are essential for catalysis 
closer together. In addition, the activation loop now leaves the cata- 
lytic cleft open so that it can bind substrates. Following activation of 
the kinase domain, the receptor phosphorylates itself on tyrosine 
residues that are present adjacent to the membrane and in the 
carboxyl-terminal tail (Figure 15.240). 


Insulin Receptor Substrates 1 and 2 


Most RTKs possess autophosphorylation sites that directly recruit 
SH2 domain-containing signaling proteins (as in Figure 15.19a, c, 
and d). The insulin receptor is an exception to this general rule, 
because it associates instead with a small family of adaptor proteins 
(Figure 15.19), called insulin-receptor substrates (IRSs). The 
IRSs, in turn, provide the binding sites for SH2 domain-containing 
signaling proteins. Some of the events that occur during insulin 
signaling are shown in Figure 15.24c. Following ligand binding and 
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FIGURE 15.24 The response of the insulin receptor to ligand binding. (a) The insulin receptor, shown here in schematic form in the inactive state, is 
a tetramer consisting of two a and two B subunits. (b) The binding of a single insulin molecule to the @ subunits causes a conformational change in the 
B subunits, which activates the tyrosine kinase activity of the B subunits. The activated § subunits phosphorylate tyrosine residues located on the cytoplasmic 


domain of the receptor as well as tyrosine residues on several insulin receptor substrates (IRSs). (c) Once bound to the insulin receptor, a number of tyrosine 


residues in the IRS may be phosphorylated and subsequently serve as binding sites for other proteins, including an adaptor protein Grb2, a lipid kinase PI3K, 
or other signaling proteins. Grb2 activation leads to activation of the Ras pathway and gene transcription, while PI3K activation leads to PIP3 production. 
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960, which then forms a binding site for the phosphotyrosine bind- 
ing (PTB) domains of insulin receptor substrates. As indicated in 
Figure 15.24c, IRSs are characterized by the presence of an N-terminal 
PH domain, a PTB domain, and a long tail containing tyrosine phos- 
phorylation sites. The PH domain may interact with phospholipids 
present at the inside leaflet of the plasma membrane, the PTB 
domain binds to tyrosine phosphorylation sites on the activated 
receptor, and the tyrosine phosphorylation sites provide binding 
sites for SH2 domain-containing signaling proteins. At least four 
members of the IRS family have been identified. Based on the results 
obtained in knockout experiments in mice, it is thought that IRS-1 
and IRS-2 are most relevant to insulin-receptor signaling. 

Autophosphorylation of the activated insulin receptor at Tyr960 
provides a binding site for IRS-1 or IRS-2. Only after stable associa- 
tion with either IRS-1 or IRS-2 is the activated insulin receptor able 
to phosphorylate tyrosine residues present on these adaptor proteins 
(Figure 15.24c). Both IRS-1 and IRS-2 contain a large number of 
potential tyrosine phosphorylation sites that include binding sites 
for the SH2 domains of downstream signalling proteins, including 
PI 3-kinase, Grb2, and Shp2. 

PI 3-kinase (PI3K ) is composed of two subunits, one containing 
two SH2 domains and the other containing the catalytic domain 
(FIGURE 15.25). PI3K, which is activated directly as a consequence of 
binding of its two SH2 domains to tyrosine phosphorylation sites, 
phosphorylates phosphoinositides at the 3 position of the inositol 
ring. The products of this enzyme, which include PI 3,4-bisphosphate 
(PI(3,4)P,) and PI 3,4,5-trisphosphate (PIP;), remain in the cyto- 
solic leaflet of the plasma membrane where they provide binding 
sites for PH domain-containing signaling proteins such as the 


Plasma membrane 


PH domain 


Glycogen synthesis 


FIGURE 15.25 The role of PI 3-kinase in activating a variety of 
signaling pathways. Activation of PI3K leads to the formation of 
membrane-bound phosphoinositides, including PIP3. One of the key 
kinases in numerous signaling pathways is PKB (AKT), which interacts at the 
plasma membrane with PIP; by means of a PH domain. This interaction 
changes the conformation of PKB, making it a substrate for another PIP3- 
bound kinase (PDK1), which phosphorylates PKB. The second phosphate 
shown linked to PKB is added by a second kinase, mostly likely mTOR. 
Once activated, PKB dissociates from the plasma membrane and moves 
into the cytosol and nucleus. PKB is a major component of a number of 
separate signaling pathways that mediate the insulin response. These 
pathways lead to translocation of glucose transporters to the plasma 
membrane, synthesis of glycogen, and the synthesis of new proteins in the 
cell. PKB also plays a key role in promoting cell survival by inhibiting the 
proapoptotic protein Bad (page 623) and/or activating the transcription 
factor NF-kB (page 625). 


serine-threonine kinases PKB and PDK1. As indicated in Figure 
15.25, PKB (more commonly known as AKT) plays a role in mediat- 
ing the response to insulin, as well as to other extracellular signals. 
Recruitment of PDK1 to the plasma membrane, in close proximity 
to PKB, provides a setting in which PDK1 can phosphorylate and 
activate the Ser/Thr kinase activity of PKB (Figure 15.25). While 
phosphorylation by PDK1 is essential, it is not sufficient for activa- 
tion of PKB. Activation of PKB also depends on phosphorylation by 
a second kinase, mTOR, which has a crucial role in regulating 
numerous cellular activities. PI3K signaling is terminated by removal 
of the phosphate at the 3-position on the inositol ring by the lipid 
phosphatase PTEN (Figure 15.25). 


Glucose Transport 


PKB is directly involved in regulating glucose transport and glycogen 
synthesis. The glucose transporter GLUT4 carries out insulin- 
dependent glucose transport from the blood (page 152). In the 
absence of insulin, GLUT4 is trapped in membrane vesicles of insulin 
responsive cells (FIGURE 15.26). Recent findings suggest that these 
vesicles are tethered to the cis-Golgi network by a protein called TUG. 
TUG is cleaved in response to insulin, causing the GLUT4 vesicles to 
fuse with the plasma membrane, a process that is referred to as GLUT4 
translocation. The increase in numbers of glucose transporters in the 
plasma membrane leads to increased glucose uptake (Figure 15.26). 
Excess glucose that is taken up by muscle and liver cells is stored 
in the form of glycogen. Glycogen synthesis is carried out by glycogen 
synthase, an enzyme that is turned off by phosphorylation on serine 
and threonine residues. Glycogen synthase kinase-3 (GSK-3) has 
been identified as a negative regulator of glycogen synthase. GSK-3, 
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FIGURE 15.26 Regulation of glucose uptake in muscle and fat cells by 
insulin. Glucose transporters are stored in the walls of cytoplasmic vesicles 
that form by budding from the plasma membrane (endocytosis). When the 
insulin level increases, a signal is transmitted through the IRS-PI3K-PKB 
pathway, which triggers the translocation of cytoplasmic vesicles to the cell 
periphery. The vesicles fuse with the plasma membrane (exocytosis), 
delivering the transporters to the cell surface where they can mediate 
glucose uptake. A second pathway leading from the insulin receptor to 
GLUT4 translocation is not shown (see Trends Biochem. Sci. 31:215, 2006). 


Source: D. Voet and J. G. Voet, Biochemistry, 2e; copyright 1995, John 
Wiley & Sons, Inc. Reprinted by permission of John Wiley & Sons, Inc. 


in turn, is inactivated following phosphorylation by PKB. Thus, 
activation of the PI 3-kinase-PKB pathway in response to insulin leads 
to a decrease in GSK-3 kinase activity, resulting in an increase in glyco- 
gen synthase activity (Figure 15.25c). Activation of protein phos- 
phatase 1, an enzyme known to dephosphorylate glycogen synthase, 
contributes further to glycogen synthase activation (Figure 15.14). 


Diabetes Mellitus 


One of the most common human diseases, diabetes mellitus, is 
caused by defects in insulin signaling. Diabetes occurs in two varie- 
ties: type 1, which accounts for 5-10 percent of the cases, and type 2, 
which accounts for the remaining 90-95 percent. Type 1 diabetes is 
caused by an inability to produce insulin and is discussed in the 
Human Perspective in 17.3. Type 2 diabetes is a more complex dis- 
ease whose incidence is increasing around the world at an alarming 
rate. The rising incidence of the disease is most likely a result of 
changing lifestyle and eating habits. A high-calorie diet combined 
with a sedentary lifestyle is thought to lead to a chronic increase in 
insulin secretion. Elevated levels of insulin overstimulate target cells 
in the liver and elsewhere in the body, which leads to a condition 
referred to as insulin resistance, in which these target cells stop 
responding to the presence of the hormone. This is turn leads to a 
chronic elevation in blood glucose levels, which stimulates the 
pancreas to secrete even more insulin, setting up a vicious cycle that 
can ultimately lead to the death of the insulin-secreting beta cells of 
the pancreas. Most of the health risks that result from diabetes— 
cardiovascular disease, blindness, kidney disease, and reduced cir- 
culation in the limbs leading to amputations—are thought to be due 
to damage to the body’s blood vessels, but the molecular mechanism 
by which insulin resistance and its consequent metabolic effects lead 
to this condition remain the subject of debate. 


REVIEW | 


1. Describe the steps between the binding of an insulin 
molecule at the surface of a target cell and the 
activation of the effector PI3K. How does the action 
of insulin differ from other ligands that act by means 
of receptor tyrosine kinases? 


2. What is the relationship between type 2 diabetes and 
insulin production? How is it that a drug that increases 


insulin sensitivity might help treat this disease? 
oe 


15.13 Signaling Pathways in Plants 


Plants and animals share certain basic signaling mechanisms, includ- 
ing the use of Ca** and phosphoinositide messengers, but other path- 
ways are unique to each major kingdom. For example, cyclic 
nucleotides, which may be the most ubiquitous animal cell messen- 
gers, appear to play little, if any, role in plant cell signaling. Receptor 
tyrosine kinases are also lacking in plant cells. On the other hand, 
plants contain a type of protein kinase that is absent from animal cells. 

It has long been known that bacterial cells have a protein kinase 
that phosphorylates histidine residues and mediates the cell’s 
response to a variety of environmental signals. Until 1993, these 
enzymes were thought to be restricted to bacterial cells but were 
then discovered in both yeast and flowering plants. In both types of 


eukaryotes, the enzymes are transmembrane proteins with an extra- 613 


cellular domain that acts as a receptor for external stimuli and a 
cytoplasmic, histidine kinase domain that transmits the signal to the 
cytoplasm. One of the best studied of these plant proteins is encoded 
by the Etr1 gene. The product of the Etrl gene encodes a receptor for 
the gas ethylene (C,H,), a plant hormone that regulates a diverse 
array of developmental processes, including seed germination, flow- 
ering, and fruit ripening. Binding of ethylene to its receptor leads to 
transmission of signals along a pathway that is very similar to the 
MAP kinase cascade found in yeast and animal cells. As in other 
eukaryotes, the downstream targets of the MAP kinase pathway in 
plants are transcription factors that activate expression of specific 
genes encoding proteins required for the hormone response. As 
researchers analyze the massive amount of data obtained from 
sequencing Arabidopsis and other plant genomes, the similarities 
and differences between plant and animal signaling pathways should 
become more apparent. 


REVIEW | 


1. What types of signalling proteins are common in 
animals but absent from animal cells, and vice versa? 


15.14 The Role of Calcium 
as an Intracellular Messenger 


Calcium ions play a significant role in a remarkable variety of cellu- 
lar activities, including muscle contraction, immune responses, cell 
division, secretion, fertilization, synaptic transmission, metabolism, 
transcription, cell movement, and cell death. In each of these cases, 
an extracellular message is received at the cell surface and leads to a 
dramatic increase in concentration of calcium ions within the cyto- 
sol. The concentration of calcium ions in a particular cellular com- 
partment is controlled by the regulated activity of Ca** pumps, Ca”* 
exchangers, and/or Ca”* ion channels located within the membranes 
that surround the compartment (as in Figure 15.28). The concentra- 
tion of Ca* ions in the cytosol of a resting cell is maintained at very 
low levels, typically about 10~’ M. In contrast, the concentration of 
this ion in the extracellular space or within the lumen of the ER or a 
plant cell vacuole is typically 10,000 times higher than the cytosol. 
The cytosolic calcium level is kept very low because (1) Ca”* ion 
channels in both the plasma and ER membranes are normally kept 
closed, making these membranes highly impermeable to this ion, 
and (2) energy-driven Ca”* transport systems of the plasma and ER 
membranes pump calcium out of the cytosol.” Abnormal elevation 
of cytosolic Ca”* concentration, as can occur in brain cells following 
a stroke, can lead to massive cell death. 


IP; and Voltage-Gated Ca?* Channels 


We have described in previous pages two major types of signaling 
receptors, GPCRs and RTKs. It was noted on page 597 that interac- 
tion of an extracellular messenger molecule with a GPCR can lead 
to the activation of the enzyme phospholipase C-f, which splits the 


Mitochondria also play an important role in sequestering and releasing Ca** ions, 
but their role was considered in Chapter 5 and will not be discussed here. 
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614 phosphoinositide PIP, to release the molecule IP3, which opens 
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calcium channels in the ER membrane, leading to a rise in cytosolic 
[Ca**]. Extracellular messengers that signal through RTKs can trig- 
ger a similar response. The primary difference is that RTKs activate 
members of the phospholipase C-y subfamily, which possess an 
SH2 domain that allows them to bind to the activated, phosphoryl- 
ated RTK. There are numerous other PLC isoforms. For example, 
PLCS is actived by Ca”* ions, and PLC is activated by Ras-GTP. All 
PLC isoforms carry out the same reaction, producing IP; and link- 
ing a multitude of cell surface receptors to an increase in cytoplas- 
mic Ca”*. There is another major route leading to elevation of 
cytosolic [Ca**], which was encountered in our discussion of syn- 
aptic transmission in Section 4.18. In this case, a nerve impulse 
leads to a depolarization of the plasma membrane, which triggers 
the opening of voltage-gated calcium channels in the plasma mem- 
brane, allowing the influx of Ca** ions from the extracellular 
medium. 


Visualizing Cytoplasmic Ca?* 
Concentration in Living Cells 


Our understanding of the role of Ca”* ions in cellular responses has 
been greatly advanced by the development of indicator molecules 
that emit light in the presence of free calcium. In the mid-1980s, new 
types of highly sensitive, fluorescent, calcium-binding compounds 
(e.g., fura-2) were developed in the laboratory of Roger Tsien at the 
University of California, San Diego. These compounds are synthe- 
sized in a form that can enter a cell by diffusing across its plasma 
membrane. Once inside a cell, the compound is modified to a form 
that is unable to leave the cell. Using these probes, the concentration 
of free calcium ions in different parts of a living cell can be deter- 
mined over time by monitoring the light emitted using a fluores- 
cence microscope and computerized imaging techniques. Use of 
calcium-sensitive, light-emitting molecules has provided dramatic 
portraits of the complex spatial and temporal changes in free cyto- 
solic calcium concentration that occur in a single cell in response to 
various types of stimuli. This is one of the advantages of studying 
calcium-mediated responses compared to responses mediated by 
other types of messengers whose location in a cell cannot be readily 
visualized. 

Depending on the type of responding cell, a particular stimulus 
may induce repetitive oscillations in the concentration of free cal- 
cium ions, as seen in Figure 15.11; cause a wave of Ca’* release that 
spreads from one end of the cell to the other (see Figure 15.29); or 
trigger a localized and transient release of Ca** in one part of the cell. 
FIGURE 15.27 shows a Purkinje cell, a type of neuron in the mam- 
malian cerebellum that maintains synaptic contact with thousands 
of other cells through an elaborate network of postsynaptic den- 
drites. The micrograph in Figure 15.27 shows the release of free cal- 
cium in a localized region of the “dendritic tree” of the cell following 
synaptic activation. The burst of calcium release remains restricted 
to this region of the cell. 

IP; receptors described earlier are one of two main types of Ca** 
ion channels present in the ER membrane; the other type are called 
ryanodine receptors (RyRs) because they bind the toxic plant alkaloid 
ryanodine. Ryanodine receptors are found primarily in excitable 
cells and are best studied in cardiac and skeletal muscle cells, where 
they mediate the rise in Ca** levels following the arrival of an action 
potential. Mutations in the cardiac RyR isoform have been linked to 


ee 


FIGURE 15.27 Experimental demonstration of localized release of 
intracellular Ca?* within a single dendrite of a neuron. The mechanism of 
IP3-mediated release of Ca** from intracellular stores was described in 
Figure 15.10. In the micrograph shown here, which pictures an enormously 
complex Purkinje cell (neuron) of the cerebellum, calcium ions have been 
released locally within a small portion of the complex “dendritic tree.” 
Calcium release from the ER (shown in red) was induced in the dendrite 
following the local production of IP3, which followed repetitive activation of 
a nearby synapse. The sites of release of cytosolic Ca2* ions are revealed by 
fluorescence from a fluorescent calcium indicator that was loaded into the 
cell prior to stimulation of the cell. 

Source: From Elizabeth A. Finch and George J. Augustine, Nature, 

vol. 396, cover of 12/24/98. Reprinted by permission from Macmillan 
Publishers Limited. 


occurrences of sudden death during periods of exercise. Depending 
on the type of cell in which they are found, RyRs can be opened by a 
variety of agents, including calcium itself. The influx of a limited 
amount of calcium through open channels in the plasma membrane 
induces the opening of ryanodine receptors in the ER, causing the 
release of Ca”* into the cytosol (FIGURE 15.28). This phenomenon is 
called calcium-induced calcium release (CICR). 

Extracellular signals that are transmitted by Ca”* ions typically 
act by opening a small number of Ca** ion channels at the cell sur- 
face at the site of the stimulus. As Ca™* ions rush through these chan- 
nels and enter the cytosol, they act on nearby Ca** ion channels in 
the ER, causing these channels to open and release additional cal- 
cium into adjacent regions of the cytosol. In some responses, the 
elevation of Ca** levels remains localized to a small region of the 
cytosol (as in Figure 15.27). In other cases, a propagated wave of cal- 
cium release spreads through the entire cytoplasmic compartment. 


High [Ca2*] with the plasma membrane of the egg (FIGURE 15.29). The sudden 615 


‘ a ‘ ° (O rise in cytoplasmic calcium concentration following fertilization 

©» * ° oo triggers a number of events, including activation of cyclin-dependent 

eCat O° ° : 3Nat ICa2+ kinases (Section 14.4) that drive the zygote toward its first mitotic 

— © 0 ~ . division. Calcium waves are transient because the ions are rapidly 
Voltage-gated © o : . Nat /Ca22+ Pumped out of the cytosol and back into the ER and/or the extracel- 


° exchanger lular space. 
Recent research in the field of calcium signaling has focused 


on a phenomenon known as store-operated calcium entry (or 
SOCE), in which the “store” refers to the calcium ions stored in the 
ER. During periods of repeated cellular responses, the stockpile of 
intracellular calcium ions can become depleted. During SOCE the 
depletion of calcium levels in the ER triggers a response leading to 
the opening of calcium channels in the plasma membrane as 
depicted in FIGURE 15.30. Once these channels have opened, Ca”* 
ions can enter the cytosol from where they can be pumped back 
into the ER, thereby replenishing the ER’s calcium stores. The 
mechanism responsible for SOCE had been an unsolved mystery 
for many years until it was discovered that these events are orches- 
trated by a signaling system operating between the ER and plasma 
membrane. In this system, the depletion of Ca”* in the ER leads to 
the clustering within the ER membrane of a Ca**-sensing protein 
called STIM1 into regions where the ER and plasma membranes 
come into close proximity (25—50nm) to one another. Following 
their rearrangement in the ER membrane, the STIM1 clusters act to 
recruit subunits of a plasma membrane protein called Orail into 
adjacent regions of the plasma membrane (Figure 15.30). Orail is a 


Ca2* channel 
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SER Ca2*Pump tetrameric Ca** ion channel that had been identified as being 
membrane of the SER involved in a particular type of inherited human immune defi- 
ciency that results from a lack of Ca** stores in T lymphocytes. 

FIGURE 15.28 Calcium-induced calcium release, as it occurs in a Contact between the cytosolic surfaces of the STIM1 and Orail 


cardiac muscle cell. A depolarization in membrane voltage causes the proteins in these ER-plasma membrane junctions leads to the 


opening of the Orail channels, the influx of Ca?* into microdo- 
mains of the cytosol near the STIM1 clusters, and the refilling of 
the cell’s ER stores. 


opening of voltage-gated calcium channels in the plasma membrane, 
allowing entry of a small amount of Ca”* into the cytosol (step 1). The 
calcium ions bind to ryanodine receptors in the SER membrane (step 2), 
leading to release of stored Ca** into the cytosol (step 3), which triggers 
the cell’s contraction. The calcium ions are subsequently removed from 
the cytosol by the action of Ca** pumps located in the membrane of the 
SER (step 4) and a Na*/Ca®* secondary transport system in the plasma 
membrane (step 5), which leads to relaxation. This cycle is repeated Ca?*-Binding Proteins 
after each heartbeat. 
Source: Michael Berride, Nature 361:317, 1993, Reprinted by permission Unlike cAMP, whose action is usually mediated by stimulation of a 
from Macmillan Publishers Ltd. protein kinase, calcium can affect a number of different types of 
cellular effectors, including protein kinases (Table 15.4). Depending 
One of the most dramatic Ca** waves occurs within the first minute _ on the cell type, calcium ions can activate or inhibit various enzyme 
or so following fertilization and is induced by the sperm’s contact and transport systems, change the ionic permeability of membranes, 


15 pum 


FIGURE 15.29 Calcium wave in a starfish egg induced by a fertilizing sperm. The unfertilized egg was injected with a calcium-sensitive fluorescent 
dye, fertilized, and photographed at 10-second intervals. The rise in Ca** concentration is seen to spread from the point of sperm entry (arrow) throughout 
the entire egg. The blue color indicates low free [Ca?*], whereas the red color indicates high free [Ca2*]. A similar Ca2* wave in mammalian eggs is triggered 
by the formation of IP3 by a phospholipase C that is brought into the egg by the fertilizing sperm. 


Source: Courtesy of Stephen A. Stricker. 
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FIGURE 15.30 A model for store-operated calcium entry. When the ER 
lumen contains abundant Ca’* ions, the STIM1 proteins of the ER 
membrane and the Orail proteins of the plasma membrane are situated 
diffusely in their respective membranes, and the Orai1 calcium channel is 
closed. If the ER stores are depleted, a signaling system operates between 
the two membranes, causing the two proteins to become clustered within 
their respective membranes in close proximity to one another. Apparent 
interaction between the two membrane proteins leads to opening of the 
Orai1 channel and the influx of Ca?* ions into the cytosol from where they 
can be pumped into the ER lumen. 


TABLE 15.4 Examples of Mammalian Proteins Activated by Ca?* 


Protein 


Protein function 


Troponin C 
Calmodulin 


Calretinin, retinin 
Calcineurin B 
Calpain 
PI-specific PLC 
a-Actinin 
Annexin 


Phospholipase A; 
Protein kinase C 
Gelsolin 

IP; receptor 
Ryanodine receptor 
Na*/Ca** exchanger 


Ca**-ATPase 
Ca** antiporters 
Caldesmon 
Villin 

Arrestin 
Calsequestrin 


Modulator of muscle contraction 

Ubiquitous modulator of protein kinases and other 
enzymes (MLCK, CaM kinase II, adenylyl cyclase I) 

Activator of guanylyl cylase 

Phosphatase 

Protease 

Generator of IP; and diacylglycerol 

Actin-bundling protein 

Implicated in endo- and exocytosis, inhibition 
of PLA, 

Producer of arachidonic acid 

Ubiquitous protein kinase 

Actin-severing protein 

Effector of intracellular Ca”* release 

Effector of intracellular Ca”* release 

Effector of the exchange of Ca** for Na* across the 
plasma membrane 

Pumps Ca** across membranes 

Exchanger of Ca** for monovalent ions 


Regulator of muscle contraction 
Actin organizer 

Terminator of photoreceptor response 
Ca** buffer 


Adapted from D. E. Clapham, Cell 80:260, 1995, by copyright permission of Cell 
Press. Cell by Cell Press. Reproduced with permission of Cell Press in the format 
reuse in a book/textbook via Copyright Clearance Center. 


induce membrane fusion, or alter cytoskeletal structure and function. 
Calcium does not bring about these responses by itself but acts 
in conjunction with a number of calcium-binding proteins 
(examples are discussed on pages 291 and 351). The best-studied 
calcium-binding protein is calmodulin, which participates in many 
signaling pathways. 

Calmodulin is found universally in plants, animals, and eukary- 
otic microorganisms, and it has virtually the same amino acid 
sequence from one end of the eukaryotic spectrum to the other. 
Each molecule of calmodulin (FIGURE 15.31) contains four binding 
sites for calcium. Calmodulin does not have sufficient affinity for 
Ca** to bind the ion in a nonstimulated cell. If, however, the Ca”* 
concentration rises in response to a stimulus, the ions bind to calmo- 
dulin, changing the conformation of the protein and increasing its 
affinity for a variety of effectors. Depending on the cell type, the 
calcium-calmodulin (Ca?*-CaM) complex may bind to a protein 
kinase, a cyclic nucleotide phosphodiesterase, ion channels, or even 
to the calcium-transport system of the plasma membrane. In the lat- 
ter instance, rising levels of calcium activate the system responsible 
for ridding the cell of excess quantities of the ion, thus constituting a 
self-regulatory mechanism for maintaining low intracellular calcium 
concentrations. The Ca”*-CaM complex can also stimulate gene 
transcription through activation of various protein kinases (CaMKs) 
that phosphorylate transcription factors. In the best-studied case, 
one of these protein kinases phosphorylates CREB on the same 
serine residue as PKA (Figure 15.14). 


FIGURE 15.31 Calmodulin. A ribbon diagram of calmodulin (CaM) with 
four bound calcium ions (white spheres). Binding of these Ca* ions changes 
the conformation of calmodulin, exposing a hydrophobic surface that 
promotes interaction of Ca**-CaM with a large number of target proteins. 


Source: Courtesy of Michael Carson, University of Alabama at Birmingham. 
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FIGURE 15.32 A simplified model of the role of Ca?* in guard cell closure. (a) Photograph of stomatal pores, each flanked by a pair of guard cells. The 
stomata are kept open as turgor pressure is kept high within the guard cells, causing them to bulge outward as seen here. (b) One of the factors controlling 
stomatal pore size is the hormone abscisic acid (ABA). When ABA levels rise, calcium ion channels in the plasma membrane are opened, allowing the influx 
of Ca?* (step 1), which triggers the release of Ca?* from internal stores (step 2). The subsequent elevation of intracellular [Ca?*] closes K* influx channels 
(step 3a) and opens K* and anion efflux channels (step 3b). These ion movements lead to a drop in internal solute concentration and the osmotic loss of 


water (step 4). (Phosphorylation by protein kinases also plays a role in these events.) 


Source: Dr. Jeremy Burgess/photo Researchers, Inc. 


Regulating Calcium Concentrations 
in Plant Cells 


Calcium ions (acting in conjunction with calmodulin) are important 
intracellular messengers in plant cells. The levels of cytosolic cal- 
cium change dramatically within certain plant cells in response to a 
variety of stimuli, including changes in light, pressure, gravity, and 
the concentration of plant hormones such as abscisic acid. The con- 
centration of Ca** in the cytosol of a resting plant cell is kept very 
low by the action of transport proteins situated in the plasma mem- 
brane and vacuolar membrane (tonoplast). 

The role of Ca”* in plant cell signaling is illustrated by guard 
cells that regulate the diameter of the microscopic pores (stomata) 
of a leaf (FIGURE 15.32a). Stomata are a major site of water loss in 
plants, and the diameter of their aperture is tightly controlled, 
which prevents dehydration. The diameter of the stomatal pore 
decreases as the fluid (turgor) pressure in the guard cell decreases. 
The drop in turgor pressure is caused in turn by a decrease in the 
ionic concentration (osmolarity) of the guard cell. Adverse condi- 
tions, such as high temperatures and low humidity, stimulate the 
release of the plant stress hormone abscisic acid. Studies suggest 
that abscisic acid binds to a GPCR in the plasma membrane of 
guard cells, triggering the opening of Ca** ion channels in the same 
membrane (Figure 15.32b). The resulting influx of Ca’* into the 
cytosol triggers the release of additional Ca** from intracellular 
stores. The elevated cytosolic Ca** concentration leads to closure of 
K" influx channels in the plasma membrane and opening of both 
K and anion efflux channels. These changes produce a net outflow 
of K* ions and anions (NO; and Cl) and a resulting decrease in 
turgor pressure. 


REVIEW | 


1. How is the [Ca?*] of the cytosol maintained at such a 
low level? How does the concentration change in 
response to stimuli? 


2. What is the role of calcium-binding proteins such as 
calmodulin in eliciting a response? 


3. Describe the role of calcium in mediating the 
diameter of stomata in guard cells. 


e—______ 


15.15 Convergence, Divergence, 
and Cross-Talk among Different 
Signaling Pathways 


The signaling pathways described above and illustrated schemati- 
cally in the various figures depict linear pathways leading directly 
from a receptor at the cell surface to an end target. In actual fact, 
signaling pathways in the cell are much more complex. For example: 


e Signals from a variety of unrelated receptors, each binding to its 
own ligand, can converge to activate a common effector, such as 
Ras or Raf. 

e Signals from the same ligand, such as EGF or insulin, can diverge 
to activate a variety of different effectors and pathways, leading 
to diverse cellular responses. 

e Signals can be passed back and forth between different pathways, 
a phenomenon known as cross-talk. 


617 
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618 These characteristics of cell-signaling pathways are illustrated sche- 
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matically in FIGURE 15.33. 

Signaling pathways provide a mechanism for routing informa- 
tion through a cell, not unlike the way the central nervous system 
routes information to and from the various organs of the body. Just as 
the central nervous system collects information about the environ- 
ment from various sense organs, the cell receives information about 
its environment through the activation of various surface receptors, 
which act like sensors to detect extracellular stimuli. Like sense organs 
that are sensitive to specific forms of stimuli (e.g., light, pressure, or 
sound waves), cell-surface receptors can bind only to specific ligands 
and are unaffected by the presence of a large variety of unrelated mol- 
ecules. A single cell may have dozens of different receptors sending 
signals to the cell interior simultaneously. Once they have been trans- 
mitted into the cell, signals from these receptors can be selectively 
routed along a number of different signaling pathways that may cause 
a cell to divide, change shape, activate a particular metabolic pathway, 
or even commit suicide (discussed in a following section). In this way, 
the cell integrates information arriving from different sources and 
mounts an appropriate and comprehensive response. 

Different signaling pathways are often interconnected. As a 
result, signals from a variety of unrelated ligands can converge to 
activate a common effector, such as Ras; signals from the same 
ligand can diverge to activate a variety of different effectors; and sig- 
nals can be passed back and forth between different pathways (cross- 
talk). The following are examples of these types of interactions. 


1. Convergence. We have discussed two distinct types of cell-surface 
receptors in this chapter: G protein-coupled receptors and recep- 
tor tyrosine kinases. Another type of cell-surface receptor that is 
capable of signal transduction was discussed in Chapter 7, 
namely, integrins. Although these three types of receptors may 


G Protein-coupled receptors 


Acetylcholine, histamine 
NA, 5-HT, ATP, PAF, TXAz, 
Glutamate, Angiotensin Il, 
Vasopressin, Bradykinin, 
Substance P, Bombesin, 
Neuropeptide Y, Thrombin, 
Cholecystokinin, Endothelin, 
Neuromedin, TRH, GnRH, 
PTH 

Odorants, Light 


Tyrosine kinase-inked receptors 


bind to very different ligands, all of them can lead to the forma- 
tion of phosphotyrosine docking sites for the SH2 domain of the 
adaptor protein Grb2 in close proximity to the plaama membrane 
(FIGURE 15.34). The recruitment of the Grb2-Sos complex results 
in the activation of Ras and transmission of signals down the 
MAP kinase pathway. As a result of this convergence, signals 
from diverse receptors can lead to the transcription and translation 
of a similar set of growth-promoting genes in each target cell. 


2. Divergence. Evidence of signal divergence has been evident in 
virtually all of the examples of signal transduction that have been 
described in this chapter. A quick look at Figure 15.15 or 15.250, c 
illustrates how a single stimulus—a ligand binding to a GPCR or 
an insulin receptor—sends signals out along a variety of different 
pathways. 


3. Cross-talk. In previous sections, we examined a number of sign- 
aling pathways as if each operated as an independent, linear 
chain of events. In fact, the information circuits that operate in 
cells are more likely to resemble an interconnected web in which 
components produced in one pathway can participate in events 
occurring in other pathways. The more that is learned about 
information signaling in cells, the more cross-talk between sign- 
aling pathways that is discovered. Rather than attempting to cata- 
log the ways that information can be passed back and forth 
within a cell, we will look at an example involving cAMP that 
illustrates the importance of this type of cross-talk. 


Cyclic AMP was depicted earlier as an initiator of a reaction 
cascade leading to glucose mobilization. However, cAMP can also 
inhibit the growth of a variety of cells, including fibroblasts and fat 
cells, by blocking signals transmitted through the MAP kinase cas- 
cade. Cyclic AMP is thought to accomplish this by activating PKA, 
the cAMP-dependent kinase, which can phosphorylate and inhibit 
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FIGURE 15.33 Examples of convergence, divergence, and cross-talk among various signal-transduction pathways. This drawing shows the outlines of 
signal-transduction pathways initiated by receptors that act by means of both heterotrimeric G proteins and receptor protein-tyrosine kinases. The two are 
seen to converge by the activation of different phospholipase C isoforms, both of which lead to the production of the same second messengers (IP3 and 
DAG). Activation of the RTK by either PDGF or EGF leads to the transmission of signals along three different pathways, an example of divergence. Cross-talk 
between the two types of pathways is illustrated by calcium ions, which are released from the SER by action of IP3 and can then act on various proteins, 


including protein kinase C (PKC), whose activity is also stimulated by DAG. 


Source: M. J. Berridge, reprinted with permission from Nature Vol. 361, p. 315, 1993, copyright 1993. Nature by Nature Publishing Group. Reproduced with 
permission of Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 
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FIGURE 15.34 Signals transmitted from a G protein-coupled receptor, 
an integrin, and a receptor tyrosine kinase all converge on Ras and are 
then transmitted along the MAP kinase cascade. 


Raf, the protein that heads the MAP kinase cascade (FIGURE 15.35). 
These two pathways also intersect at another important signaling 
effector, the transcription factor CREB. CREB was described on 
page 600 as a terminal effector of cAMP-mediated pathways. It was 
assumed for years that CREB could only be phosphorylated by the 
cAMP-dependent kinase, PKA. It has since become apparent that 
CREB is a substrate of a much wider range of kinases. For example, 
one of the kinases that phosphorylates CREB is Rsk-2, which is 
activated as a result of phosphorylation by MAPK (Figure 15.35). In 
fact, both PKA and Rsk-2 phosphorylate CREB on precisely the same 
amino acid residue, Ser133, which should endow the transcription 
factor with the same potential in both pathways. 

A major unanswered question is raised by these examples of 
convergence, divergence, and cross-talk: How are different stimuli 
able to evoke distinct responses, even though they utilize similar 
pathways? PI3K, for example, is an enzyme that is activated by a 
remarkable variety of stimuli, including cell adhesion to the ECM, 
insulin, and EGE How is it that activation of PI3K in an insulin- 
stimulated liver cell promotes GLUT4 translocation and protein syn- 
thesis, whereas activation of PI3K in an adherent epithelial cell 
promotes cell survival? Ultimately, these contrasting cellular 
responses must be due to differences in the protein composition of 
different cell types. Part of the answer probably lies in the fact that 
different cells have different isoforms of these various proteins, 
including PI3K. Some of these isoforms are encoded by different, 
but related genes, whereas others are generated by alternative 
splicing (Section 12.18), or other mechanisms. Different isoforms of 
PI3K, PKB, or PLC, for example, may bind to different sets of 
upstream and downstream components, which could allow similar 
pathways to evoke distinct responses. The variation in responses elic- 
ited by different cells possessing similar signaling proteins may also 
be partly explained by the presence of different protein scaffolds in 
each of the cell types. As shown in Figure 15.23, the specificity of a 
response can be orchestrated by the scaffolds with which the 
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FIGURE 15.35 An example of cross-talk between two major signaling 
pathways. Cyclic AMP acts in some cells, by means of the cAMP-dependent 
kinase PKA, to block the transmission of signals from Ras to Raf, which 
inhibits the activation of the MAP kinase cascade. In addition, both PKA and 
the kinases of the MAP kinase cascade phosphorylate the transcription 
factor CREB on the same serine residue, activating the transcription factor 
and allowing it to bind to specific sites on the DNA. 


signaling proteins can interact. But it isn't likely that variations in 
isoforms and scaffolds can fully explain the extraordinary diversity 
of cellular responses any more than differences in the structures of 
neurons can explain the range of responses evoked by the nervous 
system. Hopefully, as the signaling pathways of more and more cells 
are described, we will gain a better understanding of the specificity 
that can be achieved through the use of similar signaling molecules. 


REVIEW | 


1. What is meant by the term cross-talk in protein 


signaling? What about convergence and divergence? 
oe_____——_ 


15.16 The Role of NO as 
an Intercellular Messenger 


During the 1980s, a new type of messenger was discovered that was 
neither an organic compound, such as cAMP, nor an ion, such as 
Ca**; it was an inorganic gas—nitric oxide (NO). NO is unusual 
because it acts both as an extracellular messenger, mediating inter- 
cellular communication, and as a second messenger, acting within 
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L-arginine in a reaction that requires oxygen and NADPH and that 
is catalyzed by the enzyme nitric oxide synthase (NOS). Since its 
discovery, it has become evident that NO is involved in a myriad of 
biological processes including anticoagulation, neurotransmission, 
smooth muscle relaxation, and visual perception. 

As with many other biological phenomena, the discovery that 
NO functions as a messenger molecule began with an accidental 
observation. It had been known for many years that acetylcholine 
acts in the body to relax the smooth muscle cells of blood vessels, but 
the response could not be duplicated in vitro. When portions of a 
major blood vessel such as the aorta were incubated in physiologic 
concentrations of acetylcholine in vitro, the preparation usually 
showed little or no response. In the late 1970s, Robert Furchgott, a 
pharmacologist at a New York State medical center, was studying the 
in vitro response of pieces of rabbit aorta to various agents. In his 
earlier studies, Furchgott used strips of aorta that had been dissected 
from the organ. For technical reasons, Furchgott switched from strips 
of aortic tissue to aortic rings and discovered that the new prepara- 
tions responded to acetylcholine by undergoing relaxation. Further 
investigation revealed that the strips had failed to display the relaxa- 
tion response because the delicate endothelial layer that lines the 
aorta had been rubbed away during the dissection. This surprising 
finding suggested that the endothelial cells were somehow involved 
in the response by the adjacent muscle cells. In subsequent studies, it 
was found that acetylcholine binds to receptors on the surface of 
endothelial cells, leading to the production and release of an agent 
that diffuses through the cell’s plasma membrane and causes the 
muscle cells to relax. The diffusible relaxing agent was identified in 
1986 as nitric oxide by Louis Ignarro at UCLA and Salvador Moncada 
at the Wellcome Research Labs in England. The steps in the acetyl- 
choline-induced relaxation response are illustrated in FIGURE 15.36. 

The binding of acetylcholine to the outer surface of an endothelial 
cell (step 1, Figure 15.36) signals a rise in cytosolic Ca”* concentration 
(step 2) that activates nitric oxide synthase (step 3). The NO formed 
in the endothelial cell diffuses across the plasma membrane and into 
the adjacent smooth muscle cells (step 4), where it binds and stimu- 
lates guanylyl cyclase (step 5), the enzyme that synthesizes cyclic 
GMP (cGMP), which is an important second messenger similar in 
structure to cAMP. Cyclic GMP binds to a cGMP-dependent protein 
kinase (a PKG), which phosphorylates specific substrates causing 
relaxation of the muscle cell (step 6) and dilation of the blood vessel. 


NO as an Activator of Guanylyl Cyclase 


The discovery that NO acts as an activator of guanylyl cyclase was 
made in the late 1970s by Ferid Murad and colleagues at the 
University of Virginia. Murad was working with azide (N;), a potent 
inhibitor of electron transport, and chanced to discover that the 
molecule stimulated cGMP production in cellular extracts. Murad 
and colleagues ultimately demonstrated that azide was being con- 
verted enzymatically into nitric oxide, which served as the actual 
guanylyl cyclase activator. These studies also explained the action of 
nitroglycerine, which had been used since the 1860s to treat the pain 
of angina that results from an inadequate flow of blood to the heart. 
Nitroglycerine is metabolized to nitric oxide, which stimulates the 
relaxation of the smooth muscles lining the blood vessels of the 
heart, increasing blood flow to the organ. The therapeutic benefits of 
nitroglycerine were discovered through an interesting observation. 
Persons with heart disease who worked with nitroglycerine in Alfred 
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FIGURE 15.36 A signal transduction pathway that operates by means 
of NO and cyclic GMP that leads to the dilation of blood vessels. The 
steps illustrated in the figure are described in the text. 

Source: From R. G. Knowles and S. Moncada, Trends Biochem Science 
17:401, 1992. Trends in Biochemical Sciences by International Union of 
Biochemistry reproduced with permission of Elsevier Ltd. in the format 
reuse in a book/textbook via Copyright Clearance Center. 


Nobel’s dynamite factory were found to suffer more from the pain of 
angina on days they weren't at work. It is only fitting that the Nobel 
Prize, which is funded by a donation from Alfred Nobel’s estate, was 
awarded in 1998 for the discovery of NO as a signaling agent. 


Inhibiting Phosphodiesterase 


The discovery of NO as a second messenger has also led to the devel- 
opment of Viagra (sildenafil). During sexual arousal, nerve endings 
in the penis release NO, which causes relaxation of smooth muscle 
cells in the lining of penile blood vessels and engorgement of the 
organ with blood. As described above, NO mediates this response in 
smooth muscle cells by activation of the enzyme guanylyl cyclase 
and subsequent production of cGMP. Viagra (and related drugs) has 
no effect on the release of NO or the activation of guanylyl cyclase, 
but instead acts as an inhibitor of cGMP phosphodiesterase, the 
enzyme that destroys cGMP. Inhibition of this enzyme leads to 
maintained, elevated levels of cGMP, which promotes the develop- 
ment and maintenance of an erection. Viagra is quite specific for one 
particular isoform of cGMP phosphodiesterase, PDE5, which is the 
version that acts in the penis. Another isoform of the enzyme, PDE3, 
plays a key role in the regulation of heart muscle contraction, but 
fortunately is not inhibited by the drug. Viagra was discovered when 
a potential angina medication had unexpected side effects. 

Recent investigations have revealed that NO has a variety of 
actions within the body that do not involve production of cGMP. 
For example, NO is added to the —SH group of certain cysteine resi- 
dues in well over a hundred proteins, including hemoglobin, Ras, 


ryanodine channels, and caspases. This posttranslational modifica- 
tion, which is called S-nitrosylation, alters the activity, turnover, or 
interactions of the protein. 


REVIEW | 


1. Describe the steps in the signaling pathway by which 


nitric oxide mediates dilation of blood vessels. 
_— 


15.17 Apoptosis (Programmed 
Cell Death) 


Apoptosis, or programmed cell death, is a normal process that is 
unique to animal cells. Apoptosis occurs through an orchestrated 
sequence of events that leads to the death of a cell. Death by apopto- 
sis is a neat, orderly process (FIGURE 15.37) characterized by the 
overall shrinkage in volume of the cell and its nucleus, the loss of 
adhesion to neighboring cells, the formation of blebs at the cell sur- 
face, the dissection of the chromatin into small fragments, and the 
rapid engulfment of the “corpse” by phagocytosis. Apoptosis is often 
contrasted with a different type of cell death called necrosis, which 
generally follows some type of physical trauma or biochemical insult. 
Like apoptosis, necrosis can also occur as a regulated and pro- 
grammed process (called necroptosis), although much less orderly 
in nature. Necrosis is characterized by the swelling of both the cell 
and its internal membranous organelles, membrane breakdown, 
leakage of cell contents into the medium, and the resulting induction 
of inflammation. Because it is a safe and orderly process, apoptosis 
might be compared to the controlled implosion of a building using 
carefully placed explosives as compared to simply blowing up the 
structure without concern for what happens to the flying debris. 
Why do our bodies have unwanted cells, and where do we find 
cells that become targeted for elimination? The short answer is: almost 
anywhere you look. During embryonic development, the earliest 
form of the human hand resembles a paddle without any space 
between the tissues that will become the fingers. The fingers are 
essentially carved out of the paddle via the elimination of the excess 
cells by apoptosis. Three stages of this process as it occurs in mice are 
shown in FIGURE 15.38. T lymphocytes are cells of the immune sys- 
tem that recognize and kill abnormal or pathogen-infected target 
cells. These target cells are recognized by specific receptors that are 
present on the surfaces of T lymphocytes. During embryonic devel- 
opment, T lymphocytes are produced that possess receptors capable 
of binding tightly to proteins present on the surfaces of normal cells 
within the body. T lymphocytes that have this dangerous capability 
are eliminated by apoptosis (see Figure 17.25). Apoptosis does not 
stop with the end of embryonic development. It has been estimated. 
that 10'°-10"' cells in the adult body die every day by apoptosis. For 
example, apoptosis is involved in the elimination of cells that have 
sustained irreparable genomic damage. This is important because 
damage to the genetic blueprint can result in unregulated cell divi- 
sion and the development of cancer. Apoptosis is also responsible for 
the death of cells that are no longer required, such as activated T cells 
that have responded to an infectious agent that has been eliminated. 
Finally, apoptosis appears to be involved in neurodegenerative dis- 
eases such as Alzheimer’s disease, Parkinson’s disease, and 
Huntington's disease. Elimination of essential neurons during dis- 
ease progression gives rise to loss of memory or decrease in motor 


FIGURE 15.37 A comparison of normal and apoptotic cells. (a, b) 
Scanning electron micrographs of a normal (a) and apoptotic (b) T-cell 
hybridoma. The apoptotic cell exhibits many surface blebs that are budded 
off in the cell. Bar equals 4mm. (c) Illustration of the distinctive morphologi- 
cal differences between a cell undergoing apoptosis (upper right) and 
necroptosis. (center). 


Source: (a—b) From S. J. Martin et al., Trends Biochem. Sci. 19:28, 1994. 
Reprinted with permission from Elsevier. 
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FIGURE 15.38 Apoptosis carves out the structure of the mammalian 
digits. Three stages in this process in a mouse embryo. In this particular 
mouse, which is called a MacBlue mouse, all of the embryonic mac- 
rophages express the cyan fluorescent protein. Fluorescent macrophages 
have infiltrated the regions of the footpad where apoptosis has occurred 
and are clearing the space between the digits. 

Source: From David A. Hume, Nature Immunol. 9:13, 2008; © 2008, 
Reprinted by permission from Macmillan Publishers Limited. 


coordination. These examples show that apoptosis is important in 
shaping tissues and organs during embryonic development and in 
maintaining homeostasis within the bodies of adult animals. Serious 
diseases can result from both the failure to carry out apoptosis when 
the elimination of cells (e.g., cancer cells) is appropriate and from the 
overactive induction of apoptosis when the elimination of cells is not 
appropriate (e.g., type 1 diabetes). 

The term apoptosis was coined in 1972 by John Kerr, Andrew 
Wylie, and A. R. Currie of the University of Aberdeen, Scotland, ina 
landmark paper that described for the first time the coordinated 
events that occurred during the programmed death of a wide range 
of cells. Insight into the molecular basis of apoptosis was first 
revealed in studies on the nematode worm C. elegans, whose cells 
can be followed with absolute precision during embryonic develop- 
ment. Of the 1090 cells produced during the development of this 
worm, 131 cells are normally destined to die by apoptosis. In 1986, 
Robert Horvitz and his colleagues at the Massachusetts Institute of 
Technology discovered that worms carrying a mutation in the CED-3 
gene proceed through development without losing any of their cells 
to apoptosis. This finding suggested that the product of the CED-3 
gene played a crucial role in the process of apoptosis in this organ- 
ism. The identification of the CED-3 gene in nematodes led to the 
discovery of a homologous family of proteins in mammals, which 
are now called caspases. Caspases are a distinctive group of 
cysteine proteases (i.e., proteases with a key cysteine residue in their 
catalytic site) that are activated at an early stage of apoptosis and are 
responsible for triggering most, if not all, of the changes observed. 
during cell death. Caspases accomplish this feat by cleaving a select 
group of essential proteins. Among the targets of caspases are the 
following: 


e More than a dozen protein kinases, including focal adhesion 
kinase (FAK ), PKB, PKC, and Raf1. Inactivation of FAK, for 
example, is presumed to disrupt cell adhesion, leading to 
detachment of the apoptotic cell from its neighbors. Inactivation 
of certain other kinases, such as PKB, serves to disrupt 
prosurvival signaling pathways. Caspases also disrupt the 
generation of survival signals by inactivating the NF-kB 
pathway (page 625). 

e Lamins, which make up the inner lining of the nuclear envelope. 


Cleavage of lamins leads to the disassembly of the nuclear lamina 
and shrinkage of the nucleus. 


e Proteins of the cytoskeleton, such as those of intermediate 
filaments, actin, tubulin, and gelsolin. Cleavage and consequent 
inactivation of these proteins lead to changes in cell shape. 


e Anendonuclease called caspase activated DNase (CAD), which is 
activated following caspase cleavage of an inhibitory protein. 
Once activated, CAD translocates from the cytoplasm to the 
nucleus where it attacks DNA, severing it into fragments. 


Recent studies have focused on the events that lead to the acti- 
vation of a cell’s suicide program. Apoptosis can be triggered by both 
internal stimuli, such as abnormalities in the DNA, and external 
stimuli, such as certain cytokines (proteins secreted by cells of the 
immune system). For example, epithelial cells of the prostate become 
apoptotic when deprived of the male sex hormone testosterone. This 
is the reason why prostate cancer that has spread to other tissues is 
often treated with drugs that interfere with testosterone production. 
Studies indicate that external stimuli activate apoptosis by a signal- 
ing pathway, called the extrinsic pathway, that is distinct from that 
utilized by internal stimuli, which is called the intrinsic pathway. 
Here we will discuss the extrinsic and intrinsic pathways separately. 
However, it should be noted that there is cross-talk between these 
pathways and that extracellular apoptotic signals can cause activa- 
tion of the intrinsic pathway. 


The Extrinsic Pathway of Apoptosis 


The steps in the extrinsic pathway are illustrated in FIGURE 15.39. In 
the case depicted in the figure, the stimulus for apoptosis is carried by 
an extracellular messenger protein called tumor necrosis factor (TNF), 
which was named for its ability to kill tumor cells. TNF is a trimeric 
protein produced by certain cells of the immune system in response to 
adverse conditions, such as exposure to ionizing radiation, elevated 
temperature, viral infection, or toxic chemical agents such as those 
used in cancer chemotherapy. Like other types of first messengers dis- 
cussed in this chapter, TNF evokes its response by binding to a trans- 
membrane receptor, TNFRI. The trimeric TNFRI protein is amember 
of a family of related “death receptors” that turns on the apoptotic pro- 
cess. The cytoplasmic domain of each TNF receptor subunit contains a 
segment of about 70 amino acids called a “death domain” (each green 
segment in Figure 15.39) that mediates protein-protein interactions. 
Binding of TNF to the trimeric receptor produces a change in confor- 
mation of the receptor’s death domain, which leads to the recruitment 
of a number of proteins, as indicated in Figure 15.39. 

The last proteins to join the complex that assembles at the inner 
surface of the plasma membrane are two procaspase-8 molecules 
(Figure 15.39). These proteins are called “procaspases” because 
each is a precursor of a caspase; it contains an extra portion (called 
the pro region) that must be removed by proteolytic processing to 
activate the enzyme. The synthesis of caspases as proenzymes pro- 
tects the cell from accidental proteolytic damage. Unlike most pro- 
enzymes, procaspases exhibit a low level of proteolytic activity. 
According to one model, when two or more procaspases are held in 
close association with one another, as they are in Figure 15.39, they 
are capable of cleaving one another’s polypeptide chain and con- 
verting the other molecule to the fully active caspase. The final 
mature enzyme (ie., caspase-8) contains four polypeptide chains, 
derived from two procaspase precursors as illustrated in the figure. 

Activation of caspase-8 is similar in principle to the activation of 
effectors by a hormone or growth factor. In all of these signaling 
pathways, the binding of an extracellular ligand causes a change in 
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FIGURE 15.39 A simplified model of the extrinsic (receptor-mediated) 
pathway of apoptosis and the necroptotic pathway. When TNF binds to 
a TNF receptor (TNFR1), the activated receptor binds the cytoplasmic 
adaptor protein TRADD and additional signaling proteins including the 
kinase RIP1K. The cytoplasmic domains of the TNF receptor, FADD, 
TRADD, and RIP1K, interact with one another by homologous regions 
called death domains that are present in each protein. In the apoptotic 
pathway, RIP1K associates with the adaptor protein FADD and procas- 
pase-8. Once assembled in the complex, two procaspase molecules cleave 
one another to generate an active caspase-8 molecule containing four 
polypeptide segments. Caspase-8 is an initiator complex that activates 
downstream (executioner) caspases that carry out the death sentence. In 
the necroptotic pathway, the RIP1K associates with another kinase, RIP3K to 
form the active necroptosome. RIP3K phosphorylation of MLKL is thought 
to allow MLKL to translocate to the membrane and oligomerize to form 
channels that allow for cytoplasmic leakage. It can be noted that the 
interaction between TNF and TNFR1 also can lead to cell survival rather 
than self-destruction (page 624). 


conformation of a receptor that leads to the binding and activation of 
proteins situated downstream in the pathway. Caspase-8 is described 
as an initiator caspase because it initiates apoptosis by cleaving and 
activating downstream, or executioner, caspases, that carry out the 
controlled self-destruction of the cell as described above. 


The Intrinsic Pathway of Apoptosis 


Internal stimuli, such as irreparable genetic damage, lack of oxygen 
(hypoxia), extremely high concentrations of cytosolic Ca**, viral 
infection, ER stress, or severe oxidative stress (i.e., the production of 
large numbers of destructive free radicals), trigger apoptosis by the 
intrinsic pathway illustrated in FIGURE 15.40. Activation of the 
intrinsic pathway is regulated by members of the Bcl-2 family of pro- 
teins, which are characterized by the presence of one or more small 
BH domains. Bcl-2 family members can be subdivided into three 


groups: (1) proapoptotic members (containing several BH domains) 
that promote apoptosis (Bax and Bak), (2) antiapoptotic members 
(containing several BH domains) that protect cells from apoptosis 
(e.g., Bcl-x;, Bcl-w, and Bcl-2),° and (3) BH3-only proteins (so-named 
because they contain only one BH domain), which promote apoptosis 
by an indirect mechanism. According to the prevailing view, BH3- 
only proteins (e.g., Bid, Bad, Puma, and Bim) can exert their 
proapoptotic effect in two different ways, depending on the particular 
proteins involved. In some cases they promote apoptosis by inhibiting 
antiapoptotic Bcl-2 members, whereas in other cases they promote 
apoptosis by activating proapoptotic Bax or Bak. In either case, the 
BH3-only proteins are the likely determinants as to whether a cell 
follows a pathway of survival or death. In a healthy cell, the BH3-only 
proteins are either absent or strongly inhibited, and the antiapoptotic 
Bcl-2 proteins are able to restrain proapoptotic members. The mech- 
anism by which this occurs is debated. It is only in the face of certain 
types of stress that the BH3-only proteins are expressed or activated, 
thereby shifting the balance in the direction of apoptosis. In these 
circumstances, the restraining effects of the antiapoptotic Bcl-2 pro- 
teins are overridden, and the proapoptotic protein Bax is free to 
translocate from the cytosol to the outer mitochondrial membrane 
(OMM). Although the mechanism is not entirely clear, it is thought 
that Bax molecules (and/or Bak molecules, which are permanent 
residents of the OMM) undergo a change in conformation that 
causes them to assemble into a multisubunit, protein-lined channel 
within the OMM. Once formed, this channel dramatically increases 
the permeability of the OMM and promotes the release of certain 
mitochondrial proteins, most notably cytochrome c, which resides in 
the intermembrane space (see Figure 5.17). Nearly all of the 
cytochrome c molecules present in all of a cell’s mitochondria can be 
released from an apoptotic cell in a period as short as five minutes. 
Cells lacking both Bax and Bak are protected from apoptosis, reveal- 
ing the essential roles of these proapoptotic proteins in this process. 

Release of proapoptotic mitochondrial proteins such as 
cytochrome c is apparently the “point of no return,’ that is, an event 
that irreversibly commits the cell to apoptosis. Once in the cytosol, 
cytochrome c forms part of a wheel-shaped multiprotein complex 
called the apoptosome, that also includes several molecules of procas- 
pase-9. Procaspase-9 molecules are thought to become activated by 
simply joining the multiprotein complex and do not require proteo- 
lytic cleavage (Figure 15.40). Like caspase-8, which is activated by the 
receptor-mediated pathway described above, caspase-9 is an initiator 
caspase that activates downstream executioner caspases, which bring 
about apoptosis.’ The extrinsic (receptor-mediated) and intrinsic (mito- 
chondria-mediated) pathways ultimately converge by activating the 
same executioner caspases, which cleave the same cellular targets. 

As cells execute the apoptotic program, they lose contact with 
their neighbors and start to shrink. Finally, the cell disintegrates 
into a condensed, membrane-enclosed apoptotic body. This entire 
apoptotic program can be executed in less than an hour. The apop- 
totic bodies are recognized by the presence of phosphatidylserine on 
their surface. Phosphatidylserine is a phospholipid that is normally 


>The first member of the family, Bcl-2 itself, was originally identified in 1985 as a 
cancer-causing oncogene in human lymphomas. The gene encoding Bcl-2 was over- 
expressed in these malignant cells as the result of a translocation. We now understand 
that Bcl-2 acts as an oncogene by promoting survival of potential cancer cells that 
would otherwise die by apoptosis. 


“Other intrinsic pathways that are independent of Apaf-1 and caspase-9, and possibly 
independent of cytochrome c, have also been described. 
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FIGURE 15.40 The intrinsic (mitochondria-mediated) pathway of 
apoptosis. Various types of cellular stress cause proapoptotic members of 
the Bcl-2 family of proteins—either Bax or Bak—to oligomerize within the outer 
mitochondrial membrane, forming channels that facilitate the release of 
cytochrome c molecules from the intermembrane space. Once in the cytosol, 
the cytochrome c molecules form a multisubunit complex with a cytosolic 
protein called Apaf-1 and procaspase-9 molecules. Procaspase-9 molecules 
are apparently activated to their full proteolytic capacity as the result of a 
conformational change induced by association with Apaf-1. Caspase-9 
molecules cleave and activate executioner caspases, which carry out the 
apoptotic response. The intrinsic pathway can be triggered in some cells 
(e.g., hepatocytes) by extracellular signals. This occurs as the initiator caspase 
of the extrinsic pathway, caspase 8, cleaves a BH3-only protein called Bid, 
generating a protein fragment (tBid) that binds to Bax, inducing insertion of 
Bax into the OMM and release of cytochrome c from mitochondria. 


present only on the inner leaflet of the plasma membrane. During 
apoptosis, a phospholipid “scramblase” moves phosphatidylserine 
molecules to the outer leaflet of the plasma membrane where they are 
recognized as an “eat me” signal by specialized macrophages. 
Apoptotic cell death thus occurs without spilling cellular content into 
the extracellular environment (FIGURE 15.41). This is important 
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FIGURE 15.41 Clearance of apoptotic cells is accomplished by 
phagocytosis. This electron micrograph shows an apoptotic cell “corpse” 
within the cytoplasm of a phagocyte. Note the compact nature of the 
engulfed cell and the dense state of its chromatin. 

Source: From Peter M. Henson, Donna L. Bratton, and Valerie A. Fadok, 
Curr. Biol. 11:R796, 2001, Fig. 1a, © 2001, with permission from Elsevier. 


because the release of cellular debris can trigger inflammation, which 
can cause a significant amount of tissue damage. 


Necroptosis 


In contrast to apoptosis, necroptotic cell death results in the ruptur- 
ing of the plasma membrane, causing cellular contents to be spilled 
into the environment, and often triggering an inflammatory 
response. This inflammatory response may often be beneficial, as a 
means to guide immune cells to a site of infection, for example. 
However, misregulation of the necroptotic pathway is suspected of 
triggering inflammatory diseases such as Crohn's disease and inflam- 
matory bowel disease. 

Understanding the molecular mechanisms underlying necrop- 
tosis is currently an active area of research. Recent studies have dem- 
onstrated that necroptosis can be triggered by the same signal—TNF 
binding to TNF receptor—that also leads to the extrinsic pathway of 
apoptosis. In the necroptotic pathway, however, caspase 8 is not ulti- 
mately activated. Rather, a series of interactions leads to the recruit- 
ment of the kinases RIPK1 and RIPK3, which are thought to 
oligomerize into a structure dubbed the necrosome, and the activa- 
tion of RIPK3 kinase activity (Figure 15.39). Recent evidence sug- 
gests that a substrate of RIPK3, called MLKL, is targeted to the 
plasma membrane after its phosphorylation. It is thought that MLKL 
can oligomerize to form transmembrane pores that result in loss of 
plasma membrane integrity and cell death. 


Signaling Cell Survival 


Just as there are signals that commit a cell to self-destruction, there 
are opposing signals that maintain cell survival. In fact, inter- 
action of TNF with a TNF receptor often transmits two distinct and 


opposing signals into the cell interior: one stimulating apoptosis and 
another stimulating cell survival. As a result, most cells that possess 
TNF receptors do not undergo apoptosis when treated with TNE 
This was a disappointing finding because it was originally hoped 
that TNF could be used as an agent to kill tumor cells. Cell survival 
is typically mediated through the activation of a key transcription 
factor called NF-«B, which activates the expression of genes encod- 
ing cell-survival proteins. It would appear that the fate of a cell 
depends on a delicate balance between signals promoting survival, 
apoptosis and necroptosis. 


Analytic Questions | 


1. The subject of cell signaling was placed near the end of the book 
because it ties together so many different topics in cell biology. 
Now that you have read the chapter, would you agree or disagree 
with this statement? Support your conclusions by example. 


2. Suppose the signaling pathway in Figure 15.3 were to lead to the 
activation of a gene that inhibits a cyclin-dependent kinase 
responsible for moving a cell into S phase of the cell cycle. How 
would a debilitating mutation in protein kinase 3 affect the cell’s 
growth? 


3. What might be the effect on liver function of a mutation in a gene 
that encodes a cAMP phosphodiesterase? of a mutation in a 
gene encoding a glucagon receptor? of a mutation in a gene 
encoding phosphorylase kinase? of a mutation that altered the 
active site of the GTPase of a G, subunit? (Assume in all cases 
that the mutation causes a loss of function of the gene product.) 


4. Cat, IP3, and cAMP have all been described as second messen- 
gers. In what ways are their mechanisms of action similar? In 
what ways are they different? 


5. In the reaction cascade illustrated in Figure 15.22, which steps 
lead to amplification and which do not? (See Quantitative Tutorial 
Video) 


6. Suppose that epinephrine and norepinephrine could initiate a 
similar response in a particular target cell. How could you deter- 
mine whether the two compounds act by binding to the same 
cell-surface receptor? 


7. One of the key experiments to show that gap junctions (Section 
7.12) allowed the passage of small molecules was carried out by 
allowing cardiac muscle cells (which respond to norepinephrine 
by contraction) to form gap junctions with ovarian granulosa 
cells (which respond to FSH by undergoing various metabolic 
changes). The researchers then added FSH to the mixed cell cul- 
ture and observed the contraction of the muscle cells. How 
could muscle cells respond to FSH, and what does this tell you 
about the structure and function of gap junctions? 


8. How would you expect a GTP analogue that the cell could not 
hydrolyze (a nonhydrolyzable analogue) to affect signaling 
events that take place during the stimulation of a liver cell by 
glucagon? What would be the effect of the same analogue on 
signal transduction of an epithelial cell after exposure to epider- 
mal growth factor (EGF)? How would this compare to the effects 
of the cholera toxin (page 590) on these same cells? 


9. You suspect that phosphatidylcholine might be serving as a pre- 
cursor for a second messenger that triggers secretion of a hor- 
mone in a type of cultured endocrine cell that you are studying. 


REVIEW | 


1. What are some of the functions of apoptosis in 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


. What is the role of the formation of caspase- 


vertebrate biology? Describe the steps that occur 
between (a) the time that a TNF molecule binds to its 
receptor and the eventual death of the cell and (b) the 
time a proapoptotic Bcl-2 member binds to the outer 
mitochondrial membrane and the death of the cell. 


containing complexes in the process of apoptosis? 
oo 


Furthermore, you suspect that the second messenger released 
by the plasma membrane in response to a stimulus is choline 
phosphate. What type of experiment might you perform to ver- 
ify your hypothesis? 


Figure 15.27 shows the localized changes in [Ca*] within the 
dendritic tree of a Purkinje cell. Calcium ions are small, rapidly 
diffusible agents. How is it possible for a cell to maintain differ- 
ent concentrations of this free ion in different regions of its cyto- 
sol? What do you suspect would happen if you injected a small 
volume of a calcium chloride solution into one region of a cell 
that had been injected previously with a fluorescent calcium 
probe? 


Formulate a hypothesis that might explain how contact of the 
outer surface of an egg by a fertilizing sperm at one site causes 
a wave of Ca** release that spreads through the entire egg, as 
shown in Figure 15.29. 


Because calmodulin activates many different effectors (e.g., pro- 
tein kinases, phosphodiesterases, calcium transport proteins), a 
calmodulin molecule must have many different binding sites on 
its surface. Would you agree with this statement? Why or why 
not? 


Diabetes mellitus is a disease that can result from a number of 
different defects involving insulin function. Describe three different 
molecular abnormalities in a liver cell that could cause different 
patients to exhibit a similar clinical picture, including, for exam- 
ple, high concentrations of glucose in the blood and urine. 


Would you expect a cell's response to EGF to be more sensitive 
to the fluidity of the plasma membrane than its response to insu- 
lin? Why or why not? 

Would you expect a mutation in Ras to act dominantly or reces- 
sively as a cause of cancer? Why? (A dominant mutation causes 
its effect when only one of the homologous alleles is mutated, 
whereas a recessive mutation requires that both alleles of the 
gene are mutated) (See Quantitative Tutorial Video). 


Speculate on a mechanism by which apoptosis might play a cru- 
cial role in combating the development of cancer, a topic dis- 
cussed in the following chapter. 


You are working with a type of fibroblast that normally responds 
to epidermal growth factor by increasing its rate of growth and 
division, and to epinephrine by lowering its rate of growth and 
division. You have determined that both of these responses 
require the MAP kinase pathway, and that EGF acts by means of 
an RTK and epinephrine by means of a G protein-coupled recep- 
tor. Suppose you identified a mutant strain of these cells that can 
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still respond to EGF but is no longer inhibited by epinephrine. 
You suspect that the mutation is affecting the cross-talk between 
two pathways (shown in Figure 15.35). Which component in this 
figure might be affected by such a mutation? 


In what way is the calcium wave that occurs at fertilization similar 
to a nerve impulse that travels down a neuron? 


Now that you have read the section on taste perception, why do 
you suppose it has been difficult to find effective rat poisons? 


One of the genes of the cowpox virus encodes a protein called 
CrmA that is a potent inhibitor of caspases. What effect would 
you expect this inhibitor to have on an infected cell? Why is this 
advantageous to the infecting virus? 


Most RTKs act directly on downstream effectors, whereas the 
insulin RTK acts through an intermediate docking protein, an 
insulin receptor substrate (IRS). Are there any advantages in sign- 
aling that might accrue from the use of these IRS intermediates? 


Researchers have reported that (1) most of the physiologic 
effects of insulin on target cells can be blocked by incubation of 


23. 


24. 


25. 


cells with wortmannin, a compound that specifically inhibits the 
enzyme PI3K and (2) that causing cells to overexpress a consti- 
tutively active form of PKB (i.e., a form of the enzyme that is 
continually active regardless of circumstances) induces a 
response in cells that is virtually identical to addition of insulin to 
these cells. Looking at Figure 15.25, are these observations ones 
that you might have predicted? Why or why not? 


Knockout mice that are unable to produce caspase-9 die as a 
result of a number of defects, most notably a greatly enlarged 
brain. Why would these mice have this phenotype? How would 
you expect the phenotype of a cytochrome c knockout mice to 
compare with that of the caspase-9 knockout? 


Why do you suppose that some people find a compound called 
PROP to have a bitter taste, whereas others do not report this 
perception? 

The inhibition of a specific protein kinase often leads to an increased 
phosphorylation of many cellular proteins. How can you explain 
this observation? 
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Cancer 


HENRIETTA LACKS 
AND THE HELA CELL 


Fleanets Lacks was only 31 years old when she died of 
an aggressive cervical cancer in 1951, but she has left a 
remarkable and enduring legacy. While Lacks was under 
treatment at Johns Hopkins hospital, a tissue sample of 
her cancer was removed and provided to a biologist 
named George Gey. Lacks and her family were not 
consulted, because, at the time, it was not required to 
receive consent from a patient before providing his or 
her tissue for medical research. 


Gey had worked for decades on establishing a human 
cell line that might provide insights into cancer. Just a 
year before receiving the sample from Lacks, he had 
established a tissue culture facility at Johns Hopkins that 


included handmade devices to culture and study cells. Microscopic image of HeLa cells fixed and stained against actin (red), 
One machine, called the roller drum, slowly rotated microtubules (cyan), and DNA (blue). 
numerous handblown glass tubes while maintaining Source: Courtesy of Tom Deerinck. 


them at a warm temperature. In each tube, Gey would 
place a tissue sample and bathe it in a nutrient broth 
that he had devised to sustain the cells. However, in 
tube after tube, the tissues would survive and grow only 
for 20-50 generations before eventually dying. 


continues 
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Gey observed something he had never seen before. The cells 
grew abundantly, growing into a thick sheet inside the tube. 
Gey divided the cells into additional tubes, where they con- 
tinued to divide and grow. It wasn’t long before Gey realized 
he had created the first immortal human cell line, which he 
called HeLa after the patient from which it was derived. 


Since its establishment in the early 1950s, HeLa cells have 
been used for over 74,000 research studies around the 
globe. The fact that permission was never granted for the 


16.1 Basic Properties 
of a Cancer Cell 


Cancer is a genetic disease because it can be traced to alterations 
within specific genes, but in most cases, it is not an inherited disease. 
In an inherited disease, the genetic defect is present in the chromo- 
somes of a parent and is transmitted to the zygote. In contrast, the 
genetic alterations that lead to most cancers arise in the DNA of a 
somatic cell during the lifetime of the affected individual. Because of 
these genetic changes, cancer cells become freed from many of the 
restraints to which normal cells are subjected. Normal cells do not 
divide unless they are stimulated to do so by the body’s homeostatic 
machinery; nor do they survive if they have incurred irreparable 
damage; nor do they wander away from a tissue to start new colonies 
elsewhere in the body. In contrast, most cancer cells experience a 
breakdown in all of these regulatory influences that protect the body 
from chaos and self-destruction. Most importantly, cancer cells pro- 
liferate uncontrollably, producing malignant tumors that invade sur- 
rounding healthy tissue (FIGURE 16.1). As long as the growth of the 
tumor remains localized, the disease can usually be treated and 
cured by surgical removal of the tumor. But malignant tumors tend 
to metastasize, that is, to spawn renegade cells that break away from 
the parent mass, enter the lymphatic or vascular circulation, and spread 


FIGURE 16.1 The invasion of normal tissue by a growing tumor. This 
light micrograph of a section of human liver shows a metastasized 
melanosarcoma (in red) that is invading the normal liver tissue. 


Source: Astrid and Hanns-Frieder Michler/Photo Researchers, Inc. 


scientific use of Lacks’ cells, however, has brought up 
numerous difficult ethical questions that have only recently 
started to be addressed. For example, in 2013, controversy 
surrounded the public release of the recently sequenced 
HeLa genome due to privacy concerns of Lacks’ descend- 
ants. Discussions between representatives from the 
National Institutes of Health and Lacks’ family led to the 
creation of a group, which includes Lack family 

members, that would provide controlled access to the 
genomic data. 


to distant sites in the body where they establish lethal secondary 
tumors (metastases) that are no longer amenable to surgical removal. 
The subject of metastasis is discussed in the Human Perspective of 
Chapter 7 (Section 17.8). 

Because of its impact on human health and the hope that a cure 
might be developed, cancer has been the focus of a massive research 
effort for decades. Although these studies have led to a remarkable 
breakthrough in our understanding of the cellular and molecular 
basis of cancer, prognoses for many types of cancers remain rela- 
tively unchanged. There has been progress, however. In 2011, the 
American Association for Cancer Research reported that death rates 
for all cancers combined dropped during the years between 1990 
and 2007 by 22 percent for men and 14 percent for women. Much of 
this progress is attributed to earlier diagnosis and treatment of three 
major types of cancer: breast cancer, prostate cancer, and colon can- 
cer. The incidence of various types of cancer in the United States and 
the corresponding mortality rates are shown in FIGURE 16.2. Most 
current treatments, such as chemotherapy and radiation, lack the 
specificity needed to kill cancer cells without simultaneously damag- 
ing normal cells, as evidenced by the serious side effects that accom- 
pany these treatments. As a result, patients cannot usually be 
subjected to high enough doses of chemicals or radiation to kill all of 
the tumor cells in their body. Researchers have been working for 
many years to develop more effective and less debilitating targeted 
therapies. Some of these newer strategies in cancer therapy will be 
discussed at the end of the chapter. 

The behavior of cancer cells is most easily studied when the cells 
are growing in culture. As discussed in the chapter opener, cancer cells 
can be obtained by removing a malignant tumor, dissociating the tis- 
sue into its separate cells, and culturing the cells in vitro. Over the 
years, many different lines of cultured cells that were originally derived 
from human tumors have been collected in cell banks and are available 
for study. Alternatively, normal cells can be converted to cancer cells 
by treatment with carcinogenic chemicals, radiation, or tumor viruses. 
Cells that have been transformed in vitro by chemicals or viruses can 
generally cause tumors when introduced into a suitable host animal. 
There are many differences in properties from one type of cancer cell 
to another. At the same time, there are a number of basic properties 
that are shared by cancer cells, regardless of their tissue of origin. 

At the cellular level, the most important characteristic of a cancer 
cell—whether residing in the body or on a culture dish—is its loss 
of growth control. The capacity for growth and division is not 
dramatically different between a cancer cell and most normal cells. 
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FIGURE 16.2 Estimates of new cancer cases and deaths 
in the United States in 2015. A total of 1,658,370 new 
cancer cases and 589,430 cancer deaths are estimated 

for 2015. 


Source: Data from the American Cancer Society, Inc. 


Breast 


When normal cells are grown in tissue culture under conditions that 
promote cell proliferation, they grow and divide at a rate similar to 
that of their malignant counterparts. However, when the normal 
cells proliferate to the point where they cover the bottom of the cul- 
ture dish, their growth rate decreases markedly, and they tend to 
remain as a single layer (monolayer) of cells (FIGURE 16.3a,b). 
Growth rates drop as normal cells respond to inhibitory influences 
from their environment. Growth-inhibiting influences may arise as 
the result of depletion of growth factors in the culture medium or 
from contact with surrounding cells on the dish. In contrast, when 
malignant cells are cultured under the same conditions, they con- 
tinue to grow, piling on top of one another to form clumps (Figure 
16.3c,d). It is evident that malignant cells are not responsive to the 
types of signals that cause their normal counterparts to cease growth 
and division. 


Normal cells 


Cancer cells grow in clumps (foci) 
(c) (d) 


Ovarian i 


Soft tissue Fr 


Prostate 
Bladder 
Uterine 
Kidney 


Colorectal —_-——— 


Lymphoma 


Leukemia —— 
Pancreatic — 


Melanoma 
Brain and nervous — 


Lung and bronchus 


Not only do cancer cells ignore inhibitory growth signals, they 
continue to grow in the absence of stimulatory growth signals that 
are required by normal cells. Normal cells growing in culture depend 
on growth factors, such as epidermal growth factor and insulin, that 
are present in serum (the fluid fraction of blood), which is usually 
added to the growth medium (FIGURE 16.4). Cancer cells can prolif- 
erate in the absence of serum because their cell cycle does not depend 
on the interaction between growth factors and their receptors, which 
are located at the cell surface (Section 15.10). As we will see below, 
this transformation is a result of basic changes in the intracellular 
pathways that govern cell proliferation and survival. 

Normal cells growing in culture exhibit a limited capacity for 
cell division; after a finite number of mitotic divisions, they undergo 
an aging process that renders them unfit to continue to grow and 
divide. Cancer cells, on the other hand, are seemingly immortal 


FIGURE 16.3 Growth properties of normal and cancerous 
cells. Normal cells typically grow in a culture dish until they cover 
the surface as a monolayer (a and b). In contrast, cells that have 
been transformed by viruses or carcinogenic chemicals (or 
malignant cells that have been cultured from tumors) typically 
grow in multilayered clumps, or foci (c and d). 


Source: (b, d) Courtesy of G. Steven Martin, University of 
California at Berkeley. 
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FIGURE 16.4 The effects of serum deprivation on the growth of 
normal and transformed cells. Whereas the growth of cancer cells 
continues regardless of the presence or absence of exogenous growth 
factors, normal cells require these substances in their medium for growth to 
continue. The growth of normal cells levels off as the growth factors in the 
medium are depleted. 


because they continue to divide indefinitely. This difference in growth 
potential is often attributed to the presence of telomerase in cancer 
cells and its absence in normal cells. Recall from page 475 that telom- 
erase is the enzyme that maintains the telomeres at the ends of the 
chromosomes, thus allowing cells to continue to divide. The absence 
of telomerase from most types of normal cells is thought to be one of 
the body’s major defenses that protects against tumor growth. 

The most striking alterations in the nucleus following transfor- 
mation occur within the chromosomes. Unlike normal cells that 
replicate their DNA at a very low error rate (page 526), cancer cells 
are genetically unstable and often have highly aberrant chromosome 
complements, a condition termed aneuploidy (FIGURE 16.5), which 


may occur primarily as a result of defects in the mitotic checkpoint 
(Section 14.5) or the presence of an abnormal number of cen- 
trosomes (see Figure 14.17c).' It is evident from Figure 16.5 that the 
growth of cancer cells is much less dependent on a standard diploid 
chromosome content than the growth of normal cells. In fact, when 
the chromosome content of a normal cell becomes disturbed, a sign- 
aling pathway is usually activated that leads to the self-destruction 
(apoptosis) of the cell. In contrast, cancer cells typically fail to elicit 
the apoptotic response even when their chromosome content 
becomes highly deranged. Protection from apoptosis is another 
important hallmark that distinguishes many cancer cells from nor- 
mal cells. Finally, it can be noted that cancer cells often depend on 
glycolysis, which is considered an anaerobic metabolic pathway 
(Figure 3.24). This property may reflect the high metabolic require- 
ments of cancer cells and an inadequate blood supply within the 
tumor. Under conditions of hypoxia (reduced O3), cancer cells acti- 
vate a transcription factor called HIF that induces the formation of 
new blood vessels and promotes the migratory properties of the 
cells, which may contribute to the spread of the tumor. However, 
even when oxygen is plentiful, many tumor cells continue to gener- 
ate much of their ATP by glycolysis (called aerobic glycolysis). Even 
though glycolysis generates much less ATP per glucose than does 
oxidative phosphorylation in the mitochondrion, it produces ATP at 
a more rapid rate. The increased uptake of glucose by tumor cells 
compared to normal cells can be used as a means to locate metastatic 
tumors within the body using PET scans (see Figure 16.24). 


'There is controversy as to whether the development of aneuploidy occurs at an early 
stage in tumor formation and is a cause of the genetic instability that characterizes 
cancer cells, or is a late event and is simply a consequence of abnormal cancer growth. 


FIGURE 16.5 Karyotype of a cell from a breast cancer line showing a highly abnormal chromosome complement. A normal diploid cell would have 
22 pairs of autosomes and two sex chromosomes. The two members of a pair would be identical, and each chromosome would be a single continuous color 
(as in the karyotype of a normal cell in Figure 12.22b that uses a similar spectral visualization technique). The chromosomes of this cell are highly deranged 


as evidenced by the presence of extra and missing chromosomes (i.e., aneuploidy) as well as chromosomes of more than one color. These multicolored 
chromosomes reflect the large numbers of translocations that have occurred in previous cell generations. A cell with normal cell cycle checkpoints and 
apoptotic pathways could never have attained a chromosome complement approaching that seen here. 


Source: Courtesy of Joanne Davidson and Paul A. W. Edwards. 


It is these properties, which can be demonstrated in culture, 
together with their tendency to spread to distant sites within the 
body, that make cancer cells such a threat to the well-being of the 
entire organism. 


REVIEW | 


1. Describe some of the properties that distinguish 
cancer cells from normal cells. 


2. How do the properties of cancer cells manifest 
themselves in culture? 


16.2 The Causes of Cancer 


In 1775, Percivall Pott, a British surgeon, made the first known cor- 
relation between an environmental agent and the development of 
cancer. Pott concluded that the high incidence of cancer of the nasal 
cavity and the skin of the scrotum in chimney sweeps was due to 
their chronic exposure to soot. Within the past several decades, the 
carcinogenic chemicals in soot have been isolated, along with 
hundreds of other compounds shown to cause cancer in laboratory 
animals. In addition to a diverse array of chemicals, a number of 
other types of agents are also carcinogenic, including ionizing radia- 
tion and a variety of DNA- and RNA-containing viruses. All of these 
agents have one property in common: They alter the genome. 
Carcinogenic chemicals, such as those present in soot or cigarette 
smoke, can almost always be shown either to be directly mutagenic 
or to be converted to mutagenic compounds by cellular enzymes. 
Similarly, ultraviolet radiation, which is the leading cause of skin 
cancer, is also strongly mutagenic. 

A number of viruses can infect mammalian cells growing in cell 
culture, transforming them into cancer cells. These viruses are 
broadly divided into two large groups: DNA tumor viruses and 
RNA tumor viruses, depending on the type of nucleic acid found 
within the mature virus particle. Among the DNA viruses capable of 
transforming cells are polyoma virus, simian virus 40 (SV40), adeno- 
virus, and herpes-like viruses. RNA tumor viruses, or retroviruses, 
are similar in structure to HIV (see Figure 1.22b) and are the subject 
of the Experimental Pathways in Section 16.3. Tumor viruses can 
transform cells because they carry genes whose products interfere 
with the cell’s normal growth-regulating activities. Although tumor 
viruses were an invaluable tool for researchers in identifying numer- 
ous genes involved in cell transformation, they are associated with 
only a small number of human cancers. Other types of viruses are, 
however, linked to as many as 20 percent of cancers worldwide. In 
most cases, these viruses greatly increase a person’s risk of develop- 
ing the cancer, rather than being the sole determinant responsible for 
the disease. This relationship between viral infection and cancer is 
illustrated by human papilloma virus (HPV), which can be transmit- 
ted through sexual activity and is increasing in frequency in the pop- 
ulation. Although the virus is present in about 90 percent of cervical 
cancers, indicating its importance in development of the disease, the 
vast majority of women who have been infected with the virus will 
never develop this malignancy. HPV is also linked as a primary caus- 
ative agent of cancers of the mouth and tongue in both men and 
women. Effective vaccines against this virus are now available. Other 
viruses linked to human cancers include hepatitis B virus, which is 
associated with liver cancer; Epstein-Barr virus, which is associated 


with Burkitt's lymphoma in areas where malaria is common; anda 631 


herpes virus (HHV-8), which is associated with Kaposi’s sarcoma. 
Certain gastric lymphomas are associated with chronic infection by 
the stomach-dwelling bacterium Helicobacter pylori, which can also 
cause ulcers. Recent evidence suggests that many of these cancers linked 
to persistent viral and bacterial infections are actually caused by the 
chronic inflammation that is triggered by the presence of the pathogen. 
Inflammatory bowel disease (IBD), which is also characterized by 
chronic inflammation, has been associated with an increased risk of 
colon cancer. These findings have caused researchers to look more 
closely at the general process of inflammation as a previously unex- 
plored factor in the development of many types of cancers. 
Determining the causes of different types of cancer is an 
endeavor carried out by epidemiologists, researchers who study dis- 
ease patterns in populations. The causes of certain cancers are obvi- 
ous: Smoking causes lung cancer, exposure to ultraviolet radiation 
causes skin cancer, and inhaling asbestos fibers causes mesotheli- 
oma. But despite a large number of studies, we are still uncertain as 
to the causes of most types of human cancer. Humans live in com- 
plex environments and are exposed to many potential carcinogens in 
a changing pattern over a period of decades. Attempting to deter- 
mine the causes of cancer from a mountain of statistical data obtained 
from the answers to questionnaires about individual lifestyles has 
proven very difficult. The importance of environmental factors (e.g., 
diet) is seen most clearly in studies of the children of couples that 
have moved from Asia to the United States or Europe. These indi- 
viduals no longer exhibit a high rate of gastric cancer, as occurs in 
Asia, but instead are subject to an elevated risk of colon and breast 
cancer, which is characteristic of Western countries (FIGURE 16.6). 
There is a general consensus among epidemiologists that diet 
can play a major role in the risk of developing cancer. Cancer rates 
are higher among obese individuals than the non-obese population 
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FIGURE 16.6 Changing cancer incidence in persons of 
Japanese descent following migration to Hawaii. The incidence 
of stomach cancer declines, whereas that of breast and colon 
cancer rises. However, of the three types of cancer, only colon 
cancer has reached rates equivalent to Caucasian Hawaiians by the 
second generation. 


Source: From L. N. Kolonel et al., reprinted with permission from 
Nature Revs. Cancer 4:3, 2004; copyright 2004. Nature Reviews 
Cancer by Nature Publishing Group. Reproduced with permission 
of Nature Publishing Group in the format reuse in a book/textbook 
via Copyright Clearance Center. 
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(Section 3.14) protects against cancer. Recent attention has focused 
on elevated levels of insulin and insulin-like growth factor (IGF-1) 
that are found in obese individuals as being a primary cause of the 
increased cancer incidence in this group. There is also evidence that 
some ingredients in the diet, such as animal fat and alcohol, can 
increase the risk of developing cancer, whereas certain compounds 
found in food items may reduce that risk. Examples of the latter 
include isoflavones found in soy, sulforaphanes found in broccoli, 
and EGCG found in tea. Several widely prescribed drugs also have a 
preventive effect. Drugs that interfere with the action of estrogen 
(e.g., tamoxifen or raloxifene) or the metabolism of testosterone 
(e.g., finasteride) can reduce the incidence of breast cancer or pros- 
tate cancer, respectively. Long-term use of nonsteroidal anti-inflam- 
matory drugs (NSAIDs) such as aspirin and indomethacin has been 
shown to markedly decrease the risk of colon cancer. They are 
thought to have this effect by inhibiting cyclooxygenase-2, an 


enzyme that catalyzes the synthesis of hormone-like prostaglandins, 
which promote the growth of intestinal polyps. The cancer-sup- 
pressing action of NSAIDs supports the idea that inflammation 
plays a major role in the development of various cancers. Persons 
who have taken the antidiabetes drug metformin also appear to have 
a significantly reduced risk of developing cancer. In this case, the 
benefit may be a result of the drug’s action in lowering the circulat- 
ing levels of insulin and IGF-1. 


REVIEW | 


1. Describe at least three different agents that are 
known to be carcinogenic. 


2. Describe several factors that can lead to an 
increased risk of cancer. 


16.3 EXPERIMENTAL PATHWAYS 


The Discovery of Oncogenes 


In 1911, Peyton Rous of The Rockefeller Institute for Medical 
Research published a paper that was less than one page in length 
(it shared the page with a note on the treatment of syphilis) and 
had virtually no impact on the scientific community. Yet this paper 
reported one of the most farsighted observations in the field of 
cell and molecular biology.' Rous had been working with a chicken 
sarcoma that could be propagated from one hen to another by 
inoculating a host of the same strain with pieces of tumor tissue. 
In this paper, Rous described a series of experiments that strongly 
suggested that the tumor could be transmitted from one animal 
to another by a “filterable virus,” which is a term that had been 
coined a decade or so earlier to describe pathogenic agents that 
were small enough to pass through filters that were impermeable 
to bacteria. 

In his experiments, Rous removed the tumors from the breasts 
of hens, ground the cells in a mortar with sterile sand, centrifuged 
the particulate material into a pellet, removed the supernatant, 
and forced the supernatant fluid through filters of varying porosity 
including those small enough to prevent the passage of bacteria. 
He then injected the filtrate into the breast muscle of a recipient 
hen and found that a significant percentage of the injected ani- 
mals developed the tumor. 

The virus discovered by Rous in 1911 is an RNA-containing 
virus. By the end of the 1960s, similar viruses were found to be 
associated with mammary tumors and leukemias in rodents and 
cats. Certain strains of mice had been bred that developed specific 
tumors with very high frequency. RNA-containing viral particles 
could be seen within the tumor cells and also budding from the cell 
surface, as shown in the micrograph of FIGURE 1. It was apparent 
that the gene(s) causing tumors in these inbred strains are trans- 
mitted vertically, that is, through the fertilized egg from mother to 
offspring, so that the adults of each generation invariably develop 
the tumor. These studies provided evidence that the viral genome 
can be inherited through the gametes and subsequently trans- 
mitted from cell to cell by means of mitosis without having any 


FIGURE 1 Electron micrograph of a Friend mouse leukemia virus 
budding from the surface of a cultured leukemic cell. 


Source: Courtesy of E. De Harven. 


obvious effect on the behavior of the cells. The presence of inher- 
ited viral genomes is not a peculiarity of inbred laboratory strains, 
because it was shown that wild (feral) mice treated with chemical 
carcinogens develop tumors that often contain the same antigens 
characteristic of RNA tumor viruses and that exhibit virus particles 
under the electron microscope. 


TABLE 1 Characterization of the Polymerase Product 


Percentage 
undigested 
product 


Acid-insoluble 


Treatment radioactivity 


Untreated 1,425 (100) 
20 ug deoxyribonuclease 125 


20 ug micrococcal 69 
nuclease 

20 ug ribonuclease 

Untreated 

NaOH hydrolyzed 


Source: From D. Baltimore, Nature 226:1210, 1970, copyright 1970. Nature by 
Nature Publishing Group. Reproduced with permission of Nature Publishing 
Group in the format reuse in a book/textbook via Copyright Clearance Center. 


One of the major questions concerning the vertical transmis- 
sion of RNA tumor viruses was whether the viral genome is passed 
from parents to progeny as free RNA molecules or is somehow 
integrated into the DNA of the host cell. Evidence indicated 
that infection and transformation by these viruses required the 
synthesis of DNA. Howard Temin of the University of Wisconsin 
suggested that the replication of RNA tumor viruses occurs by 
means of a DNA intermediate—a provirus—which then serves as 
a template for the synthesis of viral RNA. But this model requires 
a unique enzyme—an RNA-dependent DNA polymerase—which 
had never been found in any type of cell. Then in 1970, an enzyme 
having this activity was discovered independently by David 
Baltimore of the Massachusetts Institute of Technology and by 
Temin and Satoshi Mizutani.23 

Baltimore examined the virions (the mature viral particles) 
from two RNA tumor viruses, Rauscher mouse leukemia virus 
(R-MLV) and Rous sarcoma virus (RSV). A preparation of purified 
virus was incubated under conditions that would promote the 
activity of a DNA polymerase, including Mg?" (or Mn?*), NaCl, 
dithiothreitol (which prevents the —SH groups of the enzyme 
from becoming oxidized), and all four deoxyribonucleoside tri- 
phosphates, one of which (TTP) was radioactively labeled. Under 
these conditions, the preparation incorporated the labeled DNA 
precursor into an acid-insoluble product that exhibited the prop- 
erties of DNA (Table 1). As is characteristic of DNA, the reaction 
product was rendered acid soluble (indicating that it had been 
converted to low-molecular-weight products) by treatment with 
pancreatic deoxyribonuclease or micrococcal nuclease, but it was 
unaffected by pancreatic ribonuclease or by alkaline hydrolysis (to 
which RNA is sensitive; Table 1). The DNA-polymerizing enzyme 
was found to co-sediment with the mature virus particles, suggest- 
ing that it was part of the virion itself and not an enzyme donated 
by the host cell. Although the product was insensitive to treatment 
with pancreatic ribonuclease, the template was very sensitive to 
this enzyme (FIGURE 2), particularly if the virions were pretreated 
with the ribonuclease prior to addition of the other components 
of the reaction mixture (Figure 2, curve 4). These results strength- 
ened the suggestion that the viral RNA was providing the tem- 
plate for synthesis of a DNA copy, which presumably served as a 
template for the synthesis of viral mRNAs required for infection 
and transformation. Not only did these experiments suggest that 
cellular transformation by RNA tumor viruses proceeded through 
a DNA intermediate, they also overturned the long-standing 
concept originally proposed by Francis Crick and known as the 
Central Dogma, which stated that information in a cell always 
flowed from DNA to RNA to protein. The RNA-dependent DNA 
polymerase became known as reverse transcriptase. 

During the 1970s, attention turned to the identification of the 
genes carried by tumor viruses that were responsible for trans- 


3H-TMP incorporated (c.p.m. x 1072) 


FIGURE 2 Incorporation of radioactivity from [SH]TTP into an acid- 
insoluble precipitate by the Rauscher murine leukemia virus DNA 
polymerase in the presence and absence of ribonuclease. (Note: The 
labeled TTP precursor is converted to TMP as it is incorporated into 
DNA.) Curve 1, no added ribonuclease; curve 2, preincubated without 
added ribonuclease for 20 minutes before addition of [PH]TTP; curve 3, 
ribonuclease added to the reaction mixture; curve 4, preincubated with 
ribonuclease before addition of [PHITTP. 

Source: From D. Baltimore, Nature 226:1210, 1970, copyright 1970. 
Nature by Nature Publishing Group. Reproduced with permission of 
Nature Publishing Group in the format reuse in a book/textbook via 
Copyright Clearance Center. 


formation and the mechanism of action of the gene products. 
Evidence from genetic analyses indicated that mutant strains 
of viruses could be isolated that retained the ability to grow in 
host cells, but were unable to transform the cell into one exhibit- 
ing malignant properties.* Thus, the capacity to transform a cell 
resided in a restricted portion of the viral genome. 

These findings set the stage for a series of papers by 
Harold Varmus, J. Michael Bishop, Dominique Stehelin, and 
their co-workers at the University of California, San Francisco. 
These researchers began by isolating mutant strains of the avian 
sarcoma virus (ASV) carrying deletions of 10 to 20 percent of 
the genome that render the virus unable to induce sarcomas in 
chickens or to transform fibroblasts in culture. The gene respon- 
sible for transformation, which is missing in these mutants, was 
referred to as src (for sarcoma). To isolate the DNA corresponding 
to the deleted regions of these mutants, which presumably carry 
the genes required for transformation, the following experimental 
strategy was adopted.° RNA from the genomes of complete 
(oncogenic) virions was used as a template for the formation of 
a radioactively labeled, single-stranded, complementary DNA 
(cDNA) using reverse transcriptase. The labeled cDNA (which is 
present as fragments) was then hybridized to RNA obtained from 
one of the deletion mutants. Those DNA fragments that failed 
to hybridize to the RNA represent portions of the genome that 
had been deleted from the transformation-defective mutant and 
thus were presumed to contain the gene required by the virus to 
cause transformation. DNA fragments that did not hybridize to 

continued 
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the RNA were separated from those that were part of DNA-RNA 
hybrids by column chromatography. Using this basic strategy, 
a DNA sequence referred to as cCDNA,,,. was isolated, which cor- 
responded to approximately 16 percent of the viral genome (1600 
nucleotides out of a total genomic length of 10,000 nucleotides). 

Once isolated, cDNAga;< proved to be a very useful probe. 
It was first shown that this labeled cDNA hybridizes to DNA 
extracted from cells of a variety of avian species (chicken, turkey, 
quail, duck, and emu), indicating that the cellular genomes of 
these birds contain a DNA sequence that is closely related to src. 

These findings provided the first strong evidence that a gene 
carried by a tumor virus that causes cell transformation is actu- 
ally present in the DNA of normal (uninfected) cells and thus is 
presumably a part of the cells’ normal genome. These results 
indicated that the transforming genes of the viral genome (the 
oncogenes) are not true viral genes, but rather are cellular genes 
that were picked up by RNA tumor viruses during a previous infec- 
tion. Possession of this cell-derived gene apparently endows the 
virus with the power to transform the very same cells in which this 
gene is normally found. The fact that the src sequence is present 
in all of the avian species tested suggests that the sequence has 
been conserved during avian evolution and, thus, is presumed to 
govern a basic activity of normal cells. In a subsequent study, it 
was found that cCDNA,,,< binds to DNA from all vertebrate classes, 
including mammals, but not to the DNA from sea urchins, fruit 
flies, or bacteria. Based on these results it was possible to con- 
clude that the src gene is not only present in the RNA of the ASV 
genome and the genome of the chicken cells it can infect, but a 
homologous gene is also present in the DNA of distantly related 
vertebrates, suggesting that it plays some critical function in the 
cells of all vertebrates.’ 

These findings raised numerous questions; foremost among 
these were (1) what is the function of the src gene product, and 
(2) how does the presence of the viral src gene (referred to as 
v-src) alter the behavior of a normal cell that already possesses a 
copy of the cellular gene (referred to as c-src)? 

The product of the src gene was initially identified by Ray 
Erikson and co-workers at the University of Colorado by two 
independent procedures: (1) precipitation of the protein from 
extracts of transformed cells by antibodies prepared from RSV- 
infected animals, and (2) synthesis of the protein in a cell-free 
protein-synthesizing system using the isolated viral gene as a tem- 
plate. Using these procedures, the src gene product was found to 
be a protein of 60,000 daltons, which they named pp60°°.8 When 
pp60*" was incubated with [°P]ATP, radioactive phosphate groups 
were transferred to the heavy chains of the associated antibody 
(IgG) molecules used in the immunoprecipitation. This finding 
suggested that the src gene codes for an enzyme that possesses 
protein kinase activity.” When cells infected with ASV were fixed, 
sectioned, and incubated with ferritin-labeled antibodies against 
ppd60*", the antibodies were found to be localized on the inner 
surface of the plasma membrane, suggesting a concentration of 
the src gene product in this part of the cell (FIGURE 3).'° 

These were the first studies to elucidate the function of an 
oncogene. A protein kinase is the type of gene product that might 
be expected to have potential transforming activity, because it 
can regulate the activities of numerous other proteins, each of 
which might serve a critical function in one or another activity 
related to cell growth. Further analysis of the role of the src gene 
product turned up an unexpected finding. Unlike all the other pro- 
tein kinases whose function had been studied, pp60** transferred 
phosphate groups to tyrosine residues on the substrate protein 
rather than to serine or threonine residues.'' The existence of 
phosphorylated tyrosine residues had escaped previous detec- 
tion because phosphorylated serine and threonine residues are 


0.1 um 


FIGURE 3 Electron micrograph of a section through a pair of 
adjacent fibroblasts that had been treated with ferritin-labeled 
antibodies against the pp60** protein. The protein is localized (as 
revealed by the dense ferritin granules) at the plasma membrane of the 
cell and is particularly concentrated at the sites of gap junctions. 


Source: From Mark C. Willingham, Gilbert Jay, and Ira Pastan, Cell 
18:128, 1979, with permission from Elsevier. 


approximately 3000 times more abundant in cells than phospho- 
tyrosine, and because phosphothreonine and phosphotyrosine 
residues are difficult to separate from one another by traditional 
electrophoretic procedures. Not only did the product of the viral 
src gene (v-src) code for a tyrosine protein kinase, so too did c-src, 
the cellular version of the gene. However, the number of phos- 
phorylated tyrosine residues in proteins of RSV-transformed cells 
was approximately eight times higher than that of control cells. 
This finding suggested that the viral version of the gene may 
induce transformation because it functions at a higher level of 
activity than the cellular version. 

The results from the study of RSV provided preliminary evi- 
dence that an increased activity of an oncogene product could be 


a key to converting a normal cell into a malignant cell. Evidence 
soon became available that the malignant phenotype could also 
be induced by an oncogene that contained an altered nucleotide 
sequence. A key initial study was conducted by Robert Weinberg 
and his colleagues at the Massachusetts Institute of Technology 
using the technique of DNA transfection." 

Weinberg began the studies by obtaining 15 different malig- 
nant cell lines that were derived from mouse cells that had been 
treated with a carcinogenic chemical. Thus these cells had been 
made malignant without exposing them to viruses. The DNA 
from each of these cell lines was extracted and used to transfect 
a type of nonmalignant mouse fibroblast called an NIH3T3 cell. 
NIH3T3 cells were selected for these experiments because they 
take up exogenous DNA with high efficiency and they are readily 
transformed into malignant cells in culture. After transfection with 
DNA from the tumor cells, the fibroblasts were grown in vitro, and 
the cultures were screened for the formation of clumps (foci) that 
contained cells that had been transformed by the added DNA. Of 
the 15 cell lines tested, five of them yielded DNA that could trans- 
form the recipient NIH3T3 cells. DNA from normal cells lacked 
this capability. These results demonstrated that carcinogenic 
chemicals produced alterations in the nucleotide sequences of 
genes that gave the altered genes the ability to transform other 
cells. Thus cellular genes could be converted into oncogenes in 
two different ways: as the result of becoming incorporated into 
the genome of a virus or by becoming altered by carcinogenic 
chemicals. 

Up to this point, virtually all of the studies on cancer-causing 
genes had been conducted in mice, chickens, or other organisms 
whose cells were highly susceptible to transformation. In 1981, 
attention turned to human cancer when it was shown that DNA 
isolated from human tumor cells can transform mouse NIH3T3 
cells following transfection.'? Of 26 different human tumors that 
were tested in this study, two provided DNA that was capable of 
transforming mouse fibroblasts. In both cases, the DNA had been 
extracted from cell lines taken from a bladder carcinoma (identi- 
fied as EJ and J82). Extensive efforts were undertaken to deter- 
mine if the genes had been derived from a tumor virus, but no 
evidence of viral DNA was detected in these cells. These results 
provided the first evidence that some human cancer cells con- 
tain an activated oncogene that can be transmitted to other cells, 
causing their transformation. 

The finding that cancer can be transmitted from one cell 
to another by DNA fragments provided a basis for determining 
which genes in a cell, when activated by mutation or some other 
mechanism, are responsible for causing the cell to become malig- 
nant. To make this determination, it was necessary to isolate the 
DNA that was taken up by cells, causing their transformation. 
Once the foreign DNA responsible for transformation had been 
isolated, it could be analyzed for the presence of cancer-causing 
alleles. Within two months of one another in 1982, three different 
laboratories reported the isolation and cloning of an unidenti- 
fied gene from human bladder carcinoma cells that can transform 
mouse NIH3T3 fibroblasts. '4"6 

Once the transforming gene from human bladder cancer 
cells had been isolated and cloned, the next step was to deter- 
mine if this gene bears any relationship to the oncogenes car- 
ried by RNA tumor viruses. Once again, within two months of 
one another, three papers appeared from different laboratories 
reporting similar results.'7-'7 All three showed that the onco- 
gene from human bladder carcinomas that transforms NIH3T3 
cells is the same oncogene (named ras) that is carried by the 
Harvey sarcoma virus, which is a rat RNA tumor virus. Preliminary 
comparisons of the two versions of ras—the viral version and its 
cellular homologue—failed to show any differences, indicating 


that the two genes are either very similar or identical. These find- 
ings suggested that cancers that develop spontaneously in the 
human population are caused by a genetic alteration that is sim- 
ilar to the changes in cells that have been virally transformed in 
the laboratory. It is important to note that the types of cancers 
induced by the Harvey sarcoma virus (sarcomas and erythroleu- 
kemias) are quite different from the bladder tumors, which have 
an epithelial origin. This was the first indication that alterations 
in the same human gene—RAS—can cause a wide range of dif- 
ferent tumors. 

By the end of 1982, three more papers from different labo- 
ratories reported on the precise changes in the human RAS gene 
that leads to its activation as an oncogene.”°** Once the section 
of the large DNA fragment that is responsible for causing trans- 
formation was pinned down, nucleotide sequence analysis indi- 
cated that the DNA from the malignant bladder cells is activated 
as a result of a single base substitution within the coding region 
of the gene. Remarkably, cells of both human bladder carcinomas 
studied (identified as EJ and T24) contain DNA with precisely the 
same alteration: a guanine-containing nucleotide at a specific site 
in the DNA of the proto-oncogene had been converted to a thy- 
midine in the activated oncogene. This base substitution results 
in the replacement of a valine for a glycine as the twelfth amino 
acid residue of the polypeptide. 

Determination of the nucleotide sequence of the v-ras gene 
carried by the Harvey sarcoma virus revealed an alteration in 
base sequence that affected precisely the same codon found to 
be modified in the DNA of the human bladder carcinomas. The 
change in the viral gene substitutes an arginine for the normal 
glycine. It was apparent that this particular glycine residue plays a 
critical role in the structure and function of this protein. It is inter- 
esting to note that the human RAS gene is a proto-oncogene that, 
like SRC, can be activated by linkage to a viral promoter. Thus 
RAS can be activated to induce transformation by two totally dif- 
ferent pathways either by increasing its expression or by altering 
the amino acid sequence of its encoded polypeptide. 

The research described in this Experimental Pathway 
provided a great leap forward in our understanding of the genetic 
basis of malignant transformation. Much of the initial research on 
RNA tumor viruses stemmed from the belief that these agents 
may be an important causal agent in the development of human 
cancer. The search for viruses as a cause of cancer led to the 
discovery of the oncogene, which led to the realization that the 
oncogene is a cellular sequence that is acquired by the virus, 
which ultimately led to the discovery that an oncogene can 
cause cancer without the involvement of a viral genome. Thus 
tumor viruses, which are not themselves directly involved in most 
human cancers, have provided the necessary window through 
which we can view our own genetic inheritance for the presence 
of information that can lead to our own undoing. 


References 


1. Rous, P. 1911. Transmission of a malignant new growth by means of a 
cell-free filtrate. J. Am. Med. Assoc. 56:198. 

2. Baltimore, D. 1970. RNA-dependent DNA polymerase in virions of 
RNA tumour viruses. Nature 226:1209-1211. 

3. Temin, H. & Mizutani, S. 1970. RNA-dependent DNA polymerase in 
virions of Rous sarcoma virus. Nature 226:1211-1213. 

4. Martin, G. S. 1970. Rous sarcoma virus: A function required for the 
maintenance of the transformed state. Nature 227:1021-1023. 

5. Stehelin, D. et al. 1976. Purification of DNA complementary to nucleo- 
tide sequences required for neoplastic transformation of fibroblasts 
by avian sarcoma viruses. J. Mol. Biol. 101:349-365. 


continued 


635 


skemujed |Pe}USWAdXA e €'9/ 


636 


Je0UueD ©e 91 YALdVHD 


6. Stehelin, D. et al., 1976. DNA related to the transforming gene(s) of 
avian sarcoma viruses is present in normal avian DNA. Nature 260: 
170-173. 

7. Spector, D. H., Varmus, H. E., & Bishop, J. M. 1978. Nucleotide 
sequences related to the transforming gene of avian sarcoma virus 
are present in DNA of uninfected vertebrates. Proc. Nat'l. Acad. Sci. 
U.S.A. 75:4102-4106. 

8. Purchio, A. F. et al. 1978. Identification of a polypeptide encoded by 
the avian sarcoma virus src gene. Proc. Nat'l. Acad. Sci. U.S.A. 
75:1567-1671. 

9. Collett, M. S. & Erikson, R. L. 1978. Protein kinase activity associated 
with the avian sarcoma virus src gene product. Proc. Nat'l. Acad. Sci. 
U.S.A. 75:2021-2924. 

10. Willingham, M. C., Jay, G., & Pastan, |. 1979. Localization of the ASV 
src gene product to the plasma membrane of transformed cells by 
electron microscopic immunocytochemistry. Cell 18:125-134. 

1. Hunter, T. & Sefton, B. M. 1980. Transforming gene product of Rous 
sarcoma virus phosphorylates tyrosine. Proc. Nat'l. Acad. Sci. U.S.A. 
77:1311-1315. 

12. Shih, C. et al., 1979. Passage of phenotypes of chemically transformed 
cells via transfection of DNA and chromatin. Proc. Nat'l. Acad. Sci. 
U.S.A. 76:5714-5718. 

3. Krontiris, T. G. & Cooper, G. M. 1981. Transforming activity of human 
tumor DNAs. Proc. Nat'l. Acad. Sci. U.S.A. 78:1181-1184. 


4. Goldfarb, M. et al., 1982. Isolation and preliminary characterization of 
a human transforming gene from T24 bladder carcinoma cells. Nature 
296:404-409. 

5. Shih, C. & Weinberg, R. A. 1982. Isolation of a transforming sequence 
from a human bladder carcinoma cell line. Cel! 29:161-169. 

6. Pulciani, S. et al., 1982. Oncogenes in human tumor cell lines: 
Molecular cloning of a transforming gene from human bladder carci- 
noma cells. Proc. Nat'l. Acad. Sci. U.S.A. 79:2845-2849. 

7. Parada, L. F. et al., 1982. Human EJ bladder carcinoma oncogene is a 
homologue of Harvey sarcoma virus ras gene. Nature 297:474-478. 

8. Der, C. J. et al., 1982. Transforming genes of human bladder and lung 
carcinoma cell lines are homologous to the ras genes of Harvey and 
Kirsten sarcoma viruses. Proc. Nat'l. Acad. Sci. U.S.A. 79:3637-3640. 

9. Santos, E. et al., 1982. T24 human bladder carcinoma oncogene is an 
activated form of the normal human homologue of BALB- and 
Harvey-MSV transforming genes. Nature 298:343-347. 

20. Tabin, C. J. et al., 1982. Mechanism of activation of ahuman oncogene. 

Nature 300:143-149. 

21. Reddy, E. P. et al., 1982. A point mutation is responsible for the acqui- 
sition of transforming properties by the T24 human bladder carci- 
noma oncogene. Nature 300:149-152. 

22. Taparowsky, E. et al., 1982. Activation of the T24 bladder carcinoma 

transforming gene is linked to a single amino acid change. Nature 

300: 762-765. 


16.4 Cancer: A Genetic Disorder 


Cancer is one of the two leading causes of death in Western coun- 
tries, afflicting approximately one in every three individuals. Viewed. 
in this way, cancer is a very common disease. But at the cellular level, 
the development of a cancer is a remarkably rare event. Whenever 
the cells of a cancerous tumor are genetically scrutinized, they are 
invariably found to have arisen from a single cell. Thus, unlike other 
diseases that require modification of a large number of cells, cancer 
results from the uncontrolled proliferation of a single wayward cell. 
(Cancer is said to be monoclonal.) Consider for a moment that the 
human body contains trillions of cells, billions of which undergo cell 
division on any given day. Although almost any one of these dividing 
cells may have the potential to change in genetic composition and 
grow into a malignant tumor, this only occurs in about one-third of 
the human population during an entire lifetime. 

One of the primary reasons why a greater number of cells do not 
give rise to cancerous tumors is that malignant transformation requires 
more than a single genetic alteration. We can distinguish between two 
types of genetic alterations that might make us more likely to develop 
a particular type of cancer—those that we inherit from our parents 
(germ-line mutations) and those that occur during our own lifetime 
(somatic mutations). There are a few types of mutations that we can 
inherit that make us much more likely to develop cancer. The study of 
these mutations has taught us a great deal about how malfunctioning 
genes can lead to the development of cancer; some of these inherited 
cancer syndromes will be discussed later in this section. However, for 
the most part, inherited mutations are not a major factor in the occur- 
rence of most cases of the disease. One way to determine an overall 
estimate of the impact of inheritance in tumor formation is to ascer- 
tain the likelihood that two identical twins will develop the same type 
of cancer by the time the individuals reach a certain age. Studies of this 
type suggest that the likelihood two 75-year-old identical twins will 
share a particular cancer, such as breast cancer or prostate cancer, is 
generally between 10 and 15 percent, depending on the type of cancer. 
Clearly, the genes that we inherit have a significant influence on our 


risks of developing cancer, but the greatest impact comes from genes 
that are altered during our lifetime. 

The development of a malignant tumor (tumorigenesis) is a 
multistep process characterized by a progression of permanent 
genetic alterations in a single line of cells, which may occur over the 
course of many successive cell divisions and take decades to com- 
plete. Each genetic change may elicit a particular feature of the 
malignant state, such as protection from apoptosis, as discussed in 
Section 16.1. As these genetic changes gradually occur, the cells in 
the line become increasingly less responsive to the body’s normal 
regulatory machinery and better able to invade normal tissues. 
According to this concept, tumorigenesis requires that the cell 
responsible for initiating the cancer be capable of a large number of 
cell divisions. This requirement has focused a great deal of attention 
on the types of cells that are present in a tissue that might have the 
potential to develop into a tumor. 

The most common solid tumors—such as those of the breast, 
colon, prostate, and lung—arise in epithelial tissues that are nor- 
mally engaged in a relatively high level of cell division. The same is 
true of leukemias, which develop in rapidly dividing blood-forming 
tissues. The cells of most tissues can be roughly divided into three 
groups: (1) stem cells, which possess unlimited proliferation poten- 
tial, have the capacity to produce more of themselves, and can give 
rise to all of the cells of the tissue (Section 1.6); (2) progenitor cells, 
which are derived from stem cells and possess a limited ability to 
proliferate; and (3) the differentiated end products of the tissue, 
which generally lack the capability to divide. Examples of these three 
groups of cells are illustrated in Figure 17.6. 

Given the fact that tumor formation requires that a cell be capa- 
ble of extensive division, two general scenarios have been considered 
for the origin of tumors. According to one scenario, cancer arises 
from within the relatively small population of stem cells that inhabit 
each adult tissue. Given their long life and unlimited division poten- 
tial, stem cells have the opportunity to accumulate the mutations 
required for malignant transformation. According to another scenario, 
progenitor cells can give rise to malignant tumors by acquiring 


certain properties, such as the capacity for unlimited proliferation, 
as part of the process of tumor progression. As illustrated in 
FIGURE 16.7, these two scenarios are not mutually exclusive, in that 
some tumors are thought to arise from stem cells and others from 
the progenitor cell population. 

As a cancer grows, the cells in the tumor mass are subjected to a 
type of natural selection that drives the accumulation of cells with 
properties that are most favorable for tumor growth. For example, 
only those tumors containing cells that maintain the length of their 
telomeres will be capable of unlimited growth (page 473). Any cell 
that appears within a tumor that happens to express telomerase will 
have a tremendous growth advantage over other cells that fail to 
express the enzyme. Over time, the telomerase-expressing cells will 
flourish while the nonexpressing cells will die off, and all of the 
cells in the tumor will contain telomerase. Expression of telomerase 
illustrates another important feature of tumor progression: Not all of 
these changes result from genetic mutation. The activation of telom- 
erase expression can be considered an epigenetic change, one that 
results from the activation of a gene that is normally repressed. As 
discussed in Chapter 12, this type of activation process likely involves 
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FIGURE 16.7 The proposed cells of origin of malignant tumors. 
Tissues contain cells in various stages of commitment and differentiation. 
These include stem cells, multipotent progenitor cells that can give rise to a 
variety of types of differentiated cells, committed progenitor cells that can 
give rise to only one type of differentiated cell, and the differentiated cells 
themselves (see Figure 17.6 for examples). According to the model depicted 
here, tumors can arise from either tissue stem cells or progenitor cells, 
although in some cases at least, these different cells of origin give rise to 
different types of cancers (indicated by the three different colors of the tumors. 
Source: J. E. Visvader, Nature 469:316, 2011 Figure 2b. Nature by Nature 
Publishing Group. Reproduced with permission of Nature Publishing Group 
in the format reuse in a book/textbook via Copyright Clearance Center. 
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or a change in the state of DNA methylation. Once the epigenetic 
change has occurred, it is transmitted to all of the progeny of that cell 
and, consequently, represents a permanent, inheritable alteration. Even 
after they have become malignant, cancer cells continue to accumulate 
mutations and epigenetic changes that make them increasingly abnor- 
mal (as is evident in Figure 16.5). This genetic instability makes the dis- 
ease difficult to treat by conventional chemotherapy because cells that 
are resistant to the drug often arise within the tumor mass. 

The genetic changes that occur during tumor progression are 
often accompanied by histological changes, that is, changes in the 
appearance of the cells. The initial changes often produce cells that can 
be identified as “precancerous,” which indicates that they have gained 
some of the properties of a cancer cell, such as loss of certain growth 
controls, but lack the capability to invade normal tissues or metastasize 
to distant sites. The Pap smear is a test for detecting precancerous cells 
in the epithelial lining of the cervix. The development of cervical can- 
cer typically progresses over a period of more than 10 years and is 
characterized by cells that appear increasingly abnormal (less well dif- 
ferentiated than normal cells, with larger nuclei, as in FIGURE 16.8). 


(b) 


FIGURE 16.8 Detection of abnormal (premalignant) cells in a Pap 
smear. (a) Normal squamous epithelial cells of the cervix. The cells have a 
uniform shape with a small centrally located nucleus. (6) Abnormal cells from 
a case of carcinoma in situ, which is a preinvasive cancer of the cervix. The 
cells have heterogeneous shapes and large nuclei. 

Source: (a) Dr. E. Walker/Photo Researchers, Inc.; (b) SPL/Photo 
Researchers, Inc. 
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638 When cells having an abnormal appearance are detected, the precan- 
cerous lesion in the cervix can be located and destroyed by laser treat- 
ment, freezing, or surgery. Some tissues often generate benign tumors, 
which contain cells that have proliferated to form a mass that poses 
little threat of becoming malignant. The moles that we all possess are 
an example of benign tumors. Studies indicate that the pigment cells 
that compose a mole have undergone a response that causes them to 
enter a permanent state of growth arrest, referred to as senescence. 
Senescence is apparently triggered in these pigment cells after they 
have undergone certain of the genetic changes that would have other- 
wise set them on a course to becoming a malignant cancer. This pro- 
cess of “forced senescence” represents another pathway that has 
evolved to restrict the development of cancers in higher organisms. 
The molecular basis of senescence is discussed further in Section 16.7. 
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1. What is meant by the statement that cancer arises as 
the result of a genetic progression? 
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16.5 An Overview of 
Tumor-Suppressor Genes 
and Oncogenes 


The genes that have been implicated in carcinogenesis are divided 
into two broad categories: tumor-suppressor genes and oncogenes. 
Tumor-suppressor genes act as a cell’s brakes; they encode pro- 
teins that restrain cell growth and prevent cells from becoming 
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malignant (FIGURE 16.9a). The existence of such genes originally 
came to light from studies in the late 1960s in which normal and 
malignant rodent cells were fused to one another. Some of the cell 
hybrids formed from this type of fusion lost their malignant charac- 
teristics, suggesting that a normal cell possesses factors that can sup- 
press the uncontrolled growth of a cancer cell. Further evidence for 
the existence of tumor-suppressor genes was gathered from observa- 
tions that specific regions of particular chromosomes are consist- 
ently deleted in cells of certain types of cancer. If the absence of such 
genes is correlated with the development of a tumor, then it follows 
that the presence of these genes normally suppresses the formation 
of the tumor. 

Oncogenes, on the other hand, encode proteins that promote 
the loss of growth control and the conversion of a cell to a malignant 
state (Figure 16.9b). Most oncogenes act as accelerators of cell prolif- 
eration, but they have other roles as well. Oncogenes may lead to 
genetic instability, prevent a cell from becoming a victim of apopto- 
sis, or promote metastasis. The existence of oncogenes was discov- 
ered through a series of investigations on RNA tumor viruses that is 
documented in the Experimental Pathways. These viruses transform 
a normal cell into a malignant cell because they carry an oncogene 
that encodes a protein that interferes with the cell’s normal activities. 
The turning point in these studies came in 1976, when it was discov- 
ered that an oncogene called src, carried by an RNA tumor virus 
called avian sarcoma virus, was actually present in the genome of 
uninfected cells. The oncogene, in fact, was not a viral gene, but a 
cellular gene that had become incorporated into the viral genome 
during a previous infection. It soon became evident that cells possess 
a variety of genes, now referred to as proto-oncogenes, that have 
the potential to subvert the cell's own activities and push the cell 
toward the malignant state. 
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FIGURE 16.9 Contrasting effects of mutations in tumor-suppressor genes (a) and oncogenes (b). Whereas a mutation in one of the two copies (alleles) 
of an oncogene may be sufficient to cause a cell to lose growth control, both copies of a tumor-suppressor gene must be knocked out to induce the same 
effect. As discussed shortly, oncogenes arise from proto-oncogenes as the result of gain-of-function mutations, that is, mutations that cause the gene 
product to exhibit new functions that lead to malignancy. Tumor-suppressor genes, in contrast, suffer loss-of-function mutations and/or epigenetic inactiva- 


tion that render them unable to restrain cell growth. 


As discussed below, proto-oncogenes encode proteins that have 
various functions in a cell’s normal activities. Proto-oncogenes can 
be converted into oncogenes (i.e., activated) by several mechanisms 
(FIGURE 16.10): 


1. The gene can be mutated in a way that alters the properties of the 
gene product so that it no longer functions normally (Figure 
16.10, path a). 


2. The gene can become duplicated one or more times, resulting in 
gene amplification and excess production of the encoded protein 
(Figure 16.10, path b). 


3. A chromosome rearrangement can occur that brings a DNA 
sequence from a distant site in the genome into close proximity 
of the gene, which can either alter the expression of the gene or 
the nature of the gene product (Figure 16.10, path c). 


Any of these genetic alterations can cause a cell to become less 
responsive to normal growth controls. Oncogenes act dominantly, 
which is to say that a single copy of an oncogene can cause the cell to 
express the altered phenotype, regardless of whether there is a nor- 
mal, unactivated copy of the gene on the homologous chromosome 
(Figure 16.9b). Researchers have taken advantage of this property to 
identify oncogenes by introducing the DNA suspected of containing 
the gene into cultured cells and monitoring the cells for evidence of 
altered growth properties (Section 16.1). 

We saw earlier that the development of a human malignancy 
requires more than a single genetic alteration. The reason becomes 
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more apparent with the understanding that there are two types of 639 
genes responsible for tumor formation. As long as a cell has its full 
complement of tumor-suppressor genes, it is thought to be protected 
against the effects of an oncogene for reasons that will be evident 
when the functions of these genes are discussed below. Most tumors 
contain alterations in both tumor-suppressor genes and oncogenes, 
suggesting that the loss of a tumor-suppressor function within a cell 
must be accompanied by the conversion of a proto-oncogene into an 
oncogene before the cell can become fully malignant. Even then, the 
cell may not exhibit all of the properties required to invade sur- 
rounding tissues or to form secondary colonies by metastasis. 
Mutations in additional genes, such as those encoding cell-adhesion 
molecules or extracellular proteases (discussed in Section 7.8), may 
be required before these cells acquire a metastatic phenotype. 

We can now turn to the functions of the products encoded by 
both tumor-suppressor genes and oncogenes and examine how 
mutations in these genes can cause a cell to become malignant. 
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1. Contrast a benign tumor and malignant tumor; 
tumor-suppressor genes and oncogenes; dominantly 
acting and recessively acting mutations; proto- 
oncogenes and oncogenes. 


SeuaHOoUO Pue seUuay jossaiddns-iOWN] JO MAIAIBAC UY e S'9L 


oe 


Increased synthesis 


Gene duplication of encoded protein 


nies — CCD) 


OCD 


S 
RD) 


Synthesis of a protein 
containing portions 
encoded by different 
genes. The fusion protein 
is no longer under 
normal control 


A protein-coding gene 
translocated from distant 
site fuses with portion of 
gene causing formation 
of a fusion gene 


FIGURE 16.10 Activation of a proto-oncogene to an oncogene. Activation can be accomplished in several ways as indicated in this figure. In pathway a, 
a mutation in the gene alters the structure and function of the encoded protein. In pathway b, gene amplification results in overexpression of the gene. In 
pathway c, a rearrangement of the DNA brings a new DNA segment into the vicinity or up against the gene, altering either its expression or the structure of 


the encoded protein. 
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The RB Gene 


The transformation of a normal cell to a cancer cell is accompanied 
by the loss of function of one or more tumor-suppressor genes. 
High-throughput sequencing studies have identified hundreds of 
genes that are implicated as tumor suppressors in humans. Some of 
the better characterized genes, listed in Table 16.1, include genes that 
encode transcription factors (e.g., TP53 and WT1), cell cycle regula- 
tors (e.g., RB and INK4a), components that regulate G proteins 
(NF1), a phosphoinositide phosphatase (PTEN), and a protein that 
regulates protein degradation (VHL).” In one way or another, most 
of the proteins encoded by tumor-suppressor genes act as negative 
regulators of cell proliferation, which is why their elimination pro- 
motes uncontrolled cell growth. The products of tumor-suppressor 
genes also help maintain genetic stability, which may be a primary 
reason that tumors contain such an aberrant karyotype (Figure 
16.5). Some tumor-suppressor genes are involved in the develop- 
ment of a wide variety of different cancers, whereas others play a role 
in the formation of one or a few cancer types. 

It is common knowledge that members of some families are at 
high risk of developing certain types of cancers. Although these 
inherited cancer syndromes are rare, they provide an opportunity to 
identify tumor-suppressor genes that, when missing, contribute to 
the development of both inherited and sporadic (i.e., noninherited) 
forms of cancer. The first tumor-suppressor gene to be studied and 
eventually cloned—and one of the most important—is associated 
with a rare childhood cancer of the retina of the eye, called retinoblas- 
toma. The gene responsible for this disorder is named RB. The inci- 
dence of retinoblastoma follows two distinct patterns: (1) It occurs at 
high frequency and at young age in members of certain families, and 
(2) it occurs sporadically at an older age among members of the pop- 
ulation at large. The fact that retinoblastoma runs in certain families 
suggested that the cancer can be inherited. Examination of cells from 
children suffering from retinoblastoma revealed that one member of 
the thirteenth pair of homologous chromosomes was missing a small 
piece from the interior portion of the chromosome. The deletion was 
present in all of the children’s cells—both the cells of the retinal cancer 
and cells elsewhere in the body—indicating that the chromosomal 
aberration had been inherited from one of the parents. 


*For the present chapter, which deals primarily with human biology, we will follow a 
convention that is commonly used: human genes are written in capital letters (e.g., 
APC), mouse genes are written with the first letter capitalized (e.g., Brca1), and viral 
genes are written in lower case (e.g., src). 


TABLE 16.1 Tumor-Suppressor Genes 


Gene Primary tumor 


Proposed function 


Retinoblastoma is inherited as a dominant genetic trait because 
members of high-risk families that develop the disease inherit one 
normal allele and one abnormal allele. But unlike most dominantly 
inherited conditions, such as Huntington's disease, where an indi- 
vidual who inherits a missing or an altered gene invariably develops 
the disorder, children who inherit a chromosome missing the retino- 
blastoma gene have a strong disposition toward developing retino- 
blastoma. In fact, approximately 10 percent of individuals who 
inherit a chromosome with an RB deletion never develop the retinal 
cancer. How is it that a small percentage of these predisposed indi- 
viduals escape the disease? 

The genetic basis of retinoblastoma was explained in 1971 by 
Alfred Knudson of the University of Texas. Knudson proposed that 
the development of retinoblastoma requires that both copies of the 
RB gene of a retinal cell be either eliminated or mutated before the 
cell can give rise to a retinoblastoma. In other words, the cancer 
arises as the result of two independent “hits” in a single cell. In cases 
of sporadic retinoblastoma, the tumor develops from a retinal cell in 
which both copies of the RB gene have undergone successive spon- 
taneous mutation (FIGURE 16.11a). Because the chance that both 
alleles of the same gene will be the target of debilitating mutations in 
the same cell is extremely unlikely, the incidence of the cancer in the 
general population is extremely low. In contrast, the cells of a person 
who inherits a chromosome with an RB deletion are already halfway 
along the path to becoming malignant. Mutation or deletion of the 
remaining RB allele in any of the cells of the retina produces a cell 
that lacks a normal RB gene and thus cannot produce a functional 
RB gene product (Figure 16.11b). This explains why individuals 
who inherit an abnormal RB gene are so highly predisposed to 
developing the cancer. The second “hit” fails to occur in approxi- 
mately 10 percent of these individuals, who do not develop the 
disease. Knudson’s hypothesis was subsequently confirmed by 
examining cells from patients with an inherited disposition to ret- 
inoblastoma and finding that, as predicted, both alleles of the gene 
were missing or mutated in the cancer cells. Individuals with spo- 
radic retinoblastomas had normal cells that lacked RB mutations 
and tumor cells in which both alleles of the gene were mutated. 

Although deficiencies in the RB gene are first manifested in the 
development of retinal cancers, this is not the end of the story. People 
who suffer from the inherited form of retinoblastoma are also at 
high risk of developing other types of tumors later in life, particu- 
larly soft-tissue sarcomas (tumors of mesenchymal rather than epi- 
thelial origin). The consequences of RB mutations are not confined 
to persons who inherit a mutant allele. Mutations in RB alleles are a 
common occurrence in sporadic breast, prostate, and lung cancers 
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FIGURE 16.11 Mutations in the RB gene that can lead to retinoblastoma. (a) In sporadic (i.e., nonfamilial) cases of the disease, an individual begins life 
with two normal copies of the RB gene in the zygote, and retinoblastoma occurs only in those rare individuals in whom a given retinal cell accumulates 
independent mutations in both alleles of the gene. (b) In familial (i.e., inherited) cases of the disease, an individual begins life with one abnormal allele of the 
RB gene, usually present as a deletion. Thus all the cells of the retina have at least one of their two RB genes nonfunctional. If the other RB allele in a retinal 
cell becomes inactivated, usually as the result of a point mutation, that cell gives rise to a retinal tumor. 


among individuals who have inherited two normal RB alleles. When 
cells from these tumors are cultured in vitro, the reintroduction of a 
wild-type RB gene back into the cells is generally sufficient to sup- 
press their cancerous phenotype, indicating that the loss of this gene 
function contributes significantly to tumorigenesis. Let’s look more 
closely at the role of the RB gene. 

The importance of the cell cycle in cell growth and proliferation 
was discussed in Chapters 14 and 15, where it was noted that factors 
that control the cell cycle can play a pivotal role in the development 
of cancer. In its best studied role, the protein encoded by the RB 
gene, pRB, helps regulate the passage of cells from the G, stage of the 
cell cycle into S phase, during which DNA synthesis occurs. As dis- 
cussed in Section 14.4, the transition from G, to S is a time of com- 
mitment for the cell; once a cell enters S phase, it invariably proceeds 
through the remainder of the cell cycle and into mitosis. The transi- 
tion from G, to S is accompanied by the activation of many different 
genes that encode proteins ranging from DNA polymerases to cyc- 
lins and histones. Among the transcription factors involved in acti- 
vating genes required for S-phase activities are members of the E2F 
family of transcription factors, which are key targets of pRB. 
A model depicting the role of pRB in controlling E2F activity is illus- 
trated in FIGURE 16.12. During G,, E2F proteins are normally bound 
to pRB, which prevents the E2F molecules from activating a number 
of genes encoding proteins required for S-phase activities (e.g., 
cyclin E and DNA polymerase «). Studies have shown (as indicated 
in step 1 of Figure 16.12) that the E2F-pRB complex is associated 
with DNA but acts as a gene repressor rather than a gene activator. 
As the end of G, approaches, the pRB subunit of the pRB-E2F com- 
plex is phosphorylated by the cyclin-dependent kinases that regulate 
the G,-S transition. Once phosphorylated, pRB releases its bound 


E2F, allowing the transcription factor to activate gene expression, 
which marks the cell’s irreversible commitment to enter S phase. 
A cell that loses pRB activity as the result of RB mutation would be 
expected to lose its ability to inactivate E2F, thereby removing cer- 
tain restraints over the entry to S phase. E2F is only one of dozens of 
proteins capable of binding to pRB, suggesting that pRB has numer- 
ous other functions. The complexity of pRB interactions is also sug- 
gested by the fact that the protein contains at least 16 different serine 
and threonine residues that can be phosphorylated by cyclin- 
dependent kinases. It is likely that phosphorylation of different com- 
binations of amino acid residues allows the protein to interact with 
different downstream targets. 

The importance of pRB as a negative regulator of the cell cycle 
is demonstrated by the fact that DNA tumor viruses (including ade- 
noviruses, human papilloma virus, and SV40) encode a protein that 
binds to pRB, blocking its ability to bind to E2E The ability of these 
viruses to induce cancer in infected cells depends on their ability to 
block the negative influence that pRB has on progression of a cell 
through the cell cycle. By using these pRB-blocking proteins, these 
viruses accomplish the same result as when the RB gene is deleted, 
leading to the development of tumors. 
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1. Describe how retinoblastoma develops in an 
individual who has inherited a chromosome with a 
RB deletion. 


2. What is the normal role of RB in the cell? 
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FIGURE 16.12 The role of pRB in controlling transcription of genes 
required for progression of the cell cycle. During most of Gj, the 
unphosphorylated pRB is bound to the E2F protein. The E2F-pRB complex 
binds to regulatory sites in the promoter regions of numerous genes 
involved in cell cycle progression, acting as a transcriptional repressor that 
blocks gene expression. Repression probably involves the methylation of 
lysine 9 of histone H3 that modulates chromatin architecture (Section 12.3). 
Activation of the cyclin-dependent kinase (Cdk) leads to the phosphoryla- 
tion of pRB, which can no longer bind the E2F protein (step 2). In the 
pathway depicted here, loss of the bound pRB converts the DNA-bound 
E2F into a transcriptional activator, leading to expression of the genes 
being regulated (step 3). The mRNA is translated into proteins (step 4) that 
are required for the progression of cells from G, into S phase of the cell 
cycle (step 5). Other roles of pRB have been identified but are not 
discussed. 


16.7 Tumor-Suppressor Genes: 
The TP53 Gene 


The TP53 gene may have more to do with the development of human 
cancer than any other component of the genome. The gene gets its 
name from the product it encodes, p53, which is a polypeptide hav- 
ing a molecular mass of 53,000 Daltons. In 1990, TP53 was recog- 
nized as the tumor-suppressor gene that, when absent, is responsible 
for a rare inherited disorder called Li-Fraumeni syndrome. Victims 
of this disease are afflicted with a very high incidence of various 
cancers, including breast and brain cancer and leukemia. Like indi- 
viduals with the inherited form of retinoblastoma, persons with 
Li-Fraumeni syndrome inherit one normal and one abnormal (or 
deleted) allele of the TP53 tumor-suppressor gene and are thus 
highly susceptible to cancers that result from random mutations in 
the normal allele. 


The Role of p53: Guardian of the Genome 


The importance of p53 as an antitumor weapon is most evident from 
the fact that TP53 is the most commonly mutated gene in human 
cancers; approximately half of all human tumors contain cells with 
point mutations or deletions in both alleles of the TP53 gene 
(FIGURE 16.13a). Furthermore, tumors composed of cells bearing 
TP53 mutations are correlated with a poorer prognosis than those 
containing a wild-type TP53 gene. Clearly, the elimination of TP53 
function is an important step in the progression of many cancer cells 
toward the fully malignant state. 
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FIGURE 16.13 The role of the tumor suppressor gene TP53 in human 
cancer. (a) The frequency with which both alleles of the TP53 gene are 
mutated in different types of cancers. The data refers to the most common 
form of each of these 10 types of cancers. (b) p53 function is particularly 
sensitive to mutations in its DNA-binding domain. p53 functions as a 
tetramer, each subunit of which consists of several domains with different 
functions. This image shows a ribbon drawing of the DNA-binding domain. 
The six amino acid residues most often mutated in p53 molecules that have 
been debilitated in human cancers are indicated in a single-letter nomen- 
clature (Figure 2.26). These residues occur at or near the protein-DNA 
interface and either directly impact the binding of the protein to DNA or 
alter its conformation. 

Source: (a) From Janet E. Dancey, et al., reprinted from Cell 148:412, 

2012. with permission from Elsevier. (6) from Y. Cho, S. Gorina, P. D. 
Jeffrey, N. P. Pavletich, Science 265:352, 1994. Reprinted with permission 
from AAAS. 


Why is the presence of p53 so important in preventing a cell 
from becoming malignant? For one thing, p53 seems to bind a very 
long list of different proteins, as well as DNA, and is involved in a 
diverse array of cellular activities. In its best studied role, p53 serves 
as a transcription factor that acts as crucial player in a cell’s response 
to stress. When a cell sustains DNA damage, p53 responds by alter- 
ing the expression of a large number of genes involved in cell cycle 
regulation, apoptosis, and/or senescence. The importance of the 
transcription-regulating role of p53 is evident in Figure 16.13), 
which shows the location of the six mutations most commonly 
found to disable p53 in human cancers; all of them map in the region 
of the protein that interacts with DNA. One of the best studied genes 
activated by p53 encodes a protein called p21 that inhibits the cyc- 
lin-dependent kinase that normally drives a cell through the G, 
checkpoint. As the level of p53 rises in the damaged G, cell, expres- 
sion of the p21 gene is activated, and progression through the cell 
cycle is arrested (see Figure 14.9). This gives the cell time to repair 
the genetic damage before it initiates DNA replication. When both 
copies of the TP53 gene in a cell have been mutated so that their 
product is no longer functional, the cell can no longer produce the 
p21 inhibitor or exercise the feedback control that prevents it from 
entering S phase when it is not prepared to do so. Failure to repair 
DNA damage leads to the production of abnormal cells that have the 
potential to become malignant. 

Cell cycle arrest is not the only way that p53 protects an organ- 
ism from developing cancer. Alternatively, p53 can direct a geneti- 
cally damaged cell along a pathway that leads to death by apoptosis 
or necrosis, thereby ridding the body of cells with a malignant 
potential. p53 is thought to direct cell death through several path- 
ways, including the activation of expression of the BAX gene, whose 
encoded protein initiates apoptosis (Figure 15.40). Not all actions of 
p53 are dependent on its activation of transcription. p53 is also 
capable of binding directly to several members of the Bcl-2 family 
proteins (page 623) in a manner that stimulates apoptosis. For exam- 
ple, p53 can bind to Bax proteins at the outer mitochondrial mem- 
brane, directly triggering membrane permeabilization and release of 
apoptotic factors. If both alleles of TP53 should become inactivated, 
a cell that is carrying damaged DNA fails to be destroyed, even 
though it lacks the genetic integrity required for controlled growth 
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(FIGURE 16.14). Several studies have shown that established tumors 643 


in mice will undergo regression when the activity of their p53 genes 
is restored. This finding suggests that tumor development continues 
to depend on the absence of a functional TP53 gene, even after its 
cells become genetically unstable. For these reasons, the develop- 
ment of therapies that restore p53 function to p53-deficient cells has 
become an active area of research. The most advanced therapy based 
on this strategy involves injection into the tumor of an adenovirus 
that carries a wild-type TP53 gene. This approach has been used 
widely in China, but a similar adenoviral vector has not been 
approved by the FDA at the time of this writing. 

The level of p53 in a healthy G, cell is very low, which keeps its 
potentially lethal action under control. However, if a G, cell sustains 
genetic damage, as occurs if the cell is subjected to ultraviolet light or 
chemical carcinogens, the concentration of p53 rises rapidly. A simi- 
lar response can be elicited simply by injecting a cell with DNA con- 
taining broken strands. The increase in p53 levels is not due to 
increased expression of the gene but to an increase in the stability of 
the protein. In unstressed cells, p53 has a half-life of a few minutes. 
p53 degradation is facilitated by a protein called MDM2, which 
binds to p53 and escorts it out of the nucleus and into the cytosol. 
Once in the cytosol, MDM2 adds ubiquitin molecules to the p53 
molecule, leading to its destruction by a proteasome (Section 12.21). 
How does DNA damage lead to stabilization of p53? We saw in 
Section 14.5 that persons suffering from ataxia telangiectasia lack a 
protein kinase called ATM and are unable to respond properly to 
DNA-damaging radiation. ATM is normally activated following 
DNA damage, and p53 is one of the proteins ATM phosphorylates. 
The phosphorylated version of the p53 molecule is no longer able to 
interact with MDM2, which stabilizes existing p53 molecules in the 
nucleus and allows them to activate the expression of genes such as 
p21 and BAX (see Figure 16.16). 

Some tumor cells have been found that contain a wild-type 
TP53 gene but extra copies of MDM2. Such cells are thought to pro- 
duce excessive amounts of MDM2, which prevents p53 from build- 
ing to required levels to stop the cell cycle or induce apoptosis 
following DNA damage (or other oncogenic stimuli). A major effort 
is underway to develop drugs that block the interaction between 
MDM2 and p53 in an attempt to restore p53 activity in cancer cells 
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FIGURE 16.14 A model for the function of p53. (a) Cell division does not normally require the involvement of p53. (b) If, however, the DNA of a cell 
becomes damaged as the result of exposure to mutagens, the level of p53 rises and acts either to arrest the progression of the cell through G, or to direct 
the cell toward apoptosis. (c) If both copies of the TP53 gene are inactivated, the cell loses the ability to arrest the cell cycle or commit the cell to apoptosis 
following DNA damage. As a result, the cell either dies from mitotic failure or continues to proliferate with genetic abnormalities that may lead to the 


formation of a malignant growth. 


Source:(a—c) D. P. Lane, reprinted with permission from Nature 358:15, 1992; copyright 1992. Nature by Nature Publishing Group. Reproduced with 
permission of Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 


BUED EGq] SU] :seUuaeyd Josseiddns-iowN, e /'9, 


644 that retain this key tumor suppressor. The relationship between 


4aoueD) e OL YALdVHO 


MDM2 and p53 has also been demonstrated using gene knockouts. 
Mice that lack a gene encoding MDM2 die at an early stage of devel- 
opment, presumably because their cells undergo p53-dependent 
apoptosis. This interpretation is supported by the finding that mice 
lacking genes that encode both MDM2 and p53 (double knockouts) 
survive to adulthood but are highly prone to cancer. Because these 
embryos cannot produce p53, they dont require a protein such as 
MDM2 that facilitates p53 destruction. This observation illustrates an 
important principle in cancer genetics: even if a “crucial” gene such as 
RB or TP53 is not mutated or deleted, the function of that gene can be 
affected as the result of alterations in other genes whose products are 
part of the same pathway as the “crucial” gene. In this case, overex- 
pression of MDM2 can have the same effect as the absence of p53. As 
long as the tumor-suppressor pathway is blocked, the tumor-suppressor 
gene itself need not be mutated. Numerous studies indicate that both 
the p53 and pRB pathways have to be inactivated, one way or another, 
to allow the progression of most tumor cells. 

Because of its ability to trigger apoptosis, p53 plays a pivotal role 
in treatment of cancer by radiation and chemotherapy. It was 
assumed for many years that cancer cells are more susceptible than 
normal cells to drugs and radiation because cancer cells divide more 
rapidly. But some cancer cells divide more slowly than their normal 
counterparts, yet they are still more sensitive to drugs and radiation. 
An alternate theory suggests that normal cells are more resistant to 
drugs or radiation because, once they sustain genetic damage, they 
either arrest their cell cycle until the damage is repaired or they 
undergo apoptosis. In contrast, cancer cells that have sustained DNA 
damage are more likely to become apoptotic—as long as they possess 
a functioning TP53 gene. If cancer cells lose p53 function, they often 
cannot be directed into apoptosis and they become highly resistant 
to further treatment (FIGURE 16.15). This may be the primary reason 
why tumors that typically lack a functional TP53 gene (e.g., colon 
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cancer, prostate cancer, and pancreatic cancer) respond much more 
poorly to radiation and chemotherapy than tumors that possess a 
wild-type copy of the gene (e.g., testicular cancer and childhood 
acute lymphoblastic leukemias). 


The Role of p53 in Promoting Senescence 


We have seen how p53 can direct a potential cancer cell into either 
growth arrest or apoptosis. Recent studies indicate that p53 also con- 
trols signaling pathways that lead to cellular senescence, another 
mechanism that has evolved as a barrier that stops wayward cells 
from developing into malignant tumors. Unlike apoptotic cells, 
senescent cells can remain alive and metabolically active but are per- 
manently arrested in a nondividing state, as exemplified by the 
senescent melanocytes found in moles (discussed on page 638). In 
other cases, senescent cells may be ingested by phagocytic immune 
cells. Senescence can be triggered in an otherwise normal cell by the 
experimental activation of an oncogene, such as Ras, which might 
occur with some frequency during the day-to-day activities of divid- 
ing cells in a normal tissue. Studies suggest that oncogene activation 
triggers a period of accelerated division after which the senescence 
program takes effect and slams on the brakes. This is the apparent 
route that is taken during the formation of benign moles. One of the 
pathways leading to senescence involves expression of a tumor- 
suppressor gene called INK4a, which is often disabled in human 
cancers (Table 16.1). INK4a encodes two separate tumor-suppressor 
proteins (proteins that are translated in alternate reading frames of 
the mRNA): p16, which is an inhibitor of cyclin-dependent kinases 
required for progression through the cell cycle, and ARF, which sta- 
bilizes p53 by inhibiting MDM2. The precise role of p53 in directing 
cells toward the senescent state remains unclear, but inactivation of 
the TP53 gene within senescent cells can cause the cells to resume 
their progress toward full malignancy. 


Etoposide Adriamycin 


FIGURE 16.15 Experimental demonstration of the role of p53 in the survival of cells treated with chemotherapeutic agents. Cells were cultured 


from mice that had two functional alleles of the gene encoding p53 (top row), one functional allele of the gene (middle row), or were lacking a functional 


allele of the gene (bottom row). Cultures of each of these cells were grown either in the absence of a chemotherapeutic agent (first column) or in the 
presence of one of the three compounds indicated at the top of the other three columns. It is evident that the compounds had a dramatic effect on arresting 


growth and inducing cell death (apoptosis) in normal cells, whereas the cells lacking p53 continued to proliferate in the presence of these compounds. 


Source: From Scott W. Lowe, H. E. Ruley, T. Jacks, and D. E. Housman, Cell 74:959, 1993, with permission from Elsevier. 


Whether p53 moves a cell toward cell cycle arrest, apoptosis, or 
senescence apparently depends on the type of posttranslational 
modifications to which it is subjected. As in the case of the core his- 
tones (page 465), modifications include phosphorylation, acetyla- 
tion, methylation, and ubiquitination and affect more than three 
dozen residues within the p53 molecule. Moreover, the fact (1) that 
the T'P53 transcript can be alternately spliced into numerous p53 
isoforms, (2) that these p53 proteins can interact with a host of dif- 
ferent proteins, and (3) that p53 has been found to influence several 
other major tumor-related pathways (e.g., DNA repair, glucose 
metabolism, and autophagy) add additional layers of complexity to 
the p53 story. Dissecting the roles of these various factors in the 
function of this “multitasking” protein will be a daunting challenge. 
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1. Why is p53 described as the “guardian of the 
genome”? 


2. Give three mechanisms by which p53 acts to prevent a 
cell from becoming malignant. 


16.8 Other Tumor-Suppressor 
Genes 


Although mutations in RB and TP53 are associated with a wide vari- 
ety of human malignancies, mutations in a number of other tumor- 
suppressor genes are detected in only a few types of cancer. 

Familial adenomatous polyposis coli (FAP) is an inherited disor- 
der in which individuals develop hundreds or even thousands of 
premalignant polyps (adenomas) from epithelial cells that line the 
colon wall. If not removed, cells within some of these polyps are very 
likely to progress to a fully malignant stage. The cells of patients with 
this condition were found to contain a deletion of a small portion of 
chromosome 5, which was subsequently identified as the site of a 
tumor-suppressor gene called adenomatous polyposis coli, or APC. 
A person inheriting an APC deletion is in a similar position to one 
who inherits an RB deletion: If the second allele of the gene is 
mutated in a given cell, the protective value of the gene function is 
lost. The loss of the second allele of APC causes the cell to lose 
growth control and proliferate to form a polyp rather than differen- 
tiating into normal epithelial cells of the intestinal wall. The conver- 
sion of cells in a polyp to the more malignant state, characterized by 
the ability to metastasize and invade other tissues, is presumably 
gained by the accumulation of additional mutations, including those 
in TP53 (see Figure 16.20). Mutated APC genes are found not only in 
inherited forms of colon cancers, but also in the majority of sporadic 
colon tumors, suggesting that the gene plays a major role in the 
development of this disease. APC is known to suppress the Wnt 
pathway, which activates the transcription of genes (e.g., MYC and 
CCND1) that promote cell proliferation. APC has also been identi- 
fied as a microtubule plus-end binding protein and is believed to 
play a role in the attachment of microtubules to the kinetochores of 
mitotic chromosomes. Loss of APC function could therefore lead 
directly to abnormal chromosome segregation and aneuploidy 
(page 566). The presence of mutated APC DNA has been found in 
the blood of persons with early-stage colon cancer, which raises the 
possibility of a diagnostic test for the disease. 


It is estimated that breast cancer strikes approximately one in 645 


eight women living in the United States, Canada, and Europe. Of 
these cases, 5-10 percent are due to the inheritance of a gene that 
predisposes the individual to development of the disease. After an 
intensive effort by several laboratories, two genes named BRCA1 and 
BRCA2 were identified in the mid-1990s as being responsible for the 
majority of the inherited cases of breast cancer. BRCA mutations 
also predispose a woman to the development of ovarian cancer, 
which has an especially high mortality rate. 

It was pointed out in Section 14.5, that cells possess checkpoints 
that halt progression of the cell cycle following DNA damage. The 
BRCA proteins are part of one or more large protein complexes that 
respond to DNA damage and activate DNA repair by means of homol- 
ogous recombination. Cells with mutant BRCA proteins accumulate 
chromosomal breaks and exhibit a highly aneuploid karyotype. In 
cells with a functional TP53 gene, failure to repair DNA damage leads 
to the activation of p53, which causes the cell to either arrest cell cycle 
progress or undergo apoptosis, as illustrated in FIGURE 16.16. 

We have seen in this chapter that apoptosis is one of the body’s 
primary mechanisms of ridding itself of potential tumor cells. 
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FIGURE 16.16 DNA damage initiates activity of a number of proteins 
encoded by both tumor-suppressor genes and proto-oncogenes. 

In this simplified figure, DNA damage is seen to cause double-strand 
breaks in the DNA (step 1) that are repaired by a proposed multiprotein 
complex that includes BRCA1 and BRCA2 (step 2a). Mutations in either of 
the genes that encode these proteins can block the repair process (step 
2b). If DNA damage is not repaired, a checkpoint is activated that leads to 
a rise in the level of p53 activity (step 3a). The p53 protein is normally 
inhibited by interaction with the protein MDM2 (step 3b). p53 is a 
transcription factor that activates expression of either (1) the p27 gene (step 
4a), whose product (p21) causes cell cycle arrest, or (2) the BAX gene (step 
4b), whose product (Bax) causes apoptosis. p53 activation can also promote 
cellular senescence, but the pathway is unclear. 

Source: Reprinted with permission after J. Brugarolas and T. Jacks, Nature 
Med 3:721, 1997, copyright 1997, Nature Medicine by Nature Publishing 
Group. Reproduced with permission of Nature Publishing Group in the 
format reuse in a book/textbook via Copyright Clearance Center. 
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were pathways that promoted cell survival rather than cell destruc- 
tion. The best studied cell-survival pathway involves the activation 
of a kinase called PKB (AKT) by the phosphoinositide PIP3. PIP3, in 
turn, is formed by the catalytic activity of the lipid kinase PI3K (see 
Figure 15.25). Activation of the PI3K/PKB pathway leads to an 
increased likelihood that a cell will survive a stimulus that normally 
would lead to its destruction. Whether a cell lives or dies following a 
particular event depends to a large degree on the balance between 
proapoptotic and antiapoptotic signals. Mutations that affect this 
balance, such as those that contribute to the overexpression of PKB 
or PI3K, can shift this balance in favor of cell survival, which can 
provide a potential cancer cell with a tremendous advantage. 
Another protein that can affect the balance between life and death of 
a cell is the lipid phosphatase, PTEN, which removes the phosphate 
group from the 3-position of PIP3, converting the molecule into 
PI(4,5)P>, which cannot activate PKB. Cells in which both copies of 
the PTEN gene are inactivated tend to have an excessively high level 
of PIP3, which leads to an overactive population of PKB molecules. 
When a normal PTEN gene is introduced into tumor cells that lack a 
functioning copy of the gene, the cells typically undergo apoptosis, 
as would be expected. Like the other tumor-suppressor genes listed 
in Table 16.1, mutations in PTEN cause a rare hereditary disease 
characterized by an increased risk of cancer, and such mutations are 
also found in a variety of sporadic cancers. 

Mutation or deletion is not the only mechanism by which tumor- 
suppressor genes can be inactivated. Tumor suppressor genes, such as 
BRCAI or PTEN, are often rendered nonfunctional as the result of 
epigenetic mechanisms, such as DNA methylation or histone modifi- 
cation, which silences transcription of the gene (Section 12.17). 


REVIEW PI 


1. What gene is responsible for causing Familial 
adenomatous polyposis coli? What is its normal role in 
the cell? 


2. Describe the role of the BRCA proteins and how their 


absence may lead to the development of cancer. 
eo“ 


16.9 Oncogenes 


As described above, oncogenes encode proteins that promote the loss 
of growth control and the conversion of a cell to a malignant state. 
Oncogenes are derived from proto-oncogenes (page 638), which are 
genes that encode proteins having a function in the normal cell. 
Numerous oncogenes were initially identified as part of the genomes 
of RNA tumor viruses, but many more have been identified because 
of their importance in tumorigenesis as determined in laboratory 
animals or human tumor samples. Different oncogenes become acti- 
vated in different types of tumors, which reflects variations in the 
signaling pathways that operate in diverse cell types. The oncogene 
mutated most frequently in human tumors is RAS, which encodes a 
GTP-binding protein (RAS) that functions as an on-off switch for a 
number of key signaling pathways controlling cell proliferation 
(Section 15.11) and metabolism (see Figure 16.18).’ Oncogenic RAS 


>The human genome actually contains three different RAS genes and three different 
RAF genes that are active in different tissues. Of these, KRAS and BRAF are most 
often implicated in tumor formation. 


mutants typically encode a protein whose GTPase activity cannot be 
stimulated, which leaves the molecule in an active GTP-bound form, 
sending continuous proliferation signals along the pathway. Despite 
extensive efforts to develop anti-RAS strategies for cancer therapy, no 
drugs that block RAS function have yet to be approved. The func- 
tions of a number of oncogenes are summarized in FIGURE 16.17 and 
discussed below.* 


Oncogenes That Encode Growth 
Factors or Their Receptors 


The first connection between oncogenes and growth factors was made 
in 1983, when it was discovered that the cancer-causing simian sar- 
coma virus contained an oncogene (sis) derived from the cellular gene 
for platelet-derived growth factor (PDGF), a protein present in human 
blood. Cultured cells that are transformed with this virus secrete large 
amounts of PDGF into the medium, which causes the cells to prolifer- 
ate in an uncontrolled fashion. Overexpression of PDGF has been 
implicated in the development of brain tumors (gliomas). 


“The reader is referred to the Human Perspective of Chapter 7 (Section 7.8) for a 
discussion of genes that encode cell-surface molecules and extracellular proteases that 
play an important role in tissue invasion and metastasis. 
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FIGURE 16.17 A schematic diagram summarizing the types of 
proteins encoded by proto-oncogenes. These include growth factors (1), 
receptors for growth factors (2), protein kinases and the proteins that 
activate them (3), proteins that regulate the cell cycle (4), transcription 
factors (5), proteins that modify chromatin (6), metabolic enzymes (7), and 
proteins that inhibit apoptosis (8). Proteins involved in mitosis, tissue 
invasion, and metastasis are not included. 


Another oncogenic virus, avian erythroblastosis virus, was found 
to carry an oncogene (erbB) that encodes an EGF receptor that is 
missing part of the extracellular domain of the protein that binds the 
growth factor. One might expect that the altered receptor would be 
unable to signal a cell to divide, but just the reverse is true. This altered 
version of the receptor stimulates the cell constitutively, that is, regard- 
less of whether the growth factor is present in the medium. This is the 
reason why cultured cells that carry the altered gene proliferate in an 
uncontrolled manner. A number of spontaneous human cancers have 
been found to contain cells with genetic alterations that affect growth 
factor receptors, including EGFR. Most commonly, the malignant 
cells contain a much larger number of the receptors in their plasma 
membranes than do normal cells. The presence of excess receptors 
makes the cells sensitive to much lower concentrations of the growth 
factor, thus they are stimulated to divide under conditions that would 
not affect normal cells. As discussed below, growth factor receptors 
have become a favored target for therapeutic antibodies, which bind 
to the receptor’s extracellular domain, and for small molecule inhibi- 
tors, which bind to the receptor’s intracellular tyrosine kinase domain. 


Oncogenes That Encode Cytoplasmic 
Protein Kinases 


Overactive protein kinases function as oncogenes by generating sig- 
nals that lead to inappropriate cell proliferation or survival. Raf, for 
example, is a serine-threonine protein kinase that heads the MAP 
kinase cascade, the primary growth-controlling signaling pathway in 
cells (Section 15.11). It is evident that Raf is well positioned to wreak 
havoc within a cell should its enzymatic activity become altered as the 
result of mutation. As with the growth factor receptors and Ras, muta- 
tions that turn Raf into an enzyme that remains in the “on” position are 
most likely to convert the proto-oncogene into an oncogene and con- 
tribute to the cell’s loss of growth control. Raf is most closely linked to 
melanoma, where BRAF mutations play a causative role in the devel- 
opment of approximately 70 percent of these cancers. Another group 
of cytoplasmic kinases that are often deregulated in cancer are the cyc- 
lin-dependent kinases, especially Cdk4 and Cdké6 (Figure 14.8). Cyclin 
D1, a regulator of these Cdks, is also a frequent oncogene. 

The first oncogene to be discovered, SRC, is also a protein 
kinase, but one that phosphorylates tyrosine residues on protein 
substrates rather than serine and threonine residues. Transformation 
of a cell by a src-containing tumor virus is accompanied by the phos- 
phorylation of a wide variety of proteins. Included among the appar- 
ent substrates of Src are proteins involved in signal transduction, 
control of the cytoskeleton, and cell adhesion. For an unknown 
reason, SRC mutations appear only rarely among the repertoire of 
genetic changes in human tumor cells. 


Oncogenes That Encode 
Transcription Factors 


A number of oncogenes encode proteins that act as transcription 
factors. The progression of cells through the cell cycle requires the 
timely activation (or repression) of a large variety of genes whose 
products contribute in various ways to cell growth and division. It is 
not surprising, therefore, that alterations in the proteins that control 
the expression of these genes could seriously disturb a cell's normal 
growth patterns. Probably the best studied oncogene whose product 
acts as a transcription factor is MYC. 


Myc regulates the expression of a huge number of proteins and 
noncoding RNAs (rRNAs, tRNAs, and miRNAs) involved in cell 
growth and proliferation. When MYC expression is selectively 
blocked, the progression of the cell through G, is blocked. The MYC 
gene is one of the proto-oncogenes most commonly altered in 
human cancers, often being amplified within the genome or rear- 
ranged as the result of a chromosome translocation. These chromo- 
somal changes are thought to remove the MYC gene from its normal 
regulatory influences and increase its level of expression in the cell, 
producing an excess of the Myc protein. One of the most common 
types of cancer among populations in Africa, called Burkitt’s lym- 
phoma, results from the translocation of a MYC gene to a position 
adjacent to an antibody gene. The disease occurs primarily in per- 
sons who have also been infected with Epstein-Barr virus. This same 
virus causes only minor infections (e.g., mononucleosis) in people 
living in Western countries and is not associated with tumorigenesis. 


Oncogenes That Encode Proteins 
That Affect the Epigenetic State 
of Chromatin 


As discussed in Chapter 12, two of the most important factors in 
determining the epigenetic state of chromatin are (1) whether par- 
ticular sites in the DNA of gene promoters are methylated or not and 
(2) the particular modifications present in the tails of certain core 
histones within the nucleosomes of these same gene promoters. 
DNA methylation tends to silence genes whereas histone modifica- 
tions may either activate or repress gene transcription. Recent stud- 
ies have indicated that a number of oncogenes encode proteins that 
affect DNA methylation or histone modifications. These include 
DNA methyltransferases, histone acetylases and deacetylases, his- 
tone methyltransferases and demethylases, and proteins present 
within chromatin remodeling complexes. Mutations in any of these 
classes of genes can promote tumorigenesis by increasing or decreas- 
ing transcription of genes involved in the various signaling and reg- 
ulatory pathways that affect cell proliferation, survival, migration, 
and other functions. To cite just one example, acute myeloid leuke- 
mia is characterized by recurrent mutations in DNMT3A, a gene 
whose product is involved in maintaining DNA methylation pat- 
terns during DNA replication. A reduction in the level of DNA 
methylation could lead to increased movement of transposable ele- 
ments, which would cause genetic instability, as well as increased 
transcription of certain proto-oncogenes. Conversely, an increase in 
the level of DNA methylation of the promoter regions of tumor sup- 
pressor genes is known to silence the expression of genes that exert a 
key inhibitory influence on tumorigenesis. 


Oncogenes That Encode 
Metabolic Enzymes 


It was mentioned in Section 16.1 that tumor cells depend much more 
on glycolysis than do normal cells. This is only one of a number of 
major differences in metabolism between normal and tumor cells. 
Another difference, which was one of the surprising discoveries to 
emerge from genome sequencing studies, was the repeated presence 
of mutations in the TCA cycle enzyme isocitrate dehydrogenase 
(IDH1 and IDH2) in the tumor cells of patients with glioblastoma 
(brain cancer) and acute myeloid leukemia (AML). These mutations 
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to a-ketoglutarate (Figure 5.7) and instead convert the substrate to 
an abnormal metabolite called 2-hydroxyglutarate (2-HG), which 
accumulates to high levels in the tumor. The elevated levels of 2- HG 
have an impact on a number of processes including histone demeth- 
ylation and DNA methylation. It is proposed that the disruption of 
these epigenetic processes would likely result in the aberrant regula- 
tion of gene expression within tumor cells. 


Oncogenes That Encode Products 
That Affect Apoptosis 


Apoptosis is one of the body’s key mechanisms to rid itself of tumor 
cells at an early stage in their progression toward malignancy. 
Consequently, any alteration that diminishes a cell’s ability to self- 
destruct would be expected to increase the likelihood of that cell giv- 
ing rise to a tumor. This was evident in the previous discussion of the 
role of the PI3K/PKB pathway in cell survival and tumorigenesis 
(page 597). With this in mind, it is not surprising that both PI3K and 
PKB are encoded by documented oncogenes.” The oncogene most 


°Mutations in PI3K or PKB/AKT are oncogenic for reasons other than just their roles 
in cell survival. For example, the PI3K pathway is a major driver of the aerobic glyco- 
lytic pathway characteristic of tumor cells (discussed in Section 16.1). It does this by 
activating the transcription factors HIF and MYC, which turn on the expression of 
genes encoding glucose transporters and glycolytic enzymes. 


closely linked to apoptosis is BCL-2, which encodes a membrane- 
bound protein that inhibits apoptosis (page 623). 

The role of BCL-2 in apoptosis is most clearly revealed in the 
phenotypes of knockout mice that are lacking a BCL-2 gene. Once 
formed, the lymphoid tissues of these mice undergo dramatic regres- 
sion as the result of widespread apoptosis. Like MYC, the product of 
the BCL-2 gene becomes oncogenic when it is expressed at higher- 
than-normal levels, as can occur when the gene is translocated to an 
abnormal site on the chromosome. Certain human lymphoid can- 
cers (called follicular B-cell lymphomas) are correlated with the 
translocation of the BCL-2 gene next to a gene that codes for the 
heavy chain of antibody molecules. It is suggested that overexpres- 
sion of the BCL-2 gene leads to the suppression of apoptosis in lym- 
phoid tissues, allowing abnormal cells to proliferate to form 
lymphoid tumors. The BCL-2 gene may also play a role in reducing 
the effectiveness of chemotherapy by keeping tumor cells alive and 
proliferating despite damage by the drug treatment. 

Over the past few sections, we have discussed a number of the 
most important tumor suppressors and oncogenes involved in tum- 
origenesis. FIGURE 16.18 provides a simplified overview of some of 
these proteins and the signaling pathways in which they operate. 
Tumor suppressors and tumor-suppressive pathways are shown in 
red, oncogenes and tumor-promoting pathways are shown in blue. 
The basic functions of each of the tumor suppressors and oncogenes 
depicted in the figure are noted in the legend of Figure 16.18. The 
remarkable diversity of protein activities that can contribute to tum- 
origenesis is evident from this figure. 
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FIGURE 16.18 An overview of several of the 
signaling pathways involved in tumorigenesis that 
were discussed in this section. Tumor suppressors and 
tumor suppression are shown in red, whereas onco- 
genes and tumor stimulation are shown in blue. Arrows 
indicate activation, perpendicular lines indicate 
inhibition. Among the proteins depicted in this figure 
are transcription factors (p53, MYC, and E2F), a 
transcriptional coactivator or corepressor (pRB), a lipid 
kinase (PI3K) and lipid phosphatase (PTEN), a cytoplas- 
mic tyrosine kinase (RAF) and its activator (RAS), a 
GTPase activating protein for RAS (NF1), a protein 
kinase that promotes cell survival (PKB/AKT), a protein 
that senses DNA breaks (BRCA), subunits of a cyclin- 
dependent kinase (CYCLIN D1 and CDK4), a Cdk 
inhibitor (p21), an antiapoptotic protein (BCL-2), a 
ubiquitin ligase (MDM2), an enzyme that elongates 
DNA (telomerase), and a protein that binds growth 
factors (e.g., EGFR). The arrows and lines do not 
necessarily represent direct activation or inhibition. For 
example, PTEN inhibits PKB through removal of a 
phosphate from PIP; and EGFR activates RAS via GRB2 
and SOS. The dashed line indicates indirect action by 
activation of expression of the MYC gene. 
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1. What types of proteins are encoded by proto- 
oncogenes, and how do mutations in each type of 


proto-oncogene cause a cell to become malignant? 
e_____—_—_ 


16.10 The Mutator Phenotype: 
Mutant Genes Involved 
in DNA Repair 


If one considers cancer as a disease that results from alterations in the 
DNA of somatic cells, then it follows that any activity that increases 
the frequency of genetic mutations is likely to increase the risk of 
developing cancer. As discussed in Chapter 13, nucleotides that 
become chemically altered or nucleotides that are incorporated incor- 
rectly during replication are selectively removed from the DNA strand 
by DNA repair. DNA repair processes require the cooperative efforts 
of a substantial number of proteins, including proteins that recognize 
the lesion, remove a portion of the strand containing the lesion, and 
replace the missing segment with complementary nucleotides. If any 
of these proteins are defective, the affected cell can be expected to 
display an abnormally high mutation rate, which is described as a 
“mutator phenotype.” Cells with a mutator phenotype are likely to 
incur secondary mutations in both tumor-suppressor genes and 
oncogenes, which increases their risk of becoming malignant. 


—_ 


16.11 MicroRNAs: A New Player 
in the Genetics of Cancer 


REVIEW | 


1. Describe the mutator phenotype. 


Recall from Section 11.12 that microRNAs are tiny regulatory RNAs 
that negatively regulate the expression of target mRNAs. Given that 
cancers arise as the result of abnormal gene expression, it would not 
be surprising to discover that miRNAs are somehow involved in tumo- 
rigenesis. In 2002, it was reported that the locus that encodes two 
microRNAs, miR-15a and miR-16, was either deleted or underex- 
pressed in most cases of chronic lymphocytic leukemia. It was subse- 
quently shown that these two miRNAs act to inhibit expression of the 
mRNA that encodes the antiapoptotic protein BCL-2, a known proto- 
oncogene. In the absence of the miRNAs, the oncogenic BCL-2 protein 
is overexpressed, which promotes development of leukemia. Because 
these miRNAs act to inhibit tumorigenesis, they can be thought of as 
tumor suppressors. When the leukemic cells lacking miR-15a and 
miR-16 were genetically engineered to reexpress these RNAs, they 
underwent apoptosis as would be expected if a missing tumor 
suppressor activity was restored. The locus that encodes miR-15a 
and miR-16 is also deleted in other types of cancers, suggesting it has 
widespread importance in tumor suppression. The expression of two 
of the most important human oncogenes, RAS and MYC, has also been 
shown to be inhibited by an miRNA, namely, let-7, which was the first 
miRNA to be discovered (Section 11.12). It has also been shown that 
the tumor suppressor p53 regulates the expression of numerous 
miRNAs, and is itself transcriptionally regulated by other miRNAs. 


Some miRNAs act more like oncogenes than tumor suppres- 649 


sors. One specific cluster of miRNA genes, for example, is overex- 
pressed during the formation of certain human lymphomas. 
Overexpression of these miRNAs can occur because the gene cluster 
encoding them is present in increased number (amplified) in the 
tumor cells, or it can occur because the gene cluster is excessively 
transcribed as the result of overly active transcription factors, includ- 
ing MYC. When mice are genetically engineered to overexpress 
these particular miRNAs, the animals develop lymphomas as would 
be predicted if the genes encoding them were acting as oncogenes. 
The abnormal expression of miRNAs has also been implicated as a 
causal factor in tumor cell invasiveness and metastasis, which ele- 
vates the interest in these RNAs to an even higher level. 

A number of microarray studies that survey large numbers of 
these tiny regulatory RNAs suggest that most human cancers have a 
characteristic miRNA expression profile, just as they have a charac- 
teristic mRNA expression profile. These miRNA expression profiles 
may soon serve as sensitive and accurate biomarkers to identify the 
exact type of tumor a person is suffering from and the best avenue of 
treatment. Researchers are also actively investigating the use of 
miRNAs as possible anticancer therapeutics. It may be possible, for 
example, to treat patients with synthetic RNAs that act as “miRNA 
sponges.” Therapeutic RNAs of this class would possess sequences 
that were complementary to oncogenic miRNAs and would act to 
bind and sequester such miRNAs, thereby blocking their carcino- 
genic activity. 


REVIEW | 


1. Discuss how the presence or absence of particular 
microRNAs influences the development of cancer. 
ee... 


16.12 The Cancer Genome 


All cancers arise as the result of genetic alterations. As was evident 
from the previous discussion, the genes involved in tumorigenesis 
constitute a specific subset of the genome whose products are 
involved in such activities as the progression of a cell through the cell 
cycle, adhesion of a cell to its neighbors, apoptosis, and repair of 
DNA damage. Taken together, hundreds of different genes have been 
identified as “cancer genes,’ that is, genes that are thought to have 
some causal role in the development of at least one type of malig- 
nancy. Over the past few years a concerted effort has been made to 
determine which of these genes are altered—either by point muta- 
tion, translocation, deletion, or duplication—in various types of 
tumors. This effort has been aided by recent advances in DNA 
sequencing that allow researchers to determine the nucleotide 
sequences of specific regions of the genome much more rapidly and 
inexpensively than ever before. 

It was hoped that most types of cancers would be characterized 
by alterations in a relatively small number of genes. It has been 
known for quite a while, for example, that a large percentage of mela- 
nomas exhibit mutant BRAF genes and a large percentage of colorec- 
tal cancers exhibit mutant APC genes. Similarly, the various types of 
leukemias are characterized by specific translocations, as exemplified 
by the BCR-ABL translocation in CML (Section 16.16). In all of these 
cases, these mutations occur at an early stage in tumor development 
and are thought to be critically important in turning the cell toward 
an eventual malignant state. However, results from the initial studies 
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Colorectal Cancer Mx38 


(a) 


Colorectal Cancer Mx32 


(b) 


FIGURE 16.19 The genomic landscape of colorectal cancers. These two-dimensional maps depict the genes most frequently mutated in colorectal 
tumors. Each reddish projection represents a different gene. The five genes that are mutated in a large proportion of tumors are represented by the tallest 
projections, referred to as “mountains,” and are specifically named. The 50 or so other genes that are mutated at a much lower frequency constitute the 
smaller “hills” of the genomic landscape. To depict the degree to which colorectal tumors from different patients share commonly mutated genes, the 
mutational landscapes from two individual tumors (identified as Mx38 and Mx32) are depicted in this illustration. Genes that were found to be somatically 
mutated in each of these individual tumors are indicated by the white circles. It is evident that very few mutations are shared between the tumors of these 
two individuals. In the example depicted here, only the APC and TP53 genes are mutated in both cases of the disease. (Note: The positions of the genes in 
this two-dimensional landscape are ordered with loci from one end of chromosome 1 at the bottom left of the landscape, proceeding through each of the 
autosomes in ascending order, until finally reaching the loci from chromosome X at the right edge of the landscape.) 


Source: (a, b) From Laura D. Wood et al., Courtesy of Bert Vogelstein, Science 318:1113, 2007; © 2007. Reprinted with permission from AAAS. 


of cancer genomes suggest that the same types of tumors taken from 
different patients possess widely divergent combinations of aberrant 
genes. This observation presumably reflects the many different 
routes that individual tumors can take to escape the cell’s normal 
antitumor protections. These findings can be displayed as shown in 
FIGURE 16.19, where mutated genes identified in a large number of 
colorectal cancers are shown as peaks within a two-dimensional 
“mutational landscape.” The height of the various peaks reflects the 
frequency with which that particular gene is mutated in this particu- 
lar type of cancer. It is evident from this type of display that a small 
number of genes are mutated in a large proportion of tumors; these 
can be thought of as “mountains” in the landscape. For the most part, 
these are the oncogenes that cancer researchers have been focusing 
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on over the years. Each type of cancer has its own characteristic com- 
plement of frequently-mutated genes. In the case of human colorectal 
cancers, the three most frequently mutated genes shown in 
Figure 16.19, namely APC, KRAS, and TP53, tend to become mutated. 
at distinct stages in the progression of this type of cancer 
(FIGURE 16.20). Mutations in both copies of the APC gene are found 
in over 60 percent of the smallest benign adenomas of the colon, sug- 
gesting that mutations in this gene often represent a first step in the 
formation of colon cancers. Larger adenomas as well as cell masses in 
the early stages of malignancy tend to contain mutations in one of the 
RAS oncogenes, namely KRAS. In contrast, the TP53 gene tends 
to mutate (or become epigenetically silenced) only at later stages 
along the path when the tumor is clearly malignant. Cells displaying 
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FIGURE 16.20 A model describing the sequence of genetic mutations that often occurs during the development of colon cancer. The histological 
changes that occur at the various stages in the development of this cancer are indicated by the drawings. Adenomas are benign growths (polyps) that, if not 
removed during a colonoscopy, have the potential to develop over the years into malignant tumors. The genes indicated at each step in the development of 
these tumors are some of the primary drivers of colorectal tumorigenesis, as discussed in the text. About 70 percent of colon cancers exhibit chromosome 


instability and aneuploidy. 


Source: A. Walther, et al., Nature.com; Nature Revs Cancer 9:491, 2009, Figure 1. Nature Reviews Cancer by Nature Publishing Group. Reproduced with 
permission of Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 


chromosome instability, which is reflected in increasingly abnormal 
karyotypes, only appear after mutations in KRAS. 

Figures 16.19 and 16.20 illustrate how a small number of genes 
tend to be mutated at relatively high frequency in colorectal cancer. 
However, a surprisingly large number of genes are mutated at a lower 
frequency (less than 5 percent of cases); these have been described as 
“hills” There are approximately 50 different genes representing the 
hills in the mutational landscape of colorectal cancers in Figure 
16.19. While we can presume that the genes that are mutated at high 
frequency (the mountains) are important factors in driving the cells 
into malignancy, considerable debate is centered on the roles of the 
genes that are mutated at lower frequency (the hills). Many of the 
genes represented by the hills almost certainly have a causal role in 
determining properties of the malignant phenotype, even if they 
provide the tumor with only a small selective advantage. Mutations 
that cause or contribute to the malignant phenotype are described as 
drivers. Other genes that constitute a hill may simply represent “pas- 
sengers,’ that is, genes that tend to become mutated for some reason 
but have no effect on the phenotype of the cancer cell. It may be a 
daunting challenge to determine which of these genes are drivers 
and which are simply passengers. In addition to the genes of the 
mountains and hills, there is a large number of other genes that show 
up in a mutant state at very low frequency in a population of tumors. 
Close examination of the locations of the white circles in the two 
landscapes in 16.194 and b indicates that only a small number of 
mutated genes in these cancers are shared by the two individuals. 
Thus, in a sense, each person suffers from his or her own distinct 
type of disease. Even within a given patient, metastatic lesions that 
have spread to different regions of the body can have remarkably 
different complements of mutated genes. These differences presum- 
ably reflect the different genetic changes that occur during the 
growth of each secondary tumor. 

It is evident from this and a large number of other studies on 
other types of cancers that the mutational landscape of the cancer 
genome is highly complex. However, numerous genomic studies 
have shown that most of these large numbers of genes encode pro- 
teins that participate as components of a relatively small number 
of pathways. A study of the mutational landscape of over 3,000 
tumors from 12 major cancer types identified 127 gene mutations 
that impacted a core set of 20 cellular pathways or processes 
(FIGURE 16.21). Although there is significant variation among the 
different cancers studied, most tumors exhibited mutations that 
affected two major pathways: p53 and PI3K. Thus, as discussed else- 
where in this chapter, cancer can be thought of not simply as a dis- 
ease of aberrant genes but as one of aberrant cellular pathways. 
Mutations in any one of a number of genes disrupt the same pathway 
and thus have the same consequence for the cells. For example, the 
PI3K pathway in breast cancer can be activated by mutations in the 
catalytic subunit of PI3K (PIK3CA), amplification of the growth fac- 
tor receptor HER2, mutations in the PI3K effector PKB (AKTI), or 
loss of the PI3K antagonist PTEN. A few of the most prominent 
pathways that tend to be deregulated in many different types of can- 
cers were indicated in Figure 16.18. Looking at cancer as a “pathway” 
disease rather than a “genetic” disease generates more optimism 
among drug developers because it suggests that disrupting (or restor- 
ing) any one of the key steps in a single essential pathway may be 
sufficient to derail the malignant cells and lead to tumor regression. 

Before leaving the subject of cancer genetics, it should be noted 
that although some cancers may result from the gradual multistep 
progression of individual point mutations as described above, other 


cancers may not follow this same path. Some forms of carcinogene- 651 


sis are thought to be initiated by catastrophic events that lead to 
widespread genetic instability over a relatively small number of cell 
divisions. For example, mutations in a gene involved in DNA repli- 
cation or DNA repair, as occurs in cases of colorectal cancer, might 
rapidly spawn cells carrying widespread genetic abnormalities. 
Tumor formation may also be initiated when cells undergo an abnor- 
mal cell division and possess aberrant numbers of chromosomes. 
The best way to determine how a cancer originated is to analyze the 
state of the genome in cells at very early stages in tumor formation. 
Unfortunately, it is virtually impossible to identify tumors when they 
are composed of a small number of cells. By the time that tumors are 
detected, the cells already exhibit a high degree of genetic derange- 
ment, making it difficult to determine whether these genetic altera- 
tions are a cause or an effect of tumor growth. 


REVIEW | 


1. What is meant by the term mutational landscape in 
reference to the genome of a cancer cell? What do 
mountains and hills represent? 
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16.13 Gene-Expression Analysis 


A technology for analyzing gene expression using DNA microarrays 
(or DNA chips), was described in detail in Section 12.10. Briefly, a 
glass slide is prepared that can contain anywhere from a handful to 
thousands of spots of DNA, each spot containing the DNA corre- 
sponding to a single, known gene. Any particular set of genes, such 
as those thought to be involved in growth and division or those 
thought to be involved in the development and differentiation of 
lymphocytes or some other cell type, can be included within the 
microarray. Once prepared, the microarray is incubated with fluo- 
rescently labeled cDNAs synthesized from the mRNAs of a particu- 
lar population of cells, such as those from a tumor mass that has 
been removed during surgery, or from the cancerous blood cells of a 
patient with leukemia. The fluorescently labeled cDNAs hybridize to 
spots of complementary DNA immobilized on the slide, and subse- 
quent analysis of the fluorescence pattern tells the investigators 
which mRNAs are present within the tumor cells and their relative 
abundance within the mRNA population. 

Studies with DNA microarrays have shown that gene- 
expression profiles can provide invaluable information about the 
properties of a tumor. It has been found, for example, that (1) pro- 
gression of a tumor is correlated with a change in expression of par- 
ticular genes, (2) certain cancers that appear to be similar by 
conventional criteria can be divided into subtypes that have differ- 
ent clinical features on the basis of their gene-expression profiles, 
(3) the gene-expression profile of an individual patient’s tumor can 
reveal how aggressive (i.e., how lethal) the cancer is likely to be, and 
(4) the gene-expression profile of an individual patient’s tumor can 
provide clues as to which type of therapeutic strategy will be most 
likely to induce tumor regression. We can look at a few of these 
issues in more detail. 

FIGURE 16.22 depicts the levels at which 50 different genes were 
found to be transcribed in two different types of leukemia, acute 
lymphoblastic leukemia (ALL) and acute myeloid leukemia (AML). 
The genes used in this figure, which are named on the right side, 
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FIGURE 16.21 A small number of signaling pathways are commonly altered in cancers. This table shows how frequently certain genes are mutated in a 
diverse group of 3281 tumor samples representing 12 different cancers. 127 significantly mutated genes were identified and classified into one of 20 major 
cellular processes. The 12 cancers investigated were breast adenocarcinoma (BRCA), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), 
uterine corpus endometrial carcinoma (UCEC), glioblastoma multiforme (GBM), head and neck squamous cell carcinoma (HNSC), colon and rectal 

carcinoma (COAD, READ), bladder urothelial carcinoma (BLCA), kidney renal clear cell carcinoma (KIRC), ovarian serous carcinoma (OV), and acute myeloid 


leukaemia (LAML). 


Source: From C. Kandoth et al., Nature 505:333-339, 2013. 


represent those whose transcription showed the greatest difference 
between these two types of blood-cell cancers. Each column repre- 
sents the results from a single patient with ALL or AML, so that dif- 
ferent columns allow one to compare the similarities in expression of 
the gene from one patient to the next. Gene-expression levels are 
indicated from dark blue (lowest level) to dark red (highest level). 
The top half of the figure identifies genes that are transcribed at a 
much higher level in ALL cells, whereas the bottom half identifies 
genes that are transcribed to a much higher level in AML cells. These 
studies make it evident that there are many differences in gene 
expression between different types of tumors. Some of these differ- 
ences can be correlated with biological and clinical differences 
between the tumors; one, for example, is derived from a myeloid cell 
and the other from a lymphoid cell (see Figure 17.6). Most differ- 
ences, however, cannot be explained. Why, for example, is the gene 
encoding catalase (the last gene on the list) expressed at a low level 
in ALL and a high level in AML? Even if these studies can’t answer 


this question, they can provide cancer researchers with a list of genes 
to look at more closely as potential targets for therapeutic drugs. The 
earlier a cancer is discovered, the more likely a person can be cured; 
this is one of the cardinal principles of cancer treatment. Yet a cer- 
tain percentage of tumors will prove fatal, even if discovered and 
removed at an early stage. It is apparent, for example, that some 
early-stage breast cancers already contain cells capable of seeding the 
formation of secondary tumors (metastases) at distant locations, 
whereas others do not. These differences determine the prognosis of 
the patient. It was found in a landmark study by a team of Dutch 
researchers in 2002 that the prognosis of a given breast cancer is 
likely to be revealed in the level of expression of approximately 70 
genes out of the thousands that were studied in DNA microarrays 
(FIGURE 16.23). This finding has had important clinical applications 
in guiding the treatment of breast cancer patients. Those patients 
with early-stage tumors that display a “poor prognosis” profile on 
the basis of their gene-expression pattern (Figure 16.23a) can be 
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FIGURE 16.22 Gene-expression profiling that distinguishes two types of leukemia. Each row depicts the expression level of a single gene that is 
named to the right of the row. Altogether the levels of expression of 50 different genes are indicated. The color key is shown at the bottom of the figure, 
indicating the lowest level of expression is in dark blue and highest in dark red. Each column represents data from a different sample (patient). The columns 
on the left show the expression profiles from patients with acute lymphoblastic leukemia (ALL), whereas the columns on the right show the profiles from 
patients with acute myeloid leukemia (AML) (indicated by the brackets at the top). It is evident that the genes in the upper box are expressed at a much 
greater level in patients with ALL, whereas the genes in the lower box are expressed at a much greater level in patients with AML. (The genes selected for 
inclusion in this figure were chosen for the illustration because of these differences in expression between the two diseases.) 


Source: From T. R. Golub et al., Science 286:534, 1999; Copyright 1999, Science by Moses King. Reproduced with permission of American Association for 
the Advancement of Science in the format reuse in a book/textbook via Copyright Clearance Center. 


treated aggressively with chemotherapy to maximize the chance that 
the formation of secondary tumors can be prevented. If gene- 
expression data is not considered, these individuals are not likely to 
receive any type of chemotherapy because there is no indication 
using conventional criteria that their tumor had spread. Conversely, 
those patients whose tumors exhibit a “good prognosis” profile 
might be spared the more debilitating chemotherapeutic drugs, even 
if their tumors appear more advanced (Figure 16.23b). In recent 


years, several companies have introduced laboratory tests (e.g., 
MammaPrint and Oncotype DX) to analyze gene-expression pro- 
files of individual breast cancers in an attempt to help guide the best 
course of treatment for these patients. Even though these prognostic 
tests have been in widespread use for several years, their validity is 
still being evaluated in large clinical trials. It is hoped that gene- 
expression profiles can soon be used to improve the diagnosis and 


optimize treatment for individual patients with all types of cancer. | 
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FIGURE 16.23 The use of DNA microarray data in determining the 
choice of treatment. Each graph shows the survival rate over time in breast 
cancer patients that had either a good-prognosis signature or a poor-prog- 
nosis signature based on the level of expression of 70 selected genes. The 
patients in a showed no visible evidence that the cancer had spread to 
nearby lymph nodes at the time of surgery. As indicated in the plot: (1) not 
all of these patients survive and (2) the likelihood of survival can be 
predicted to a large extent by the gene-expression profiles of their tumors. 
This allows physicians to treat those patients with a poor signature more 
aggressively than those with a good signature. The patients in b did show 
visible evidence of spread of cancer cells to nearby lymph nodes. As 
indicated in the plot, the likelihood of survival in this group can also be 
predicted by gene-expression data. Normally, all of the patients in this 
group would be treated very aggressively, which may not be necessary for 
those with a good signature. 


Source: From M. J. Van De Vijver et al., New England Journal Medicine 
347:2004, 2005, Copyright 2002, The New England Journal of Medicine by 
Massachusetts Medical Society reproduced with permission of 
Massachusetts Medical Society, in the format reuse in a standard/custom 
book (basic rights) via Copyright Clearance Center. 
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1. How can DNA microarrays be used to determine the 
type of cancer a patient suffers from? How might they 
be used to optimize cancer treatment? 


16.14 Strategies for 
Combating Cancer 


It is painfully evident that conventional approaches to combating can- 
cer, namely, surgery, chemotherapy, and radiation, are not usually 
successful in curing a patient of metastatic cancer, that is, one that has 
already spread from a primary tumor. Because they kill large numbers 
of normal cells, along with the cancer cells, chemotherapy and radia- 
tion tend to have serious side effects, in addition to having limited 
curative value for most advanced cancers. There has been hope for 
decades that these “brute-force” strategies would be replaced by tar- 
geted therapies, based on our newly formed insights into the molecu- 
lar basis of malignancy. There are several ways that a therapy can be 
considered to be “targeted”: It can be targeted to attack only cancer 
cells, leaving normal cells unscathed; it can be targeted to a particular 
protein whose inactivation leaves the cancer cells unable to grow or 


survive; and/or it can be targeted to the cancer cells of a particular 
patient based on their unique pattern of somatic mutations. Although 
the cure rate for most types of cancers has not improved significantly 
over the past 50 years, there is reason to believe that effective targeted 
therapies will become available to treat many of the common cancers 
in the foreseeable future. This optimism is based largely on the 
remarkable success that has been achieved with a small number of 
targeted treatments that will be discussed in the following pages. Even 
though these successes have been scattered among a much larger 
number of failed prospective treatments, they demonstrate that the 
concept of targeted therapy is sound. In other words, these successes 
can be considered as “proof-of-principle.” Just as important, they give 
both researchers and biotechnology companies the incentive to spend 
time and money to continue the pursuit of better cancer treatments. 

The anticancer strategies to be discussed in the following sec- 
tions can be divided into three groups: (1) those that depend on 
antibodies or immune cells to attack tumor cells, (2) those that 
inhibit the activity of cancer-promoting proteins, and (3) those that 
prevent the growth of blood vessels that nourish the tumor. 


REVIEW | 


1. What is meant by targeted strategies for fighting 
cancer, and how is this different from more 
conventional approaches? 


16.15 Immunotherapy 


We have all heard or read about persons with metastatic cancer who 
were told they had only months to live, yet they defy the prognosis 
and remain alive and cancer-free years later. The best studied cases 
of such “spontaneous remissions” were recorded in the late 1800s by 
a New York physician named William Coley. Coley’s interest in the 
subject began in 1891 when he came across the hospital records of a 
patient with an inoperable tumor of the neck, who had gone into 
remission after contracting a streptococcal infection beneath the 
skin. Coley located the patient and found him to be free of the cancer 
that had once threatened his life. Coley spent his remaining years 
trying to develop a bacterial extract that, when injected under the 
skin or into a tumor, would stimulate a patient's immune system to 
attack and destroy the malignancy. The work was not without its 
successes, particularly against certain uncommon soft-tissue sarco- 
mas. Although the use of Coley’s toxin, as it was later called, was 
never widely embraced, many studies have confirmed anecdotal 
observations that the body has the capability to destroy a tumor, 
even after it has become well established. In recent years, two broad 
treatment strategies involving the immune system have been pur- 
sued: passive immunotherapy and active immunotherapy. 

Passive immunotherapy is an approach that attempts to treat can- 
cer patients by administering antibodies as therapeutic agents. These 
antibodies recognize and bind to specific proteins on the surface of the 
tumor cells being targeted. Once bound, the antibody either kills the 
cells directly or orchestrates an attack on the cell carried out by other 
elements of the immune system. As discussed in Section 18.18, the 
production of monoclonal antibodies capable of binding to particular 
target antigens was initially developed in the mid-1970s. During the 
next 20 years or so, attempts to use these proteins as therapeutic agents 


were foiled for a number of reasons. Most importantly, these antibod- 
ies were produced by mouse cells and encoded by mouse genes. As a 
result, the antibodies were recognized as foreign and cleared from the 
bloodstream before they had a chance to work. In subsequent efforts, 
investigators were able to produce “humanized antibodies,” which are 
antibodies that are largely human proteins except for a relatively small 
part that recognizes the antigen, which remains “mousey.” In the past 
few years, researchers have been able to produce antibodies that have 
a completely human amino acid sequence. 

Herceptin is a humanized antibody directed against a cell- 
surface receptor (HER2) that binds a growth factor that stimulates 
the proliferation of breast cancer cells. Herceptin is thought to 
inhibit activation of the receptor by the growth factor and stimulate 
receptor internalization (Section 8.19). Approximately 25 percent of 
breast cancers are composed of cells that overexpress the HER2 gene, 
which causes these cells to be especially sensitive to growth factor 
stimulation. Up until the development of Herceptin, patients whose 
tumors overexpressed HER2 had a very poor prognosis. Their prog- 
nosis is now greatly improved. For example, one study of 3000 
women with early-stage breast cancer reported that Herceptin 
reduced the chance of recurrence of the disease by about 50 percent 
in a four-year period. Recent trials suggest that survival can be 
increased by combining Herceptin with another monoclonal anti- 
body (Omnitarg) that interferes with HER2 dimerization. To date, 
the most effective humanized antibody is Rituxan, which was 
approved in 1997 for treatment of non-Hodgkin's B-cell lymphoma. 
Rituxan binds to a cell-surface protein (CD20) that is present on the 
malignant B cells in approximately 95 percent of the cases of this 
disease. Binding of the antibody to the CD20 protein inhibits cell 
growth and induces the cells to undergo apoptosis. Introduction of 
this antibody (together with chemotherapy) has reversed the pros- 
pects for people with this particular cancer. 

Over the past few years, a number of fully human antibodies 
have been tested in clinical trials against various types of cancers. 
One of these, called Vectibix, which is directed against the EGF 
receptor, has been approved as a single-agent treatment of EGFR- 
expressing metastatic colon cancer. Because it is a human protein, 
Vectibix remains in the circulation long enough to be administered 
once every other week. Another human monoclonal antibody, called 
Arzerra, has been approved for treatment of chronic lymphocytic 
leukemia. Like Rituxan, Arzerra binds to CD20 on the surface of 
B cells. In addition, a number of antibodies are being developed that 
contain a radioactive atom or a toxic compound conjugated to the 
antibody molecule. The antibody targets the complex to the cancer 
cell, and the associated atom or compound kills the targeted cell. At 
the time of this writing, two radioactively labeled anti-CD20 anti- 
bodies (Zevalin and Bexxar) have been approved for the treatment 
of non-Hodgkin’s B-cell lymphoma, and one anti-CD30 antibody 
linked to a toxic compound (Adcetris) has been approved to treat 
Hodgkin’s lymphoma. 

Active (or adoptive) immunotherapy is an approach that tries to 
get a person’s own immune system more involved in the fight against 
malignant cells. The immune system has evolved to recognize and 
destroy foreign materials, but cancers are derived from a person’s own 
cells. Although many tumor cells do contain proteins that are not 
expressed by their normal counterparts (so-called tumor-associated 
antigens or neoantigens), or mutated proteins (e.g., BRAFY”) that 
are different from those present in normal cells, they are still basically 
host proteins present in host cells. As a result, the immune system 


typically fails to recognize these proteins as inappropriate. Even if a 655 


person does possess immune cells that recognize certain tumor- 
associated antigens, tumors evolve a number of mechanisms that 
allow them to escape immune destruction. Many different strategies 
have been formulated to overcome these hurdles and artificially stim- 
ulate the immune system to mount a more vigorous response against 
tumor cells. 

Some tumor cells are able to evade the immune system by 
expressing ligands for T-cell receptors, called CTLA-4 and PD-1, 
that dampen T-cell activity when bound and limit the body’s 
antitumor immune response. Antibody-based therapies that target 
CTLA-4 and PD-1 have been approved for clinical use, but have been 
shown to have limited efficacy against most cancers. However, a new 
phase I trial of a monoclonal antibody against PD-L1 (the ligand for 
PD-1) has shown promise in treating a broad range of solid tumors, 
including non small-cell lung cancers and melanomas. 

Other approaches focus on stimulating an immune response 
against a protein known to be present in their cancer cells, such as 
HER2 in the case of a person suffering from breast cancer or CEA 
(carcinoembryonic antigen) in a person with pancreatic cancer. If the 
body can be stimulated to mount an immune response against the 
protein, that response has the potential to attack the cancer cells that 
have that protein on their surface. In most immunotherapeutic 
approaches, immune cells are isolated from the patient, modified or 
stimulated in one way or another in vitro, allowed to proliferate in 
culture, and then reintroduced into the patient. For many years, clini- 
cal trials using this strategy were disappointing, but recent publica- 
tions have provided reason for cautious optimism. In many of these 
studies, a significant minority of patients have exhibited a positive 
response to the treatment, which means that their tumors have at least 
shrunk in size or extent and the patient’s expected time of survival has 
increased significantly. Why some patients respond, while most do 
not, remains a focus of investigation. We will briefly consider two of 
the most promising examples of this general type of cell-based immu- 
notherapy. Because both of the treatments to be described have to be 
personalized to the individual patient, they are likely to be extremely 
expensive should they ever become approved and widely available. 

A handful of new treatments are focused on re-engineering 
immune cells to target tumor cells. DCVax is currently in phase II 
trials for the highly lethal brain tumor, glioblastoma. DCVax utilizes 
dendritic cells that have been taken from the patient’s blood, stimu- 
lated in vitro with antigens from the patient’s own tumor, and then 
injected back into the patient’s body. Because the strategy involves 
exposure of dendritic cells to specific proteins with the goal of gen- 
erating an immune response, DCVax is described as a “cancer vac- 
cine.” The “manufacturing process” takes about 10 days and provides 
enough cells for several years of treatment. Although the number of 
patients that have been treated to date is limited to a few dozen, the 
vaccine appears to have extended the median survival time of 
patients to more than three years from time of diagnosis. Without 
such treatment, these patients would presumably have died in about 
15 months. As of July 2011, 33 percent of treated patients had sur- 
vived at least four years and 27 percent had survived at least six years. 
Fewer than 5 percent of patients given conventional treatments sur- 
vive five years. DCVax is currently undergoing trials for treatment of 
additional cancers, including metastatic ovarian cancer, prostate 
cancer, and solid tumor cancers. 

In August 2011, the results of a phase I trial carried out on three 
patients with advanced chronic lymphocytic leukemia (CLL) was 
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are normally carried out to determine (1) whether a drug is safe and 
(2) its proper dosage. In this case, the treatment resulted in the com- 
plete remission of disease in two of the patients and at least partial 
remission in the third. To carry out the treatment, Carl June and his 
colleagues at the University of Pennsylvania first isolated T cells from 
the blood of each of the patients. As discussed in Chapter 17, T cells 
carry a multisubunit protein on the surface of their cells called the 
T cell receptor (or TCR). The TCR determines which specific mole- 
cules (antigens) that a given T cell will react with and, consequently, 
which target cells that T cell will attack and likely kill. For the most 
part, T cells do not possess TCRs that would allow them to react with 
other cells in the body, which protects the body from self-attack. In 
this clinical trial, researchers isolated T cells and then genetically 
engineered these cells, using a viral vector, to carry a specific, high- 
affinity TCR that would allow the T cells to react with and kill host 
cells that possess a CD19 protein on their surface. The only cells that 
normally carry CD19 are B cells—both the normal versions and 
those that are part of the CLL cancer. Once these T cells were geneti- 
cally modified, their numbers were greatly expanded in culture and 
were then infused back into the patient from which the T cells were 
originally obtained. Within a week or two, the patients were stricken 
with a sickness characterized by very high fever, lowered blood pres- 
sure, and kidney distress. These symptoms were the result of the war 
going on within their bodies between the infused T cells (and the 
massive numbers of their progeny formed in the body after infusion) 
and the huge number of cancerous cells that these patients carried. It 
was estimated that each infused T cell was responsible for the death 
of more than 1000 CLL cells. In the end, the patients were left with a 
relatively small number of genetically engineered T memory cells 
derived from the infusion, a virtual absence of normal B cells due to 
their destruction by the T cells, and either the absence of malignant 
cells or, in one case, a greatly reduced tumor burden. The presence of 
the memory T cells should prevent the reemergence of CLL, as well 
as the suppression of formation of normal B cells, which might make 
these patients more susceptible to infection. Keep in mind that the 
use of genetically modified T cells carries considerable risk, because 
these T cells can attack and kill any cell in the body that happens to 
carry the targeted antigen. As a result, these types of cell-based 
immunotherapies (called chimeric antigen receptor, or CAR ther- 
apy) have the potential to cause severe autoimmune side effects. 
While the most recent data from the trials reveal remarkable efficacy, 
with 15 of 17 patients showing complete remission, two deaths— 
most likely due to side effects of treatment—were also reported. 

Immunotherapies represent approximately one-quarter of the 
over 350 experimental anticancer drugs that are currently undergo- 
ing drug trials. Large pharmaceutical companies and smaller biotech 
companies alike have intensified efforts in developing and testing 
immunotherapeutic drugs and CAR-based treatments, and expecta- 
tions are high that some of these new, potentially high-impact treat- 
ments will soon become mainstream. 


REVIEW | 
1. Describe the differences between passive and active 
(or adoptive) immunotherapy. 


2. Describe how engineered T cells have been used to 
successfully treat cancer. 


16.16 Inhibiting the Activity 
of Cancer-Promoting Proteins 


Cancer cells behave as they do because they contain proteins that are 
either present at abnormal concentration or display abnormal activ- 
ity. A number of these proteins were illustrated in Figure 16.17. In 
many cases, the growth and/or survival of tumor cells is dependent 
on the continued activity of one or more of these deviant proteins. 
This dependency is known as “oncogene addiction.” If the activity of 
one of these proteins can be selectively blocked, it should be possible 
to kill the entire population of malignant cells. With this goal in 
mind, researchers have synthesized a virtual arsenal of small- 
molecular-weight compounds that inhibit the activity of cancer- 
promoting proteins. Some of these drugs were custom-designed to 
inhibit a particular protein of known structure, whereas others were 
identified by randomly screening large numbers of compounds that 
have been synthesized by pharmaceutical companies. Once a pro- 
tein-inhibiting compound has been identified, it is typically tested 
for effectiveness against a panel of approximately 60 different types 
of cultured cells, each originally isolated from a different human 
cancer. Using a diverse panel of cells allows researchers to reconsti- 
tute the genetic heterogeneity and diverse drug sensitivities found 
among human cancers. Success against cultured cells generally leads 
to tests of the agent in immunocompromised mice carrying human 
tumor transplants (xenografts). A list of some of the agents that have 
been tested in clinical trials is shown in Table 16.2. Although numer- 
ous compounds on this list have shown some promise in halting the 
growth of various types of tumors, one compound has had unparal- 
leled success in treating patients with chronic myelogenous leuke- 
mia (CML). 

It was noted earlier that certain types of cancers are initiated by 
specific chromosome translocations. CML is caused by a transloca- 
tion that brings a proto-oncogene (ABL) in contact with another 
gene (BCR) to form a chimeric gene (BCR-ABL). Blood-forming 
cells that carry this translocation express a high level of ABL tyrosine 
kinase activity, which causes the cells to proliferate uncontrollably 
and initiate the process of tumorigenesis. As discussed on page 75, a 
compound called Gleevec was identified that selectively inhibits ABL 
kinase by binding to the inactive form of the protein and preventing 
its phosphorylation by another kinase, which is required for ABL 
activation. Initial clinical trials on Gleevec were remarkably success- 
ful, causing nearly all CML patients who received the drug at suffi- 
ciently high dose to go into remission. These studies confirmed the 
notion that elimination of a single required oncogene product could 
stop the growth of a human cancer. The drug was rapidly approved 
and has been in use for years. Patients have to continue to take the 
drug to remain in remission, and many of them, especially those who 
began treatment at an advanced stage, eventually develop drug resis- 
tance. Most cases of resistance result from mutations in the ABL por- 
tion of the fusion gene. This has prompted the development of a 
second generation of targeted inhibitors that remain active against 
most of the mutated forms of the ABL kinase (see Figure 2.51d). 
These drugs appear to be effective in treating Gleevec-resistant cases 
of CML and suggest that the ideal drug regimen may consist of a 
cocktail of several different inhibitors that target different parts of the 
same protein, ensuring that drug-resistant mutants will not emerge. 

It was hoped that Gleevec would be followed rapidly by numer- 
ous other highly effective protein-inhibiting drugs. Although a 
number of protein-targeting, small-molecule inhibitors have shown 


TABLE 16.2 Examples of Small-Molecule Targeted Therapies That Have Either Been FDA Approved or Are Being Tested 


Drug 


Target 


Mechanism of action 


Gleevec, Tasigna 
Sprycel 

Iressa, Tarceva, 
Zactima 

Sutent, Votrient 
Tykerb 

Nexavar 

Zelboraf 

GDC-0973 

Velcade, Carfilzomib 
Zolinza, Istodax 
Erivedge 

Torisel, Afinitor 
BKM120, PX866 
BEZ235, BGT226 
Perifosine, MK2206 
Trisenox (arsenic trioxide) 
Seliciclib 

Tamoxifen, Raloxifene 
Arimidex, Aromasin 
Zytiga 

Genasense, ABT-263 
17-AAG 

Nutlins 

PRIMA-1 

PX-478 

Veliparib, Rucaparib 
Dacogen, Vidaza 


BCR-ABL, KIT, PDGFR 
BCR-ABL, SRC family 
EGFR 

VEGER, EGFR 
VEGER, PDGER, KIT 
EGFR, HER2 

BRAK, KIT, VEGFR 
BRA FV600E 

MEK 

proteasome 

HDACs 

Smoothened of Hh pathway 
mTOR 

PI3K 

PI3K, mTOR 
PKB/AKT 

NE-KB 

CDK2 

Estrogen receptor 
Aromatase 

CYP17 

BCL-2 

HSP90 

ps3 

p53 

HIF-1 

PARP-1 

DNMT 


Tyrosine kinase inhibitor 

Tyrosine kinase inhibitor 

Tyrosine kinase inhibitor 

Tyrosine kinase inhibitor 

Tyrosine kinase inhibitor 

Tyrosine kinase inhibitor 

Kinase inhibitor 

Kinase inhibitor 

Inhibits MAPK cascade 

Inhibits protein degradation 

Inhibits histone acetylation (epigenetic effect?) 
Blocks cell growth and survival pathway 
Blocks cell-survival pathway 

Blocks cell-survival pathway 

Blocks cell-survival pathway 

Blocks cell-survival pathway 

Blocks cell-survival pathway 

Induces apoptosis 

Blocks estrogen action 

Inhibits estrogen synthesis 

Inhibits androgen synthesis 

Induces apoptosis 

Inhibits molecular chaperone 

Inhibits p53-MDM2 interaction 
Restores mutant p53 activity 

Inhibits this transcription factor that is activated by hypoxia 
Blocks HR-based DNA repair 

Inhibits DNA methylation 


modest success in clinical trials and have been approved by the FDA, 
and hundreds of others are being tested in the clinic, none of them 
studied to date have been able to fully stop the growth of any of the 
common solid cancers, namely those of the breast, lung, prostate, or 
pancreas. One major success has been achieved (FIGURE 16.24). 
A drug called Zelboraf (vermurafenib) has recently been approved 
for the treatment of patients with metastatic melanoma whose can- 
cer is driven by a specific mutated version of BRAK, one in which the 
normal valine residue at position 600 is replaced by a glutamic acid 
(BRAFY®*), thereby changing the shape of the active site. This 
mutation occurs in approximately 50 percent of melanoma patients 
and Zelboraf specifically blocks the activity of this mutant enzyme 
without affecting the normal protein. As with Gleevec, patients 
develop resistance to Zelboraf but, unlike Gleevec, resistance appears 
much earlier (typically after seven months of treatment) and, in 
most cases, results from mutations in other genes rather than sec- 
ondary mutations in BRAF. Resistance to Zelboraf typically occurs 
because the MAPK pathway once again becomes constitutively 
active, bypassing the cells’ need for BRAF kinase activity, which con- 
tinues to be inhibited by the drug. Studies are underway to combine 
Zelboraf with inhibitors of downstream components of the pathway, 
such as MAPK or ERK. It is unlikely that a tumor will harbor cells 
that are resistant to two drugs that target two different pathways or 
two different proteins in the same pathway. Consequently, an 
approach that uses a combination of drugs may be the best way to 
block the emergence of resistant cells. This use of combined drug 
therapy has already proven highly effective in blocking the appear- 
ance of resistant strains of the HIV virus in infected patients and, 
like cancer cells, HIV is also highly mutable. Unfortunately, inhibiting 


(a) (b) 


FIGURE 16.24 Therapeutic effects of the drug Zelboraf (PLX4032) on 
patients with metastatic melanoma harboring a mutant BRAFY6°OF 
oncoprotein. These PET scans of a patient prior to (a) and two weeks after 
initiation of treatment (b) show the dramatic beneficial effects of this drug. 
The dark regions of the scan show the locations of the metabolically active 
cancerous lesions. Tumor regrowth occurs in many of the patients due to 
the emergence of drug-resistant clones. 


Source: From Gideon Bollag, et al., Nature 467:599, 2010; © 2010, 
reprinted by permission from Macmillan Publishers Ltd. 
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side effects as a result of their actions in normal cells of the body. 

The reasons for the failure to develop drugs that can eradicate 
solid, epithelial-based tumors (carcinomas) are not entirely clear. 
One reason may be that these tumors are more complex genetically 
and the cells are not as dependent on a single oncogene product and 
aberrant signaling pathway as are the blood-cell cancers and mela- 
noma. Another reason may be that only a fraction of the patients 
with a particular type of tumor are sensitive to a particular drug. This 
was originally suggested in studies of Iressa, an inhibitor of the tyro- 
sine kinase of the EGF receptor (EGFR). Iressa was originally tested 
on lung cancer patients because these tumors were known to exhibit 
high levels of EGFR. Initial clinical trials found that approximately 
10 percent of patients in the United States and 30 percent of Japanese 
patients responded positively to the drug, whereas the remaining 
population was unaffected. Subsequent studies on cancers that con- 
tained EGFR mutations indicated that targeting the mutant EGFR 
was only potentially successful if the patient had wild type KRAS. 
Those patients whose cancers also exhibited a mutant KRAS gene 
were nonresponders. As a result, lung (and colon) cancers are now 
tested for KRAS mutations before anti-EGFR therapy is initiated. 
More recently, it was discovered that the drug Xalkori is highly effec- 
tive in treating a select subgroup of lung cancer patients that had 
normal EGFR genes, but expressed an EML4-ALK fusion protein 
that results from a chromosomal translocation. Xalkori inhibits ALK 
kinase. The drug was approved along with a “companion diagnostic” 
test that uses fluorescence in situ hybridization (FISH) to identify the 
presence of the rearrangement, which occurs in about 5 percent of 
patients. These findings verify the notion that not all cases of a par- 
ticular cancer can be treated alike. Instead, targeted cancer therapies 
will ultimately have to be tailored to fit the specific genetic modifica- 
tions present in the tumors of individual patients. With this in mind, 
a number of major cancer centers have begun programs to sequence 
a panel of selected genes in patients with certain cancers. This infor- 
mation is also being used to help select individuals for participation 
in clinical trials, thereby increasing the likelihood of finding drugs 
that are effective in treating subsets of patients with a certain type of 
cancer. Without this type of genetic screening, the positive signals for 
a drug that might be useful for a subset of patients can be drowned 
out by the lack of clinical benefit to the population as a whole. The 
current challenge is to identify genetic markers that have value in 
predicting the success or failure of a given therapy. 

We have focused in this section upon targeted therapies that 
inhibit proteins that are either abnormal themselves or are abnor- 
mally expressed in cancer cells. But there may also be proteins that 
have a normal structure and expression, yet for some reason play an 
important role in the lives of cancer cells. Inhibitors that target such 
proteins might have considerable potential as therapeutic agents in 
the treatment of cancer. We have seen in this chapter how cancer 
cells contain mutations in a wide variety of genes, many of which 
lead to the overall inhibition of certain metabolic or signaling path- 
ways. While this may help promote the growth and survival of these 
cancer cells, it also may cause them to become more dependent than 
normal cells upon other pathways that continue to operate in a nor- 
mal fashion. Recent announcements hailing the promise of PARP-1 
inhibitors in the treatment of several types of cancer provide an 
example of this type of reasoning. PARP (which is an acronym for 
poly(ADP-ribose) polymerase) is a poorly understood enzyme that 
is involved in numerous processes involving DNA metabolism, 
including DNA repair. PARP-1 inhibitors have shown particular 


promise in the treatment of breast and ovarian cancers that exhibit 
deficiencies in BRCA1 or BRCA2. As discussed in Section 13.8, 
BRCA proteins are involved in DNA repair and it is reasoned that 
such cancer cells are more dependent than normal cells upon other 
DNA repair pathways, including those that require PARP-1. When 
PARP-1 is inhibited in BRCA-deficient tumor cells, certain types of 
DNA damage cannot be repaired, causing the cells to die by apopto- 
sis. This type of treatment is based on a strategy of “synthetic lethal- 
ity,’ which suggests that mutations or inhibition of only one protein 
(e.g., BRCA or PARP) has no effect on cell viability but that muta- 
tions and/or inhibition of two different proteins leaves the cell un- 
able to carry out one or more essential functions. More importantly, 
this approach provides a general strategy to target cancer cells that 
have lost the function of a particular tumor suppressor protein, in 
this case BRCA1 or BRCA2. A similar strategy might also be devised 
to target cancer cells that have lost p53 function, for example, because 
such cells should also be vulnerable to drugs that have no effect on 
normal cells, which possess intact tumor-suppressive pathways. It 
might be possible to find genes that might be sensitive to synthetic 
lethality by searching the data from cancer genomics studies, looking 
for either (1) two genes that are never mutated together in the same 
tumor or (2) genes that are never mutated in any cases of a particular 
cancer. Genes in this latter category are presumably required for sur- 
vival of the cancer cells and represent possible drug targets. 


REVIEW | 
1. Describe the mechanism of action of the drug 
Gleevec. 


16.17 The Concept of a Cancer 
Stem Cell 


Another reason for the failure to develop more effective targeted 
therapies may be that the agents are not targeting the appropriate 
cells within the tumor. This possibility requires further explanation 
but raises an important issue concerning both the basic biology of 
cancer and its treatment. Throughout this chapter, we have consid- 
ered a tumor to be a mass of relatively homogeneous cells. When 
viewed in this way, all of the cells in a tumor are capable of unlimited 
proliferation and all of the cells have the opportunity to evolve into a 
more malignant phenotype as the result of ongoing genetic change. 
In recent years, a new concept has emerged that suggests that while 
most of the cells of a tumor may be dividing at a rapid rate, they have 
relatively limited long-term potential to sustain the primary tumor 
or initiate a new secondary tumor. Instead, a relatively small fraction 
of cells scattered throughout the tumor are responsible for maintain- 
ing the tumor and promoting its spread. These “special” cells 
are known as cancer stem cells,° and there is considerable 


®Tt should be kept in mind that a number of experiments argue against the existence of 
rare cancer stem cells, and the issue is the focus of current debate. Cancer stem cells are 
essentially defined by a single assay: injection of cells into immunodeficient mice. The 
primary question in this debate is how large a fraction of cells in a tumor are capable 
of regrowing the tumor in this assay. A number of studies suggest that the answer to 
this question depends on the degree to which the animal that receives the tumor cells 
is immunocompromised rather than the nature of the cells that are injected. There is 
also considerable debate over the degree to which cancer stem cells can be defined by 
novel cell-surface markers. See Science 324:1670, 2009; Cell 138:822, 2009; Nature 
Med. 17:313, 2011; and Science 347:226, 2015 for a discussion of these issues. 


experimental evidence for their existence in leukemias, brain 
tumors, breast tumors, and other cancers. In most cases, the evi- 
dence for the existence of cancer stem cells rests on the finding that 
only a subpopulation of cells in a given human tumor is able to initi- 
ate the formation of a new tumor following injection into a suscepti- 
ble (immunodeficient) mouse. Those cells that are able to regrow the 
tumor in the inoculated animal often possess certain surface pro- 
teins that distinguish them from the cells that make up the bulk of 
the tumor, and it is these markers that serve to define the cancer 
stem cell. The term cancer stem cell does not suggest anything about 
the origin of these cells, whether from a tissue stem cell or some 
other type. Nor does it imply that cancer stem cells necessarily 
resemble normal tissue stem cells. Instead, they are defined simply 
as the cells capable of propagating or regrowing the tumor. 

The concept of the cancer stem cell is raised at this point in the 
chapter because it has important consequences for cancer therapy. If 
it is true that only a small subpopulation of cells in a tumor has the 
capability of continuing the life of that tumor, then developing drugs 
that rapidly kill off the bulk of a tumor mass but spare the cancer 
stem cells is doomed ultimately to fail. This is presumably the reason 
that patients with CML must continue to take Gleevec to keep their 
disease in remission; Gleevec does not seem to kill the leukemia- 
initiating cells responsible for the recurrence of disease. A number of 
drugs targeting cancer stem cells are currently undergoing trials, and 
a small number have shown promise. Tarextumab, for example, 
which targets signaling proteins of the Notch family, is thought to 
make cancer stem cells differentiate into tumor cells, making them 
susceptible to chemotherapy treatments. It was recently reported 
that 24 of 29 patients with pancreatic cancer (which is typically 
resistant to chemotherapy) who were treated with tarextumab in 
conjunction with conventional drugs saw stabilization or shrinkage 
of their tumors. 


REVIEW | 


1. What are cancer stem cells, and what is the evidence 
for their existence? 
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16.18 Inhibiting the Formation of 
New Blood Vessels (Angiogenesis) 


As a tumor grows in size, it stimulates the formation of new blood 
vessels, a process termed angiogenesis (FIGURE 16.25). Blood vessels 
are required to deliver nutrients and oxygen to the rapidly growing 
tumor cells and to remove their waste products. Blood vessels also 
provide the conduits for cancer cells to spread to other sites in the 
body. In 1971, Judah Folkman of Harvard University suggested that 
solid tumors might be destroyed by inhibiting their ability to form 
new blood vessels. After a quarter of a century of relative obscurity, 
this idea has been translated into an approved anticancer strategy. 

Cancer cells promote angiogenesis by secreting growth factors, 
such as VEGF, that act on the endothelial cells of surrounding blood 
vessels, stimulating them to proliferate and develop into new ves- 
sels. A variety of angiogenic inhibitors have been developed by 
biotechnology companies, including antibodies and synthetic com- 
pounds directed against integrins, growth factors, and growth- 
factor receptors. Preclinical studies on mice and rats suggested that 
angiogenic inhibitors might be effective in stopping tumor growth. 
Most importantly, tumors treated with these inhibitors in preclini- 
cal studies did not become resistant to repeated drug application. 
Tumor cells become resistant to the usual chemotherapeutic agents 
because the cells are genetically unstable and can evolve into resist- 
ant forms. Inhibitors of angiogenesis, however, target normal, 
genetically stable endothelial cells, which continue to respond to the 
presence of these agents. 

Inhibiting angiogenesis in human tumors is not as easily accom- 
plished as might have been expected based on studies with mice. To 
date, the most promising results have been obtained with a human- 
ized antibody (called Avastin) that is directed against VEGF, the 
endothelial cell growth factor that is overexpressed in most solid 
tumors. Avastin blocks VEGF from binding to and activating its 
receptor, VEGEFR. FDA approval was based on clinical trials demon- 
strating that Avastin, in combination with standard chemotherapy, 
could prolong the lives of patients with metastatic colorectal cancer 
for several months. Subsequent studies suggested that Avastin could 
add several months to the lives of patients with several other types of 


FIGURE 16.25 Angiogenesis and tumor growth. Steps in the vascularization of a primary tumor. In step 1, the tumor proliferates to form a small mass of 
cells. As long as it is avascular (without blood vessels), the tumor remains very small (1-2 mm). In step 2, the tumor mass has produced angiogenic factors 
that stimulate the endothelial cells of nearby vessels to grow out toward the tumor cells. In step 3, the tumor has become vascularized and is now capable of 


unlimited growth. 


Source: After B. R. Zetter, with permission from the Annual Review of Medicine, Vol 49; copyright 1998. Annual Review of Medicine by Annual Reviews. 
Reproduced with permission of Annual Reviews in the format republish in a book via Copyright Clearance Center. 
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660 solid tumors, but this conclusion has been debated, most notably in 
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cases of breast cancer. Whether inhibition of angiogenesis will ever 
prove itself as an effective therapy remains to be seen but, at the pre- 
sent time, interest in this therapeutic strategy has waned. 

For the forseeable future, the best anticancer strategy is early 
detection. The time that elapses between the initiation of one of the 
common epithelial cancers and its progression to a deadly metastatic 
disease is generally considered to be at least a decade and often much 
longer. Consequently, there is ample time to detect these cancers 
before they become life-threatening. A number of screening proce- 
dures are currently in use, including mammography for detecting 
breast cancer, Pap smears for detecting cervical cancer, PSA deter- 
minations for detecting prostate cancer, and colonoscopy for detect- 
ing colorectal cancer. It is hoped that advances in proteomics will 
evenually lead to the development of new screening tests based on 
the relative levels of various proteins present in blood or other bodily 
fluids. This approach was discussed in some detail on page 72. There 
are other bloodborne biological indicators (biomarkers) that could 


reveal the presence of cancer, including mutant DNA, particular 
miRNAs, abnormal carbohydrates, and distinctive metabolites. 
Screening tests based on each of these types of biomarkers are being 
investigated.’ 


’There is a widely held view that increased early screening procedures are likely to 
identify an increased number of cancers that will never become life-threatening and, 
thus, will increase the cost and debilitating side-effects of treatments that do not affect 
survival rates (discussed in Ann Rev. Med. 60:125, 2009). The most compelling evi- 
dence for the value of such procedures comes from a recent study that found that 
removal of polyps during a colonoscopy greatly reduces the likelihood of death from 
colon cancer (NEJM 366:687, 2012). 
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1. How do cancer cells promote angiogenesis? 
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The Immune Response 


FIGHTING EBOLA 


ls the spring of 2014, a child in western Africa became 
infected with Ebola virus, which resulted in a deadly 
hemorrhagic fever. Over the following year, this highly 
contagious disease quickly spread, claiming the lives of 
approximately half of those infected, and ultimately 
leaving a death toll in the thousands. Since its discovery 


in the 1960s, Ebola virus has caused over 30 outbreaks of 


hemorrhagic fever, each with a fatality rate that is 
consistently at or above 50 percent. What makes Ebola 
infection so lethal? 


Upon infection, usually through direct contact with the 
bodily fluids of an infected individual, the virus gains 
entry into target cells (mainly white blood cells) by 
binding to receptors on the cell surface and inducing 
the cell to internalize it. Similar to other viruses, Ebola 
then co-opts the cellular machinery to transform the 
infected cell into a viral factory. The cell begins to churn 
out high numbers of new viral particles, eventually 
resulting in the death of the host cell. Ebola-infected 
white blood cells can travel through the lymphatic 
system, allowing for the virus to quickly spread through- 
out the body. 


Numerous Ebola viral particles (blue) are shown budding from an infected cell 
(yellow) in this pseudo-colored electron micrograph. 


Source: Courtesy of National Institute of Allergy and Infectious Diseases (NIAID). 
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allows it to evade a large number of the cellular defenses 
that are normally triggered in response to a viral infection. 
Interferons, for example, are signaling proteins that play a 
key role in innate immunity by regulating a large number of 
antiviral activities, including reducing protein translation, 
destroying RNAs, inducing cell death and inflammation, 
and signaling to neighboring cells of an infection. Research 
has shown that Ebola has evolved numerous ways of 
circumventing different classes of interferons, preventing 
the activation of the innate immune response. The 
combination of host cell death, “bystander” apoptosis of 
uninfected T cells (which is currently not well understood), 
and the lack of additional white blood cell proliferation 
results in lymphopenia, or a low level of white blood cells. 
As the disease progresses, endothelial membranes and 


17.1. An Overview of the Immune 
Response 


Living organisms provide ideal habitats in which other organisms 
can grow. It is not surprising, therefore, that animals are subject to 
infection by viruses, bacteria, protists, fungi, and animal parasites. 
Vertebrates have evolved several mechanisms that allow them to rec- 
ognize and destroy these infectious agents. As a result, vertebrates 
are able to develop an immunity against invading pathogens. 
Immunity results from the combined activities of many different 
cells, some of which patrol the body, whereas others are concen- 
trated in lymphoid organs, such as the bone marrow, thymus, spleen, 
and lymph nodes (FIGURE 17.1). Together, these dispersed cells and 
discrete organs form the body’s immune system. 

The cells of the immune system engage in a type of molecular 
screening by which they recognize “foreign” macromolecules, that 
is, ones whose structure is different from those of the body’s normal 
macromolecules. If foreign material is encountered, the immune 
system mounts a specific and concerted attack against it. The weap- 
ons of the immune system include (1) cells that kill or ingest infected 
or altered cells and (2) soluble proteins that can neutralize, immobi- 
lize, agglutinate, or kill pathogens. Pathogens, in turn, are continu- 
ally evolving countermechanisms to avoid immune destruction. The 
fact that humans suffer from a number of chronic infective diseases, 
such as AIDS (caused by a virus), tuberculosis (caused by a bacte- 
rium), and malaria (caused by a protozoan) illustrates how our 
immune systems are not always successful in combating these 
microscopic pathogens. In some cases, the immune system may 
mount an inappropriate response that attacks the body’s own tissues. 
As discussed in the Human Perspective (Section 17.3) these 
incidents can lead to serious disease. 

It is impossible to cover the entire subject of immunity in a single 
chapter. Instead, we will focus on a number of selected aspects that illus- 
trate principles of cell and molecular biology that were discussed in pre- 
vious chapters. First, however, it is necessary to examine the basic events 
in the body’s response to the presence of an intruding microbe. 


organs become damaged, causing widespread 
hemorrhaging. 


Research suggests that when tissue damage becomes 
severe, immune cells are sometimes triggered to suddenly 
release a large number of pro-inflammatory signaling 
molecules, or cytokines, in a late attempt to control the viral 
infection. This massive dose of cytokines, called a cytokine 
storm, is thought to be one of the events that contributes to 
the high fatality rates of Ebola. 


Efforts are currently underway to develop a vaccine to help 
prevent future Ebola outbreaks and to develop antibody- 
based treatments for those already infected. Researchers 
are also seeking to further elucidate how Ebola and other 
pathogens are able to bypass our bodies’ numerous 
immune defenses. 
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FIGURE 17.1 The human immune system includes various lymphoid 
organs, such as the thymus, bone marrow, spleen, lymph nodes, and 
scattered cells located as patches within the small intestine, appendix, and 
tonsils. The thymus and bone marrow are often described as the central 
immune system because of their key roles in lymphocyte differentiation. 
Source: From Wessner, Microbiology, 1e, Figure 20.4. John Wiley & Sons 
Publishers. Reprinted by permission of John Wiley & Sons, Inc. 


The outer surface of the body and the linings of its internal 
tracts provide an excellent barrier to prevent penetration by viruses, 
bacteria, and parasites. If these surface barriers are breached, a series 
of immune responses are initiated that attempt to contain and then 
kill the invaders. Immune responses can be divided into two general 
categories: innate responses and adaptive (or acquired) responses. 
Both types of responses depend on the ability of the body to distin- 
guish between materials that are supposed to be there (i.e., “self”) 
and those that are not (i.e., foreign, or “nonself”). We can also distin- 
guish two categories of pathogens: those that occur primarily inside 
a host cell (all viruses, some bacteria, and certain protozoan parasites) 
and those that occur primarily in the extracellular compartments of 
the host (most bacteria and other cellular pathogens). Different 
types of immune mechanisms have evolved to combat these two 
types of infections. An overview of some of these mechanisms is 
shown in FIGURE 17.2. 


Innate Immune Responses 


Innate immune responses are those the body mounts immedi- 
ately without requiring previous contact with the microbe. Thus, they 
provide the body with a first line of defense. An invading pathogen 
typically makes its initial contact with the innate immune system 
when it is greeted by a phagocytic cell, such as a macrophage or a 
dendritic cell (see Figure 17.10), whose function is to recognize for- 
eign objects and sound an appropriate alarm. In 1989, Charles 
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Janeway, Jr. of Yale University published a far-sighted proposal to 663 


address the role of the innate immune system both in providing the 
body with immediate protection from pathogens and in stimulating 
the cells of the adaptive immune system to become subsequently 
involved in overcoming the threat. We will restrict the present dis- 
cussion to the innate immune system. Janeway proposed that cells of 
the innate system possess a variety of microbial sensors that directly 
recognize certain highly conserved macromolecules that play essen- 
tial roles in the propagation of viruses or bacteria but are not pro- 
duced by cells of the body. Janeway called such sensors pattern 
recognition receptors (or PRRs). Over the past two decades, a number 
of families of PRRs have been identified, the most important of 
which are the Toll-like receptors (TLRs), which is the only group 
that will be discussed in this chapter. The discovery of TLRs came 
about by way of an interesting and unexpected series of events. 

The fruit fly Drosophila melanogaster is well known for its iconic 
status in the field of genetics and also for key contributions to the 
study of development and neurobiology. But, as an invertebrate, 
Drosophila would not have been considered a likely organism for a 
major discovery concerning the workings of the human immune sys- 
tem. However, in 1996, Jules Hoffmann and colleagues in France iden- 
tified a mutant fruit fly that was highly susceptible to fungal infections, 
as is evident from the photograph in FIGURE 17.3. The fly pictured in 
Figure 17.3 is lacking a protein called Toll, which had been previously 
identified as a protein required for the normal development of dor 


soventral (“top-bottom”) polarity of the fly embryo. In addition they 


ADAPTIVE IMMUNITY 


B cell (SPECIFIC) 


Bacterium coated 
with antibody molecules 


Antibody 
layer 


Infected 
apoptotic cell 


o A 


Bacterial 
toxin 


FIGURE 17.2 An overview of some of the mechanisms by which the immune system rids the body of invading pathogens. The left panel depicts 
several types of innate immunity: (a) phagocytosis of a bacterial cell; (b) bacterial cell killing by complement; (c) apoptosis induced in an infected cell by a 
natural killer (NK) cell; and (d) induction of viral resistance by interferon a (IFN-a). The right panel depicts several types of adaptive immunity: (e) B cell 
producing antibodies that neutralize a bacterial toxin; (f) bacterial cell coated with antibody (opsonized), which makes it susceptible to phagocytosis or 
complement-induced death; and (g) apoptosis induced in an infected cell by an activated T lymphocyte (T cell). Innate and adaptive immune responses are 
linked to one another (horizontal green arrow) because cells, such as dendritic cells and macrophages, that phagocytize pathogens, use the foreign proteins 
to stimulate the production of specific antibodies and T cells directed against the pathogen. NK cells also produce substances (e.g., IFNy) that influence a 


T-cell response. 
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FIGURE 17.3 Scanning electron micrograph of a mutant fruit fly that 
had died from a fungal infection. The body is covered with germinating 
fungal hyphae. This individual was susceptible to infection because it 
lacked a functional Toll gene. 

Source: From Bruno Lemaitre et al., Cell 86:978, 1996, with permission from 
Elsevier. Courtesy of Jules A. Hoffmann. 


found that Toll acted in flies through a pathway involving the tran- 
scription factor NF-KB, which was known to be a key player in the 
activation of an immune response in vertebrates. It appeared that Toll 
serves a dual function in flies, as both a director of embryonic polarity 
and a factor that promotes innate immunity to infection. 

The discovery of the Toll gene as an important contributor to 
the fruit fly's defense system led Janeway and colleague Ruslan 
Medzhitov to clone and characterize a human homologue of the Toll 
protein. In addition, they found that the human Toll homologue also 
acted by way of the NF-KB pathway, inducing the expression of a 
number of immunological effector proteins (called cytokines). It was 
not known, however, what specific role this “human Toll” played in 
the immune response. Meanwhile, Bruce Beutler and his colleagues 
at the University of Texas, Southwestern Medical Center were 
searching for mammalian genes that were involved in recognizing 
bacterial components, particularly a component of the outer mem- 
brane of gram-negative bacteria, called lipopolysaccaride (LPS). 
They found that a mouse strain that was unable to respond to LPS 
was lacking a particular gene that encoded an LPS receptor. 
Strikingly, this LPS receptor was the same Toll homologue (TLR4) 
that had been characterized by Janeway and Medzhitov, thus defin- 
ing its role as a specific bacterial sensor. As it happens, humans 
express at least 10 functional TLRs, all of which are transmembrane 
proteins that are found in a variety of different types of cells. TLRs 
may be present either on the cell surface (where they interact with 
extracellular microbes) or within endosomal/lysosomal membranes 
(where they interact with endocytosed microbes). Within the human 
TLR family are receptors that recognize the lipopolysaccharide or 
peptidoglycan components of the bacterial cell wall, the protein 
flagellin found in bacterial flagella, double-stranded RNA character- 
istic of replicating viruses, and unmethylated CpG dinucleotides 
(which are characteristic of bacterial DNA). A model of a TLR 
bound to its double-stranded RNA ligand is shown in FIGURE 17.4. 

Activation of a TLR by one of these pathogen-derived molecules 
initiates a signal cascade within the cell that can lead to a variety of 
protective immune responses including the activation of cells of the 
adaptive immune system (represented by the horizontal green arrow 


in Figure 17.2). For this reason, a number of pharmaceutical compa- 
nies are working on drugs that stimulate TLRs with the aim of 
enhancing the body’s response against stubborn infections, such as 
that caused by the hepatitis C virus. Aldara, which was approved in 
1997 and is prescribed for a number of skin conditions including 
genital warts, was later discovered to act by stimulating a TLR. 

Innate responses to invading pathogens are typically accompa- 
nied by a process of inflammation at the site of infection where 
certain cells and plasma proteins leave the blood vessels and enter 
the affected tissues (Section 7.8). These events are accompanied by 
local redness, swelling, and fever. Inflammation provides a means for 
concentrating the body’s defensive agents at the site where they are 
needed. During inflammation, phagocytic cells migrate toward a site 
of infection in response to chemicals (chemoattractants) released at 
the site (Section 9.14). Once there, these cells recognize, engulf, and 
destroy the pathogen (Figure 17.2a). Inflammation is a dual-edged 
sword. Although it protects the body against an invading pathogen, 
if inflammation is not terminated in a timely manner, it can lead to 
damage to the body’s normal tissues and chronic disease. Regulation 
of inflammation is a complex and poorly understood process 
involving a balance between pro- and anti-inflammatory activities. 

A number of other mechanisms are also geared to attacking 
extracellular pathogens. Both epithelial cells and lymphocytes 
secrete a variety of antimicrobial peptides, called defensins, which 
are able to bind to viruses, bacteria, or fungi and bring about their 
demise. Blood also contains a group of soluble proteins called com- 
plement that binds to pathogens, triggering their destruction. In 
one of the complement pathways, an activated assembly of these 
proteins perforates the plasma membrane of a bacterial cell, leading 
to cell lysis and death (Figure 17.2b). 


FIGURE 17.4 Model of a Toll-like receptor (TLR3) bound to a double- 
stranded RNA (dsRNA) molecule. The extracellular ligand-binding 
domains of TLRs contain a large curved surface composed largely of 
leucine-rich repeats that provide flexibility in ligand binding. The dsRNA 
ligand (blue and orange double helix) binds to a TLR dimer whose 
transmembrane and cytoplasmic domains are brought into close contact as 
shown in this model. The complex is “ready” to send an intracellular signal 
alerting the cell of the presence of the foreign nucleic acid molecule. 


Source: From Lin Liu et al., Courtesy of David R. Davies, Science 320:381, 
2008; © 2008. Reprinted with permission from AAAS. 


FIGURE 17.5 Innate immunity. Pseudocolored scanning electron 
micrograph of a natural killer cell (yellow) bound to a target cell (orange). 
NK cells kill their targets by a similar mechanism as described for CTLs in 
Section 17.6. 


Source: Eye of Science / Science Source. 


Innate responses against intracellular pathogens, such as viruses, 
are targeted primarily against cells that are already infected. Cells 
infected with certain viruses are recognized by a type of nonspecific 
lymphocyte called a natural killer (NK) cell. As its name implies, 
NK cells act by causing the death of the infected cell (Figure 17.2c). 
The NK cell induces the infected cell to undergo apoptosis (Section 
15.17). NK cells can also kill certain types of cancer cells in vitro 
(FIGURE 17.5) and may provide a mechanism for destroying such 
cells in vivo before they develop into a tumor. Normal (i.e., nonin- 
fected and nonmalignant) cells possess surface molecules that pro- 
tect them from attack by NK cells. 

Another type of innate antiviral response is initiated within the 
infected cell itself. Virus-infected cells produce proteins called type 
1 interferons (IFN-a« and IFN-f) that are secreted into the extracel- 
lular space, where they bind to the surface of noninfected cells ren- 
dering them resistant to subsequent infection (Figure 17.2d). 
Interferons accomplish this by several means, including activation 
of a signal transduction pathway that results in phosphorylation 
and consequent inactivation of the translation factor eIF2 (page 
443). Cells that have undergone this response cannot synthesize 
viral proteins that are required for virus replication. IFN-B may also 
induce the synthesis of cellular microRNAs that target viral RNA 
genomes. 

The innate and adaptive immune systems do not function 
independently but work closely together to destroy a foreign 
invader. The dependence of adaptive immune responses on prior 
events orchestrated by cells of the innate system was an important 
tenet of Janeway’s original hypothesis (page 663). Most impor- 
tantly, the same phagocytic cells and NK cells that carry out an 
immediate innate response are also responsible for initiating the 
much slower, more specific adaptive immune response. As we will 
see below, it is the nonspecific cells of the innate system that are 
responsible for activating only those specific cells of the adaptive 
system that are capable of dealing with the particular threat that is 
at hand. 


Adaptive Immune Responses 


Unlike innate responses, adaptive immune responses require a 
lag period during which the immune system gears up for an attack 
against a foreign agent. Unlike innate responses, adaptive immune 


responses are highly specific and can discriminate between two very 665 


similar molecules. For example, the blood of a person who has just 
recovered from measles contains antibodies that react with the virus 
that causes measles, but not with a related virus, such as that which 
causes mumps. Unlike the innate system, the adaptive system also 
has a “memory,” which usually means that the person will not suffer 
again from the same pathogen later in life. 

Whereas all animals possess some type of innate immunity against 
microbes and parasites, only vertebrates are known to mount an adap- 
tive response. There are two broad categories of adaptive immunity: 


e Humoral immunity, which is carried out by antibodies 
(Figure 17.2e,f). Antibodies are globular, bloodborne proteins of 
the immunoglobulin superfamily (IgSF). 


e Cell-mediated immunity, which is carried out by cells 
(Figure 17.2g). 


Both types of adaptive immunity are mediated by lymphocytes, 
which are nucleated leukocytes (white blood cells) that circulate 
between the blood and lymphoid organs. Humoral immunity is 
mediated by B lymphocytes (or B cells) that, when activated, 
differentiate into cells that secrete antibodies. Antibodies are 
directed primarily against foreign materials that are situated out- 
side the body’s cells. Such materials include the protein and poly- 
saccharide components of bacterial cell walls, bacterial toxins, and 
viral coat proteins. In some cases, antibodies can bind to a bacterial 
toxin or virus particle and directly prevent the agent from entering 
a host cell (Figure 17.2e). In other cases, antibodies function as 
“molecular tags” that bind to an invading pathogen and mark it for 
destruction. Bacterial cells coated with antibody molecules (Figure 
17.2f) are rapidly ingested by wandering phagocytes or destroyed 
by complement molecules carried in the blood. Antibodies are not 
effective against pathogens that are present inside cells, hence the 
need for a second type of weapon system. Cell-mediated immunity 
is carried out by T lymphocytes (or T cells) that, when activated, 
can specifically recognize and kill an infected (or foreign) cell 
(Figure 17.2g). 

Band T cells arise from the same type of precursor cell (a hemat- 
opoietic stem cell) in the bone marrow, but they differentiate along 
different pathways in different lymphoid organs. A summary of the 
various pathways of differentiation of the hematopoietic stem cell is 
shown in FIGURE 17.6. B lymphocytes differentiate in the fetal liver 
or adult bone marrow, whereas T lymphocytes differentiate in the 
thymus gland, an organ located in the chest that reaches its peak 
size during childhood. Because of these differences, cell-mediated 
and humoral immunity can be dissociated to a large extent. For 
instance, humans may suffer from a rare disease called congenital 
agammaglobulinemia in which humoral antibody is deficient and 
cell-mediated immunity is normal. 


REVIEW | 


1. Contrast the general properties of innate and adaptive 
immune responses. 


2. List four types of innate immune responses. Which 
would be most effective against a pathogen inside an 
infected cell? 


| 


3. What is meant by the terms “humoral” and “cell- 


mediated” immunity? 


oq______—_- 
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FIGURE 17.6 Pathways of differentiation of a hematopoietic stem 
cell of the bone marrow. A hematopoietic stem cell can give rise to 

two different progenitor cells: a myeloid progenitor cell that can 
differentiate into most of the various blood cells (e.g., erythrocytes, 
basophils, and neutrophils), macrophages, or dendritic cells; or a lymphoid 
progenitor cell that can differentiate into any of the various types of 
lymphocytes (NK cells, T cells, or B cells). T-cell precursors migrate to the 
thymus where they differentiate into T cells. In contrast, B cells undergo 
differentiation in the bone marrow. Cells in the various stages of B- and 
T-cell differentiation can be distinguished by the species of proteins at 
their cell surface and/or the transcription factors that determine the genes 
being expressed. 
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17.2 The Clonal Selection 
Theory as It Applies to B Cells 


If a person is infected with a virus or exposed to foreign material, his 
or her blood soon contains a high concentration of antibodies capa- 
ble of reacting with the foreign substance, which is known as an 
antigen. Most antigens consist of proteins or polysaccharides, but 
lipids and nucleic acids can also serve in this capacity. How can the 
body produce antibodies that react specifically with an antigen to 
which the body is exposed? In other words, how does an antigen 
induce an adaptive immune response? Initially, it was thought that 
antigens somehow instructed lymphocytes to produce complemen- 
tary antibodies. It was suggested that an antigen wraps itself around 
an antibody molecule, molding the antibody into a shape capable of 
combining with that particular antigen. In this “instructive” model, 
the lymphocyte only gains its ability to produce a specific antibody 
after its initial contact with antigen. In 1955, Niels Jerne, a Danish 
immunologist, proposed a radically different mechanism. Jerne sug- 
gested that the body produces small amounts of randomly struc- 
tured antibodies in the absence of any antigen. As a group, these 
antibodies would be able to combine with any type of antigen to 
which a person might someday be exposed. According to Jerne’s 
model, when a person is exposed to an antigen, the antigen com- 
bines with a specific antibody, which somehow leads to the subse- 
quent production of that particular antibody molecule. Thus in 
Jerne’s model, the antigen selects those preexisting antibodies capa- 
ble of binding to it. In 1957, the concept of antigen selection of anti- 
bodies was expanded into a comprehensive model of antibody 
formation by the Australian immunologist F. MacFarlane Burnet. 
Burnet’s clonal selection theory quickly gained widespread 
acceptance. An overview of the steps that occur during the clonal 
selection of B cells is shown in FIGURE 17.7. More detailed discussion 


Proliferation 

of antigen-specific 

B cell following 
lymphocyte selection 


Plasma cells secreting 
a. capable of 


5 ] 
Memory cell remains 
behind for future encounter 
with antigen (responsible 
for long-term immunity) 


FIGURE 17.7 The clonal selection of B cells by a thymus-independent antigen. The steps are described in the figure and also in the text. 


of these events is provided later in the chapter. The clonal selection 
of T cells is described in the following section. The main features of 
B-cell clonal selection are the following: 


1. Each B cell becomes committed to produce one species of antibody. 
B cells arise from a population of undifferentiated and indistin- 
guishable progenitor cells. As it differentiates, a B cell becomes 
committed as the result of DNA rearrangements (see Figure 17.8) 
to producing only one species of antibody molecule (Figure 17.7, 
step 1). Thousands of different DNA rearrangements are possible, 
so that different B cells produce different antibody molecules. 
Thus, even though mature B cells appear identical under the micro- 
scope, they can be distinguished by the antibodies they produce. 


2. Bcells become committed to antibody formation in the absence 
of antigen. The basic repertoire of antibody-producing cells that 
a person will possess throughout his or her lifetime is already 
present within the lymphoid tissues prior to stimulation by an 
antigen and is independent of the presence of foreign materials. 
Each B cell displays its particular antibody on its surface with the 
antigen-reactive portion facing outward. As a result, the cell is 
coated with antigen receptors that can bind specifically with anti- 
gens having a complementary structure. Although most lym- 
phoid cells are never required during a person's lifetime, the 
immune system is primed to respond immediately to any antigen 
that a person may be exposed to. The presence of cells with dif- 
ferent membrane-bound antibodies can be demonstrated experi- 
mentally, as shown in FIGURE 17.8. 


3. Antibody production follows selection of B cells by antigen. In 
most cases, the activation of a B cell by antigen requires the 
involvement of T cells (described in Figure 17.12 and page 689). 
A few antigens, however, such as the polysaccharides present in 
bacterial cell walls, activate B cells by themselves; antigens of this 
type are described as thymus-independent antigens. For simplic- 
ity, we will restrict the discussion at this point in the chapter to a 
thymus-independent antigen. Suppose a person were to be 
exposed to Haemophilus influenzae type B, an encapsulated bac- 
terium that can cause fatal meningitis. The capsule of these bac- 
teria contains a polysaccharide that can bind to a tiny fraction of 
the body’s B cells (Figure 17.7, step 2). The B cells that bind the 
polysaccharide contain membrane-bound antibodies whose 
combining site allows them to interact specifically with that anti- 
gen. In this way, an antigen selects those lymphocytes that pro- 
duce antibodies capable of interacting with that antigen. Antigen 
binding activates the B cell, causing it to proliferate (Figure 17.7, 
step 3) and form a population (or clone) of lymphocytes, all of 
which make the same antibody. Some of these activated cells dif- 
ferentiate into short-lived plasma cells that secrete large 
amounts of antibody molecules (Figure 17.7, step 4). Unlike their 
B-cell precursors (FIGURE 17.9a), plasma cells possess an exten- 
sive rough ER characteristic of cells that are specialized for pro- 
tein synthesis and secretion (Figure 17.9b). 


4. Immunologic memory provides long-term immunity. Not all B 
lymphocytes that are activated by antigen differentiate into anti- 
body-secreting plasma cells. Some remain in lymphoid tissues as 
memory B cells (Figure 17.7, step 5) that can respond rapidly at a 
later date if the antigen reappears in the body. Although plasma 
cells die off following removal of the antigenic stimulus, memory 
B cells may persist for a person’s lifetime. It has been demon- 
strated, for example, that elderly individuals who were alive 
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FIGURE 17.8 Experimental demonstration that different B cells 
contain a different membrane-bound antibody and that these antibod- 
ies are produced in the absence of antigen. In this experiment, B cells are 
prepared from a mouse spleen (step 1). In step 2, the spleen cells are 
passed through a column containing beads coated with an antigen 
(antigen A) to which the mouse had never been exposed. A tiny fraction of 
the spleen cells bind to the beads, while the vast majority of spleen cells 
pass directly through the column (shown in step 3). In step 4, spleen cells 
from the previous experiment are passed through one of two different 
columns: a column whose beads are coated with antigen A or a column 
whose beads are coated with an unrelated antigen (antigen B) to which the 
mouse had never been exposed. In step 4a, the spleen cells tested are 
those that had bound to the beads in the previous step. These cells are 
found to rebind to beads coated with antigen A, but do not bind to beads 
coated with antigen B. In step 4b, the spleen cells tested are those that did 
not bind to the beads in the previous step. None of these cells bind to 
beads coated with antigen A, but a tiny fraction binds to beads coated with 
antigen B. 
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FIGURE 17.9 Comparison of the structure of a B cell (a) and a plasma cell (b). The plasma cell has a much larger cytoplasmic compartment than the 
B cell, with more mitochondria and an extensively developed rough endoplasmic reticulum. These characteristics reflect the synthesis of large numbers of 
antibody molecules by the plasma cell. 


Source: Steve Gschmeissner/Photo Researchers, Inc. 


down of the tolerant state can lead to the development of debili- 
tating autoimmune diseases. 


during the 1918 influenza pandemic still contain circulating 
B cells that are specific for the pathogen they were exposed to 90 


years earlier. When stimulated by the same antigen, some of the Several principles of the clonal selection theory can be illus- 


memory B cells rapidly proliferate into plasma cells, generating a 
secondary immune response in a matter of hours rather than the 
days required for the original response (see Figure 17.13). 


. Immunologic tolerance prevents the production of antibodies 


against self. As discussed below, genes encoding antibodies are 
generated by a process in which DNA segments are randomly 
combined. As a result, genes are invariably formed that encode 
antibodies that can react with the body’s own tissues, which 
could produce widespread organ destruction and subsequent 
disease. It is obviously in the best interest of the body to prevent 
the production of such proteins, which are called autoantibod- 
ies. As they develop, many of the B cells capable of producing 
autoantibodies are either destroyed or rendered inactive. As a 
result, the body develops an immunologic tolerance toward itself. 
As discussed in the Human Perspective (Section 17.3) a break- 


trated by briefly considering the subject of vaccination. 


REVIEW | 


1. Contrast instructive and selective mechanisms of 
antibody production. 

2. What are the basic tenets of the clonal selection theory? 

3. What does it mean for a B cell to become committed 
to antibody formation? How is this process influenced 
by the presence of antigen? What role does antigen 
play in antibody production? 

4. What is meant by the terms immunologic memory and 
immunologic tolerance? 


. —_ 


17.3 THE HUMAN PERSPECTIVE 


Autoimmune Diseases 


The immune system requires complex and highly specific inter- 
actions between many different types of cells and molecules. 
Numerous events must take place before a humoral or cell-mediated 
immune response can be initiated, which makes these processes 
vulnerable to disruption at various stages by numerous factors. 
Included among the various types of immune dysfunction are 


autoimmune diseases, which result when the body mounts an 
immune response against part of itself. More than 80 different 
autoimmune diseases have been characterized, affecting approx- 
imately 5 percent of the population. 

Because the specificity of the antigen receptors of both T and 
B cells is determined by a process of random gene rearrangement, 


it is inevitable that some members of these cell populations pos- 
sess receptors directed against the body’s own proteins (self- 
antigens). Lymphocytes that bind self-antigens with high affinity 
tend to be removed from the lymphocyte population, making the 
immune system tolerant toward self. However, some of the self- 
reactive lymphocytes generated in the thymus and bone marrow 
escape the body's negative selection processes, giving them the 
potential to attack normal body tissues. The presence of B and 
T lymphocytes capable of reacting against the body's tissues is 
readily demonstrated in healthy individuals. For example, when 
T cells are isolated from the blood and treated in vitro with a 
normal self-protein, together with the cytokine IL-2, a small number 
of cells in the population are likely to proliferate to form a clone 
of cells that can react to that self-antigen. Similarly, if laboratory 
animals are injected with a purified self-protein together with 
an adjuvant, which is a nonspecific substance that enhances the 
response to the injected antigen, they mount an immune response 
against the tissues in which that protein is normally found. Under 
normal circumstances, B and T cells capable of reacting to self- 
antigens are inhibited by antigen-specific Tk.g cells (page 678) or 
by other suppressive mechanisms. When these mechanisms fail, 
a person may suffer from an autoimmune disease, including those 
described below. 


1. Multiple sclerosis (MS) is a chronic inflammatory disease that 
typically strikes young adults, causing severe and often progres- 
sive neurological damage. MS results from an attack by immune 
cells and/or antibodies against the myelin sheath that surrounds 
the axons of nerve cells (Figure 4.5). These sheaths form the 
white matter of the central nervous system. The demyelination of 
nerves that results from this immunologic attack interferes with 
the conduction of nerve impulses along axons, leaving the person 
with diminished eyesight, problems with motor coordination, 
and disturbances in sensory perception. A disease similar to MS, 
called experimental allergic encephalomyelitis, can be induced 
in laboratory animals by injection of myelin basic protein, a major 
component of the myelin plasma membrane. However, ques- 
tions have been raised about the validity of this and other mouse 
models of human autoimmune diseases because agents that 
have shown efficacy in treating these affected mice often do not 
translate into effective treatments for humans. 


2. Type 1 diabetes (T1D) typically arises in children and results 
from the autoimmune destruction of the insulin-secreting 
B cells of the pancreas. Destruction of these cells is mediated 
by self-reactive T cells, whose proliferation may be stimulated 
by a viral infection. At present, patients with T1D are adminis- 
tered daily doses of insulin. While the hormone allows them 
to survive, these individuals may still be subject to degenera- 
tive kidney, vascular, and retinal disease. Patients typically pos- 
sess a significant fraction of their pancreatic B cells (e.g., 10-20 
percent) at the time they are diagnosed with the disease. It is 
hoped that treatments can be developed that will stop further 
loss of these cells and possibly even increase the number of 
these cells in their pancreas. As with MS, a well-studied mouse 
model for T1D, called NOD (non-obese diabetic) mice, is often 
used to test drugs in preclinical studies. 


3. Graves’ disease and thyroiditis are autoimmune diseases of 
the thyroid that produce very different symptoms. In Graves’ 
disease, the target of immune attack is the TSH receptor on 
the surface of thyroid cells that normally binds the pituitary 
hormone thyroid-stimulating hormone (TSH). In patients with 
this disease, autoantibodies bind to the TSH receptor, causing 
the prolonged stimulation of thyroid cells, leading to hyper- 
thyroidism (i.e., elevated blood levels of thyroid hormone). 


Thyroiditis (or Hashimoto's thyroiditis) develops from an 
immune attack against one or more common proteins of thy- 
roid cells, including thyroglobulin. The resulting destruction 
of the thyroid gland leads to hypothyroidism (i.e., decreased 
blood levels of thyroid hormone). 


4. Rheumatoid arthritis affects approximately 1 percent of the 
population and is characterized by the progressive destruc- 
tion of the body’s joints due to a cascade of inflammatory 
responses. In a normal joint, the synovial membrane that lines 
the synovial cavity is only a single cell layer thick. In persons 
with rheumatoid arthritis, this membrane becomes inflamed 
and thickened due to the infiltration of autoreactive immune 
cells and/or autoantibodies into the joint. Over time, the 
cartilage is replaced by fibrous tissue, which causes immobi- 
lization of the joint. 


5. Systemic lupus erythematosus (SLE) gets its name “red wolf” 
from a reddish rash that develops on the cheeks during the 
early stages of the disease (FIGURE 1). Unlike the other auto- 
immune diseases discussed above, SLE is seldom confined 
to a particular organ but often attacks tissues throughout the 
body, including the central nervous system, kidneys, and heart. 
The serum of patients with SLE contains antibodies directed 
against a number of components that are found in the nuclei of 
cells, including small nuclear ribonucleoproteins (snRNPs); pro- 
teins of the centromeres of chromosomes; and, most notably, 
double-stranded DNA. Recent studies suggest that autoim- 
munity occurs when TLRs that normally recognize microbial 
DNA and RNA (page 664) mistakenly bind to the body's own 
informational macromolecules. The incidence of SLE is particu- 
larly high in women of child-bearing age, suggesting a role for 
female hormones in triggering the disease. 


6. Inflammatory bowel diseases (IBDs), such as Crohn's dis- 
ease and ulcerative colitis, are characterized by painful inflam- 
mation of the intestine. Accumulating evidence suggests 
that these conditions result from an inappropriate response 
by the immune system to the normal commensal bacteria 
that inhabit our digestive system. Genome-wide association 
studies (page 400) have linked more than 50 genetic loci with 
susceptibility to these diseases. 


Not everyone in the population is equally susceptible to 
developing one of these autoimmune diseases. Most of these 
disorders appear much more frequently in certain families than in 
the general population, indicating a strong genetic component 
to their development. Although many different genes have been 
shown to increase susceptibility to autoimmune diseases, those 
that encode MHC class II polypeptides are most strongly linked. 
For example, people who inherit certain alleles of the MHC 
locus are particularly susceptible to developing type 1 diabetes. 
It is thought that cells bearing MHC molecules encoded by the 
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FIGURE 1 A “butterfly” rash that often appears as one of the early 
symptoms of SLE. 


Source: American College of Rheumatology 
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susceptible alleles can bind particular peptides that stimulate the 
formation of autoantibodies against insulin-secreting B cells of 
the pancreas. Other loci that are correlated with increased risk for 
autoimmune diseases include genes that encode proteins that 
are involved in T-cell signaling pathways, such as the protein tyro- 
sine phosphatase PTPN22, and genes that encode certain pro- 
inflammatory cytokines or their receptors. 

Possession of high-risk alleles may be necessary for an 
individual to develop certain autoimmune diseases, but it is not 
the only contributing factor. Studies of identical twins indicate 
that if one twin develops an autoimmune disease, the likelihood 
that the other twin will also develop the disease ranges from 
about 25 to 75 percent, not 100 percent as expected if genetics 
were the only contributing factor. Identifying the specific envi- 
ronmental or epigenetic factors (Section 12.7) that contribute to 
the development of these diseases has proven even more elusive 
than identification of the predisposing genes. 

The great progress that has been made over the past two 
decades in our understanding of the cellular and molecular basis 
of immunity has led to new treatments for a number of autoim- 
mune diseases that involve manipulation of the body’s immune 
system. These treatments have been tested in animal models (i-e., 
animals that can be made to develop diseases similar to those 
of humans) and their safety and efficacy determined in human 
clinical trials. Among the approaches taken are: 


@ Treatment with immunosuppressive drugs, such as cyclosporin A or 
CellCept, that block the autoimmune response. Corticosteroids, such 
as prednisone, are also prescribed for short periods. Because 
these drugs are nonspecific, they inhibit all types of immune 
responses and, therefore, render the patient susceptible to 
dangerous infections. 


@ Restoring immunological tolerance to self-antigens so that 
the body stops producing autoantibodies and autoreactive 
T cells. Of all of the approaches discussed in this section, 
this is the only one that promises the potential for antigen- 
specific therapy that targets a specific population of auto- 
reactive immune cells; all of the others exert a nonspecific 
influence on the immune system. One way to induce toler- 
ance to specific antigens is to administer peptides (called 
APLs) that resemble the peptides that would be generated 
from the self-antigens responsible for causing the disease. It 
is hoped that such APLs would bind to TCRs in a suboptimal 
manner, blocking T-cell activation and reducing the secretion 
of inflammatory cytokines (e.g., TNF-« and IFN-a). One drug 
of this type (Copaxone) consists of a mixture of synthetic pep- 
tides whose structure resembles that of myelin basic protein. 
Copaxone remains the only antigen-specific therapy for auto- 
immune diseases to be approved to date but a number of 
similar types of peptide “vaccines” are in clinical trials. While 
Copaxone may reduce the frequency of relapses in patients 
with multiple sclerosis, it does not stop disease progression 
and may elicit severe allergic side effects. Another approach 
to inducing immunological tolerance to myelin-derived pro- 
teins in patients with MS has been to isolate autoreactive 
T cells from the patient, expand their numbers in culture, 
render them incapable of replication, and inject them back 
into the patient with the aim of inducing an immune reaction 
against the reintroduced cells and other autoreactive cells in 
the body. Phase Ilb trials of this patient-specific, anti-T cell 
vaccine (Tovaxin or Tcelna) began in 2011 and have yielded 
promising results. Yet another way to induce immunological 
tolerance is to administer a modified version of the protein 


CTLA4, which binds to the B7 costimulatory protein on the 
surface of APCs and inhibits these cells’ ability to activate 
autoreactive T cells. Orencia, which acts in this manner, has 
been approved for patients with rheumatoid arthritis. Many 
researchers believe that the best long-term, antigen-specific 
approach to reestablishing tolerance is to treat a person with 
his or her own Treg cells. In this approach, the desired Treg 
cells would be isolated from the patient's blood, allowed to 
proliferate extensively in vitro, and then reinjected back into 
the patient in an attempt to suppress the specific self-reactive 
immune cells that are waging the autoimmune attack. Several 
clinical trials involving the transfer of Treg cells have begun in 
patients who are experiencing immunological complications 
following organ transplantation. 


Blocking the effect of pro-inflammatory cytokines, which are 
among the agents that wreak the greatest tissue destruction 
in many autoimmune diseases. The best studied examples of 
this approach are antibodies (e.g., Remicade, Cimzia, Simponi, 
and Humira) and a recombinant fusion protein (Enbrel) that 
act against the pro-inflammatory cytokine TNF-a. These drugs 
have been approved for the treatment of rheumatoid arthritis 
and can have dramatic curative effects in many patients. At 
the same time, blocking the action of TNF-a can increase 
the risk of infections (including tuberculosis) and lymphoma. 
IL-1 is another key pro-inflammatory cytokine, and a number 
of IL-1 inhibitors are presently in the drug pipeline. Kineret, a 
recombinant protein that blocks the activity of IL-1 by binding 
to the IL-1 receptor, has been in use for over a decade for the 
treatment of rheumatoid arthritis. Similarly, a human mono- 
clonal antibody (Stelara) directed against IL-12 and IL-23 shows 
efficacy against Crohn's disease and psoriasis. 


Treatment with cytokines. At present, the most widely prescribed 
treatment for multiple sclerosis is one of several approved 
IFN-B cytokines (Avonex, Rebif, Extavia, and Betaseron), which 
reduce disease progression by an average of 35 percent. IFN-B 
has many activities, and the precise mechanism of action is 
debated. 


Treatment with agents that destroy B cells or block their 
activation. Although autoimmune diseases are generally thought 
to result from the dysfunction of T cells, particularly helper 
T cells, there is considerable evidence that self-reactive B cells 
have several roles to play. In addition to producing antibodies, 
B cells can serve as APCs for antigen-specific T cells, and they 
secrete a variety of cytokines. The importance of B cells in 
autoimmune disorders has been confirmed in clinical studies 
with two monoclonal antibodies (Rituxan and Arzerra) that 
bind to and destroy B cells. Rituxan had been used as a safe 
and successful treatment for lymphoma (page 655) before 
it was tested in patients with rheumatoid arthritis and mul- 
tiple sclerosis, where it has proved surprisingly effective. It is 
remarkable that even though these antibody-based drugs vir- 
tually deplete the blood of circulating B cells for a period of 
several months, they do not limit a patient's ability to mount 
immune responses against infectious agents. In 2011 an anti- 
body (Benlysta) that indirectly inhibits B cell production was 
approved for the treatment of SLE—the first new drug to be 
approved for treatment of the disease in 50 years. Benlysta 
acts by targeting BLyS, a protein that stimulates B cell prolifer- 
ation and differentiation, and provides benefit to a fraction of 
patients with SLE. 


Disrupting the movement of self-reactive immune cells to 
areas of inflammation. The first effective treatment based on this 


concept to be approved by the FDA was a monoclonal anti- 
body (Tysabri) that is directed against the integrin subunit aq 
present on the surface of activated T cells. The drug is aimed at 
preventing these T cells from crossing the blood-brain barrier 
(page 247) and attacking the myelin sheaths of the central ner- 
vous system of MS patients. There is always a concern with any 
type of immune therapy that the treatment will interfere with 
the body's ability to fight infection. This concern was realized 
when Tysabri was temporarily removed from the market in 2005 
after several patients developed a serious viral brain infection 
(called PML). At the present time, Tysabri is widely prescribed 
to treat MS, although the small risk of contracting PML remains 
a concern. In 2010 the first MS drug that could be taken orally, 
called Gilenya, was approved by the FDA based on its ability to 
reduce relapses and delay the progression of the disease. Like 
Tysabri, Gilenya also blocks the migration of T cells but does 
so by a very different mechanism. Gilenya is a small synthetic 
compound that, once phosphorylated by the body, binds to 
sphingosine 1-phosphate receptors on the surface of T cells, 
causing the receptors to be internalized by endocytosis. Cells 
lacking the surface receptors cannot respond to their normal 
ligand, which would promote their migration out of the thymus 


and lymph nodes and into the bloodstream, from where they 
could reach the central nervous system. Another oral med- 
ication to treat MS, called Aubagio, was approved in 2012. 
Aubagio is known as a pyrimidine synthesis inhibitor, and a 
related compound has been used to treat rheumatoid arthritis. 
The mechanism of action is currently not well understood, but 
appears to act by inhibiting the expansion and migration of 
T and B cells to the CNS. 


@ Transplantation of hematopoietic stem cells (page 18) from 
either the patient (an autologous transplant) or a closely 
matched donor (an allogeneic transplant). Because this 
procedure has the potential to cause life-threatening compli- 
cations, it represents a treatment for only those patients with 
severe and debilitating autoimmune disease. However, unlike 
the drugs described above, transplant recipients begin the rest 
of their lives with a dramatically altered immune system and 
the possibility of being completely cured of their disease. It is 
estimated that about one-third of transplant recipients expe- 
rience dramatic long-term benefits from the procedure while 
another third obtain no apparent benefit at all. The reasons for 
this marked discrepancy in response remain unclear. 


17.4 Vaccination 


Edward Jenner practiced medicine in the English countryside at a 
time when smallpox was one of the most prevalent and dreaded dis- 
eases. Over the years, he noticed that the maids who tended the cows 
were typically spared the ravages of the disease. Jenner concluded 
that milkmaids were somehow “immune” to smallpox because they 
were infected at an early age with cowpox, a harmless disease they 
contracted from their cows. Cowpox produces blisters that resemble 
the pus-filled blisters of smallpox, but the cowpox blisters are local- 
ized and disappear, causing nothing more serious than a scar at the 
site of infection. 

In 1796, Jenner performed one of the most famous (and risky) 
medical experiments of all time. First, he infected an 8-year-old boy 
with cowpox and gave the boy time to recover. Six weeks later, Jenner 
intentionally infected the boy with smallpox by injecting pus from a 
smallpox lesion directly under the boy’s skin. The boy showed no 
signs of the deadly disease. Within a few years, thousands of people 
had become immune to smallpox by intentionally infecting them- 
selves with cowpox. This procedure was termed vaccination, after 
vacca, the Latin word for cow. 

Jenner’s experiment was successful because the immune 
response generated against the virus that causes cowpox happens to 
be effective against the closely related virus that causes smallpox. 
Most modern vaccines contain attenuated pathogens, which are 
pathogens that are capable of stimulating immunity but have been 
genetically “crippled” so that they are unable to cause disease. Most 
vaccines currently in use are B-cell vaccines, such as that employed 
to fight tetanus. Tetanus results from infection by the anaerobic soil 
bacterium Clostridium tetani, which can enter the body through a 


puncture wound. As they grow, the bacteria produce a powerful 
neurotoxin that blocks transmission across inhibitory synapses on 
motor neurons, leading to sustained muscle contraction and asphyx- 
iation. At 2 months of age, most infants are immunized against teta- 
nus by inoculation with a modified and harmless version of the 
tetanus toxin (called a toxoid). The tetanus toxoid binds to the sur- 
faces of B cells whose membrane-bound antibody molecules have a 
complementary binding site. These B cells proliferate to form a clone 
of cells that produce antibodies capable of binding to the actual teta- 
nus toxin. This initial response soon wanes, but the person is left 
with memory cells that respond rapidly if the person should happen 
to develop a C. tetani infection at a later date. Unlike most immuni- 
zations, immunity to the tetanus toxin does not last a lifetime, which 
is the reason that people are given a booster shot every 10 years or so. 
The booster shot contains the toxoid protein and stimulates the pro- 
duction of additional memory cells. What if a person receives a 
wound that has the potential to cause tetanus, and he cannot remem- 
ber ever receiving a booster shot? In these cases, the person is likely 
to be given a passive immunization, consisting of antibodies that can 
bind the tetanus toxin. Passive immunization is effective for only a 
short period of time and does not protect the recipient against a sub- 
sequent infection. 


REVIEW | 
1. Why did inoculation with cowpox successfully protect 
individuals from infection by smallpox? 


2. What is meant by the term passive immunization? 
@ 
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17.5 EXPERIMENTAL PATHWAYS 


The Role of the Major Histocompatibility Complex in Antigen Presentation 


In 1973, Hugh McDevitt and his colleagues at the Scripps 
Foundation in La Jolla, California, and at Stanford University, 
demonstrated that the susceptibility of mice to a particular path- 
ogen depends on the allele present at one of the MHC loci.’ They 
found that lymphocytic choriomeningitis virus (LCMV) causes a 
lethal brain infection in mice that are either homozygous or het- 
erozygous for the H-29 allele but does not cause infections in 
mice that are homozygous for the H—2K allele at this locus. 

These findings led Rolf Zinkernagel and Peter Doherty of 
the Australian National University to examine the role of cyto- 
toxic T lymphocytes (CTLs) in the development of this disease. 
Zinkernagel and Doherty planned experiments to correlate the 
level of CTL activity with the severity of the disease in mice hav- 
ing different MHC genotypes (or haplotypes, as they are called). 
Cytotoxic T lymphocyte activity was monitored using the follow- 
ing experimental protocol. Monolayers of fibroblasts (L cells) from 
one mouse were grown in culture and subsequently infected with 
the LCM virus. The infected fibroblasts were then overlaid by a 
preparation of spleen cells from a mouse that had been infected 
with the LCM virus seven days earlier. Waiting for seven days gives 
the animal’s immune system time to generate CTLs against virus- 
infected cells. The CTLs become concentrated in the infected 
animal's spleen. To monitor the effectiveness of the attack by the 
CTLs on the cultured L cells, the L cells were first labeled with a 
radioisotope of chromium (*!Cr). °'Cr is used as a marker for cell 
viability: As long as a cell remains alive, the radioisotope remains 
inside the cell. If a cell should be lysed during the experiment by 
a CTL, the *!Cr is released into the medium. 

Zinkernagel and Doherty found that the level of CTL activity 
against the cultured fibroblasts—as measured by the release of 
°!Cr—depended on the relative genotypes of the fibroblasts and 
spleen cells (Table 1).? All of the fibroblasts used to obtain the 
data shown in Table 1 were taken from an inbred strain of mice 
that were homozygous for the allele H—2* at the H-2 locus. When 
spleen cells were prepared from mice having an H—2K allele 
(e.g., CBA/H, AKR, and C3H/HedJ strains of mice), the L cells were 


TABLE 1 Cytotoxic Activity of Spleen Cells from Various Strains 
of Mice Injected 7 Days Previously with LCM Virus for Monolayers 
of LCM-Infected or Normal C3H (H-2*) Mouse L Cells 


% *'Cr release 


Infected Normal 


Exp Mouse strain 


65.1+3.3 207 
179+0,9 17.2+0.6 
22.7414 19.8+0.9 
56.1+0.5 16.7 £03 
24.8+2.4 19.8209 
nu/+or+/+ 42.8+2.0 21907 
nu/nu 23,3'+0:6 20.0+1.4 
CBA/H 85.5+3.1 209 £12 
AKR 712416 18.6 1.2 
DBA/2 24.5+1.2 217217 
CBA/H 7792.7 2S 
C3H/HeJ 77.8 +0.8 24.5+1.5 


CBA/H 

Balb/C 

CB57B1 

CBA/H3 xC57B1 
CB57BI x Balb/C 


Reprinted with permission from R. M. Zinkernagel and P. C. Doherty; Nature 
248:701, 1974; copyright 1974, Nature by Nature Publishing Group. Reproduced 
with permission of Nature Publishing Group in the format reuse in a book/ 
textbook via Copyright Clearance Center. 


effectively lysed. However, spleen cells taken from mice bearing 
H-2 or H—22 alleles at this locus were unable to lyse the infected 
fibroblasts. (The *!Cr released is approximately the same when 
noninfected fibroblasts are used in the assay, as shown in the right 
column of Table 1.) 

It was essential to show that the results were not pecu- 
liar to mice bearing H—2* alleles. To make this determination, 
Zinkernagel and Doherty tested LCMV-activated spleen cells 
from H-2 mice against various types of infected cells. Once 
again, the CTLs would only lyse infected cells having the same 
H-2 genotype, in this case H-2. These studies provided the first 
evidence that the MHC molecules on the surface of an infected 
cell restricts its interactions with T cells. T-cell function is said to 
be MHC restricted. 

These and other experiments during the 1970s raised ques- 
tions about the role of MHC proteins in immune cell function. 
Meanwhile, another line of investigation was focusing on the 
mechanism by which T cells were stimulated by particular 
antigens. Studies had indicated that T cells respond to antigen 
that is bound to the surface of other cells. It was presumed that 
the antigen being displayed had simply bound to the surface of 
the antigen-presenting cell (APC) from the extracellular medium. 
During the mid-1970s and early 1980s, studies by Alan Rosenthal 
at NIH, Emil Unanue at Harvard University, and others demon- 
strated that antigen had to be internalized by the APC and sub- 
jected to some type of processing before it could stimulate T-cell 
proliferation. Most of these studies were carried out in cell culture 
utilizing T cells activated by macrophages that had been previ- 
ously exposed to bacteria, viruses, or other foreign material. 

One of the ways to distinguish between antigen that is simply 
bound to the surface of an APC and antigen that has been pro- 
cessed by metabolic activities is to compare events that can occur 
at low temperatures (e.g., 4°C) where metabolic processes are 
blocked with those that occur at normal body temperatures. In 
one of these earlier experiments, macrophages were incubated 
with antigen for one hour at either 4°C or 37°C and then tested for 
their ability to stimulate T cells prepared from lymph nodes.? At 
lower antigen concentrations, macrophages were nearly 10 times 
more effective at stimulating T cells at 37°C than at 4°C, suggesting 
that antigen processing requires metabolic activities. Treatment of 
cells with sodium azide, a cytochrome oxidase inhibitor, also inhib- 
ited the appearance of antigen on the surface of T cells, indicating 
that antigen presentation requires metabolic energy.’ 

Subsequent experiments by Kirk Ziegler and Emil Unanue 
provided evidence that processing occurred as extracellular 
antigens were taken into the macrophage by endocytosis and 
delivered to the cell's lysosomal compartment.° One approach 
to determining whether lysosomes are involved in a particular 
process is to treat the cells with substances, such as ammonium 
chloride or chloroquine, that disrupt lysosomal enzyme activity. 
Both of these agents raise the pH of the lysosomal compartment, 
which inactivates the acid hydrolases (Section 8.15). Table 2 shows 
the effects of these treatments on processing and presentation of 
antigen derived from the bacteria Listeria monocytogenes. It can 
be seen from the table that neither of these substances affected 
the uptake (endocytosis) of antigen, but both substances mark- 
edly inhibited the processing of the antigen and its ability to stim- 
ulate binding of T cells to the macrophage. These data were among 
the first to suggest that fragmentation of extracellular antigens by 


TABLE 2 Inhibition of Antigen Presentation with NH,CI and Chloroquine 


10 mM NHA4Cl 


0.1 mM Chloroquine 


Control (%) 


Antigen uptake 


Antigen ingestion 

Antigen catabolism 

T cell-macrophage binding 
before antigen handling 
after antigen handling 


Source: H. K. Ziegler and E. R. Unanue, Proc. Nat’l. Acad. Sci. U.S.A. 79:176, 1982. 


lysosomal proteases may be an essential step in preparation of 
extracellular antigens prior to presentation. 

Other studies continued to implicate MHC molecules in the 
interaction between APCs and T cells. In one series of experi- 
ments, Ziegler and Unanue treated macrophages with antibodies 
directed against MHC proteins encoded by the H-2 locus. They 
found that these antibodies had no effect on the uptake or catab- 
olism of antigen,® but markedly inhibited the macrophages from 
interacting with T cells.” Inhibition of binding of T cells to macro- 
phages was obtained even when macrophages were exposed to 
the antibodies before addition of antigen. 

Evidence from these and many other studies indicated that 
the interaction between aT cell and a macrophage depended 
on the recognition of two components on the surface of the 
antigen-presenting cell: the antigen fragment being displayed 
and an MHC molecule. But there was no clear-cut picture as 
to how the antigen fragment and MHC molecule were related. 
Two models of antigen recognition were considered likely pos- 
sibilities. According to one model, T cells possess two distinct 
receptors, one for the antigen and another for the MHC protein. 
According to the other model, a single T-cell receptor recog- 
nizes both the MHC protein and the antigen peptide on the 
APC surface simultaneously. The balance of opinion began to 
shift in favor of the one-receptor model as evidence began to 
point to a physical association between MHC proteins and dis- 
played antigens. In one study, for example, it was shown that 
antigen that had been processed by T cells could be isolated 
as a complex with MHC proteins.® In this experiment, cultured 
T cells taken from H-2‘ mice were incubated with radioactively 
labeled antigen for 40 minutes. After the incubation period, pro- 
cessed antigen was prepared from the cells and passed through 
a column containing beads coated with antibodies directed 
against MHC proteins. When the beads were coated with anti- 
bodies against H—2* protein, an MHC molecule present in the 
T cells, large amounts of radioactive antigen adhered to the 
beads, indicating the association of the processed antigen with 
the MHC protein. If the beads were instead coated with anti- 
bodies against H—2° protein, an MHC protein that was not pre- 
sent in the T cells, relatively little radioactive antigen remained 
in the column. 

Following these early experiments, investigators turned their 
attention to the atomic structure of the molecules involved in 
T-cell interactions, which lies at the heart of the immune response. 
Rather than utilizing MHC class Il molecules on the surfaces of 
macrophages, structural studies have examined MHC class | 
molecules of the type found on the surfaces of virally infected 
cells. The first three-dimensional portrait of an MHC molecule 
was published in 1987 and was based on X-ray crystallographic 
studies by Don Wiley and colleagues at Harvard University.” The 
events leading up to this discovery are discussed in Reference 10. 
MHC class | molecules consist of (1) a heavy chain containing three 


Observed (%) 


Inhibition (%) Observed (%) Inhibition (%) 


extracellular domains (a,, a), and a3) and a single membrane- 
spanning segment and (2) an invariant B,m polypeptide (see 
Figure 17.23). Wiley and colleagues examined the structure of the 
extracellular (soluble) portion of the MHC molecule (a, a5, a3, 
and B, m) after removing the transmembrane anchor. A ribbon 
model of the observed structure is shown in FIGURE 1a, with the 
outer (antigen-bearing) portion of the protein constructed from 
the a, anda, domains. It can be seen from the top portion of this 
ribbon model that the inner surfaces of these domains form the 
walls of a deep groove approximately 25 A long and 10 A wide. It 
is this groove that acts as the binding site for peptides produced 
by antigen processing in the cytoplasm. As shown in Figure 1b, 
the sides of the antigen-binding pocket are lined by helices from 
the a, and a, domains, and the bottom of the pocket is lined 
by B sheet that extends from these same domains across the 
midline. The helices are thought to form relatively flexible side 
walls enabling peptides of different sequence to bind within the 
groove. 

Subsequent X-ray crystallographic studies described the 
manner in which peptides are positioned within the MHC antigen- 
binding pocket. In one of these studies, the spatial arrange- 
ment of several naturally processed peptides situated within the 
antigen-binding pocket of a single MHC class | molecule (HLA- 
B27) was determined.''! The backbones of all the peptides bound 
to HLA-B27 share a single, extended conformation running the 
length of the binding cleft. The N- and C-termini of the pep- 
tides are precisely positioned by numerous hydrogen bonds at 
both ends of the cleft. The hydrogen bonds link the peptide to a 
number of conserved residues in the MHC molecule that are part 
of both the sides and bottom of the binding groove. 

In another key study, lan Wilson and colleagues at the 
Scripps Research Institute in La Jolla, California, reported on the 
X-ray crystallographic structure of a mouse MHC class | protein 
complexed with two peptides of different length.'*'? The overall 
structure of the mouse MHC protein is similar to that of the human 
MHC protein shown in Figure 1a. In both cases, the peptides are 
bound in an extended conformation deep within the binding 
groove of the MHC molecule (FIGURE 2). This extended confor- 
mation allows for numerous interactions between the side chains 
of the MHC molecule and the backbone of the bound peptide. 
Because the MHC interacts primarily with the backbone of the 
peptide rather than its side chains, there are very few restrictions 
on the particular amino acid residues that can be present at var- 
ious sites of the binding pocket. As a result, each MHC molecule 
can bind a diverse array of antigenic peptides. 

An MHC-peptide complex projecting from the surface of 
an infected cell is only half the story of immunologic recogni- 
tion; the other half is represented by the T-cell receptor (TCR) 
projecting from the surface of the cytotoxic T cell. It has been 
evident for more than a decade that a TCR can somehow rec- 
ognize both an MHC and its contained peptide, but the manner 

continued 
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(a) 


(b) 


FIGURE 11 (a) Schematic representation of an MHC class | molecule, in this case the human protein HLA-A2. The molecule consists of two 

subunits: a heavy chain made up of three domains (0.,, a, and a3) and a B,m chain. The membrane-spanning portion of the heavy chain would connect 
with the polypeptide at the site marked C (for C-terminus). Disulfide bonds are indicated as two connected spheres. The peptide-binding groove 

is shown at the top of the drawing, between the a helical segments of the a, and a, domains of the heavy chain. (b) Schematic representation of 

the peptide-binding pocket of the MHC protein as viewed from the top of the molecule. The bottom of the pocket is lined by B sheet 

(orange-purple arrows) and the walls by helices (green). The a, domain is shown in orange and light green; the a, domain is shown in purple and 


dark green. 


Source: From Pamela J. Bjorkman et al., Nature 329:508, 509, 1987, Fig. 2a, 2b; reprinted by permission from Macmillan Publishers Ltd. 


by which this occurs eluded researchers because of difficulties in 
preparing protein crystals of the TCR that were suitable for X-ray 
crystallography. These difficulties were eventually overcome, and 
in 1996, reports were published by both the Wiley and Wilson 
laboratories that provided a three-dimensional portrait of the 
interaction between an MHC-peptide and TCR.™"° The overall 
structure of the complex formed between the two proteins is 
shown in FIGURE 3, where the backbones of the polypeptides 
are portrayed as tubes. 

The structure shown in Figure 3 depicts the portions of the 
proteins that project between a CTL and virally infected host cell. 
The lower half of the image shows the structure and orientation 
of the MHC class | molecule with the extended peptide antigen 
(yellow-green) embedded within the protein’s binding pocket. 
The upper half of the image shows the structure and orienta- 
tion of the TCR. As indicated in Figure 17.20b, a TCR consists of 
a and B polypeptide chains, each chain comprised of a variable 
(V) and constant (C) portion. Like immunoglobulins (Figure 17.15), 
the variable portion of each TCR subunit contains regions that 
are exceptionally variable (i.e., hypervariable). The hypervariable 
regions form protruding loops (shown as the colored segments 
of the two TCR polypeptides in Figure 3) that fit snugly over 
the outer end of the MHC-peptide complex. The hypervariable 
regions are referred to as complementarity-determining regions, 
or CDRs, because they determine the binding properties of the 
TCR. The CDRs of the TCR interact with the a helices of the a, 
and a, domains of the MHC, as well as the exposed residues of 
the bound peptide. The central CDRs of the TCR, which exhibit 


(b) 


FIGURE 2 Three-dimensional models of a peptide (shown in ball-and- 
stick structure) bound within the antigen-binding pocket of a mouse 
MHC class | protein (H—2K>). The peptide in a consists of eight amino 
acid residues; the peptide in b consists of nine residues. The peptides are 
seen to be buried deep within the MHC binding groove. 

Source: From Masazumi Matsumura, Daved H. Fremont, Per A. Peterson, 
and lan A. Wilson, Science 257:932, 1992. © 1992. Reprinted with 
permission from AAAS. 


FIGURE 3 Representation of the interaction between an MHC-peptide 
complex (at the bottom) and a TCR (at the top). The hypervariable 
regions (CDRs) of the TCR are shown as colored loops that form the 
interface between the two proteins. The bound peptide (P1-P8) is shown 
in yellow-green as it is situated within the binding groove of the MHC 
class | molecule. The peptide backbones are represented as tubes. 
Source: From K. Christopher Garcia et al., Courtesy of lan Wilson, 
Science 274:217, 1996, Fig. 9d, © 1996. Reprinted with permission 

from AAAS. 


17.6 T Lymphocytes: Activation 
and Mechanism of Action 
Like B cells, T cells are also subject to a process of clonal selection. 


T cells possess a cell-surface protein, called a T-cell receptor, that 
allows them to interact specifically with a particular antigen. Like 


the greatest sequence variability, interact primarily with the centrally 
situated bound peptide, whereas the outer CDRs, which have a less 
variable sequence, interact most closely with the a helices of the 
MHC."> Because of these interactions, the TCR meets both of its 
recognition “responsibilities”: It recognizes the bound peptide as 
a foreign antigen and the MHC as a self-protein. (Information from 
recent studies of additional TCR-peptide-MHC structures can be 
found in References 17-19.) 
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the antibody molecules that serve as B-cell receptors, the proteins 
that serve as T-cell receptors exist as a large population of molecules 
that have differently shaped combining sites. Just as each B cell pro- 
duces only one species of antibody, each T cell has only a single spe- 
cies of T-cell receptor. It is estimated that adult humans possess 
approximately 10’? T cells that, collectively, exhibit more than 10’ 
different antigen receptors. 
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T cells are activated by fragments of antigens that are displayed 
on the surfaces of other cells, called antigen-presenting cells 
(APCs). Consider what would happen if a liver or kidney cell were 
infected with a virus. The infected cell would display portions of the 
viral proteins on its surface (see Figure 17.24), enabling the infected 
cell to bind to a T cell with the appropriate (cognate) T-cell receptor. 
As a result of this presentation, the immune system becomes alerted 
to the entry of a specific pathogen. The process of antigen presenta- 
tion is discussed at length later in the chapter (Section 17.11) and is 
also the subject of the Experimental Pathways. 

Whereas any infected cell can serve as an APC in activating T 
cells, certain types of “professional” APCs are specialized for this 
function. Professional APCs include dendritic cells and macrophages 
(FIGURE 17.10). We will focus on dendritic cells (DCs), which 
were first discovered and characterized by Ralph Steinman of 
Rockefeller University in the early 1970s and are often described as 
the “sentinels” of the immune system. Dendritic cells have earned 
this title because they “stand guard” in the body’s peripheral tissues 
where pathogens are likely to enter (such as the skin and airways). 
DCs are equipped with a wide variety of receptors capable of recog- 
nizing virtually every type of pathogen. This property makes DCs a 
major component of the innate immune system (page 663). As dis- 
cussed in the following paragraphs, DCs utilize the information 
about the pathogens they ingest to initiate a response from the 
appropriate cells of the adaptive immune system. 

When present in the body’s peripheral tissues, immature DCs 
recognize and ingest microbes and other foreign materials by phago- 
cytosis. Once a microbe is taken into a dendritic cell, it has to be 
processed before its components can be presented to another cell. 
Antigen processing requires that the ingested material is fragmented 
enzymatically in the cytoplasm and the fragments moved to the cell 
surface (see Figure 17.23). DCs that have processed antigen migrate 
to nearby lymph nodes where they differentiate into mature antigen- 
presenting cells. Once in a lymph node, DCs come into contact with 
a large pool of T cells, including a minute percentage whose T-cell 
receptors can bind specifically with the processed foreign antigen, 


which activates the T cell. This dynamic process of DC-T cell inter- 
actions has been visualized recently in living lymph-node tissues by 
microscopic imaging of fluorescently labeled cells (FIGURE 17.11). In 
the absence of antigen, a given DC may interact transiently with as 
many as 500 to 5000 different T cells, remaining in contact with each 
cell for only a few minutes (as in Figure 17.11). In contrast, when the 
DC displays an antigen that is specifically recognized by the TCR of 
the T cell, the interaction between the cells is seen to last a period of 
hours, leading to the activation of the T cell, as evidenced by a tran- 
sient increase in the cytosolic Ca** concentration. 

Once activated, a T cell proliferates to form a clone of T cells 
having the same T-cell receptor. It is estimated that a single activated 
T cell can divide three to four times per day for several days, generat- 
ing a tremendous population of T cells capable of interacting with 
the foreign antigen. The massive proliferation of specific T lympho- 
cytes in response to an infection is often reflected in the enlargement 
of local lymph nodes. Once the foreign antigen has been cleared, the 
vast majority of the expanded T-cell population dies by apoptosis, 
leaving behind a relatively small population of memory T cells capa- 
ble of responding rapidly in the event of future contact with the same 
pathogen. 

Unlike B cells, which secrete antibodies, T cells carry out their 
assigned function through direct interactions with other cells, 
including B cells, other T cells, or target cells located throughout the 
body. This cell-cell interaction may lead to the activation, inactiva- 
tion, or death of the other cell. In addition to direct cell contact, 
many T-cell interactions are mediated by highly active chemical 
messengers, called cytokines, that work at very low concentrations. 
Cytokines are small secreted proteins produced by a wide variety of 
cells and include interferons (IFNs), interleukins (ILs), and tumor 
necrosis factors (TNFs). Cytokines bind to specific receptors on the 
surface of a responding cell, generating an internal signal that alters 
the activity of the cell. In responding to a cytokine, a cell may pre- 
pare to divide, undergo differentiation, or secrete its own cytokines. 
One family of small cytokines, called chemokines, acts primarily as 
chemoattractants that stimulate the migration of lymphocytes into 


(b) 


FIGURE 17.10 Professional antigen-presenting cells (APCs). (a) Colorized scanning electron micrograph of a Kupffer cell (a type of macrophage) as 
it penetrates openings in the endothelium that lines the sinusoidal vessels of the liver. These cells are capable of ingesting aged red blood cells and 
pathogens and accumulate at sites of infection or injury. (b) Colorized SEM of a dendritic cell. These irregular-shaped cells, which are characterized by 


long cytoplasmic (or dendritic) processes, are concentrated in the tissues that line the body's surfaces with the external environment. 


Source: (a) Courtesy of Thomas Deerinck, NCMIR, University of California, San Diego; (b) David Scharf/Photo Researchers, Inc. 


FIGURE 17.11 Live cell imaging of DCs 
and T cells, and their interactions, within a 
lymph node. T cells travel from lymph node to 
lymph node within the body. When they enter 
a lymph node, they migrate within the tissue 
as their surface is scanned by individual DCs 
with which they come into contact. (a) Several 
fluorescently labeled T cells (stained green, 
three of which are indicated by labeled 


00:00 


numbers) are seen moving around within a 
lymph node and coming in contact with an 
individual DC (stained red and indicated by 
the asterisk) over a period of 2.5 minutes. 

(b) The trajectory taken by each of the three 
numbered T cells and the asterisk-marked DC 
shown in part a. (c) Contacts within a lymph 
node between a single DC (green) and several 
T cells (orange) rendered in three dimensions. 
Contacts between the cells are dynamic, 
changing rapidly in size over a period of tens 
of seconds. 


Source: (a, 6) From Philippe Bousso and 
Ellen Robey, Nature Immunol. 4:581, 2003. 
Reprinted by permission from Macmillan 
Publishers Ltd.; (c) From Mark J. Miller et al., 
Courtesy of Michael D. Cahalan, Proc. Nat'l. 
Acad. Sci. U.S.A. 101:1002, 2004 © 2004, 
National Academy of Sciences, U.S.A. 
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inflamed tissue. Different types of lymphocytes and phagocytes 
possess receptors for different chemokines, so that their migration 
patterns can be separately controlled. A list of some of the best stud- 
ied cytokines is given in Table 17.1. 

Three major subclasses of T cells can be distinguished by the 
proteins on their surfaces and their biological functions. 


1. Cytotoxic T lymphocytes (CTLs) screen the cells of the body for 
abnormalities. Under normal circumstances, healthy cells are not 
harmed by CTLs, but aged or infected cells, and possibly some 
malignant cells, are attacked and killed. CTLs kill target cells by 
inducing them to undergo apoptosis. Two distinct cell-killing 


TABLE 17.1 Selected Cytokines 


Source Major functions 


Induces inflammation, stimulates 
Ty-cell proliferation 

Stimulates T-cell and B-cell proliferation 

Induces IgM to IgG class switching in B 
cells, suppresses inflammatory cytokine 
action 

Stimulates B-cell differentiation 

Inhibits macrophage function, suppresses 
inflammatory cytokine action 

Induces MHC expression in APCs, 
activates NK cells 

Induces inflammation, activates NO 
production in macrophages 

Stimulates growth and proliferation of 
granulocytes and macrophages 


Diverse 


Ty cells 
T cells 


Ty cells 

T cells, 
macrophages 

Ty cells, CTLs 


Diverse 


Ty cells, CTLs 
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pathways have been described. In one pathway, the CTL releases 
perforins and granzymes into a tightly enclosed space between 
the cells. Perforins are proteins that assemble within the mem- 
brane of the target cell to form transmembrane channels. 
Granzymes are proteolytic enzymes that enter the perforin chan- 
nels and activate caspases, which are the proteolytic enzymes that 
initiate the apoptotic response (Section 15.17). In the alternate 
pathway, the CTL binds to a receptor on the target cell surface, 
activating a suicide pathway in the target cell similar to that of 
Figure 15.39. By killing infected cells, CTLs eliminate viruses, 
bacteria, yeast, protozoa, and parasites after they have found 
their way into host cells and are no longer accessible to circulating 
antibodies. CTLs possess a surface protein called CD8 (cluster 
designation 8) and are referred to as CD8" cells. 


2. Helper T lymphocytes (T,, cells) activate other immune cells in 
an orchestrated attack against a specific pathogen. They are dis- 
tinguished from CTLs by the presence of the protein CD4 at their 
surface rather than CD8.' T,, cells are activated by professional 


‘There are three major subtypes of helper T cells, Ty1, Ty2, and Ty17 cells, which can be 
distinguished by the cytokines they secrete. All of these helper T cells stimulate B cells to 
produce antibodies, but each also has a unique function. T};1 cells produce IFN-y, which 
protects the body by activating macrophages to kill intracellular pathogens they might 
harbor (Section 9.15). T};2 cells produce IL-4, which mobilizes mast cells, basophils, and 
eosinophils to protect against extracellular pathogens, especially parasitic worms. 'T}17 
cells produce IL-17, which is thought to stimulate epithelial cells to recruit phagocytes 
and thereby prevent the entry of extracellular bacteria and fungi into the body. T}17 cells 
are also implicated in the development of autoimmune disease. The three subtypes of Ty; 
cells differentiate from a common precursor following stimulation by different cytokines 
and are defined by the presence of different “master” transcription factors (Section 12.11). 
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antigen-presenting cells, such as dendritic cells and macrophages 
as shown in Figure 17.12. This is one of the first and most impor- 
tant steps in initiating an adaptive immune response. Nearly all B 
cells require the help of T,, cells before they can mature and dif- 
ferentiate into antibody-secreting plasma cells. B cells are acti- 
vated by direct interaction with a T,, cell that is specific for the 
same antigen, as shown in FIGURE 17.12 (and in more detail in 
Figure 17.26). Thus, the formation of antibodies requires the 
activation of both T cells and B cells capable of interacting spe- 
cifically with the same antigen. The importance of T,, cells 
becomes apparent when one considers the devastating effects 
wrought by HIV, the virus that causes AIDS. T,, cells are the pri- 
mary targets of HIV. Most HIV-infected people remain free of 
symptoms as long as their T,,-cell count remains relatively high— 
above 500 cells/ul of blood (the normal count is over 1000 cells/ 
ul). Once the count drops below approximately 200 cells/ul, a 
person develops full-blown AIDS and becomes prone to attack 
by viral and cellular pathogens. 


3. Regulatory T lymphocytes (Treg cells) are primarily inhibitory 
cells that suppress the proliferation and activities of various types 
of immune cells. Virtually all of the suppressive activities of Treg 
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FIGURE 17.12 Highly simplified, schematic drawing showing the role 
of T, cells in antibody formation. In step 1, the macrophage interacts with 
the complex antigen. The antigen is taken into the macrophage and 
cleaved into fragments, which are displayed at the cell surface. In step 2, 
the macrophage interacts with a Ty cell whose TCR is bound to one of the 
displayed antigen fragments (the green membrane protein is an MHC 
molecule, Section 17.5). This interaction activates the T cell. In step 3, the 
activated Ty, cell interacts with a B cell whose antigen receptor is bound to 
an intact, soluble antigen. B-cell activation is stimulated by cytokines (e.g., 
IL-4, IL-5, and IL-6) released by the Ty, cell into the space separating it from 
the adjacent B cell. Interaction with the T,, cell activates the B cell, causing 
it to proliferate (step 4). The progeny of the activated B cell differentiate into 
plasma cells that synthesize antibodies that can bind the antigen (step 5). 


cells have been studied in vitro so it remains much less clear how 
these cells operate in the Poey. Treg Cells are characterized by 
possession of CD4* CD25" surface markers and are thought to 
play an important role in limiting inflammation and maintaining 
immunologic self-tolerance. T,., cells carry out this latter activity 
by suppressing CTLs that carry self-reactive receptors from 
attacking the body’s own cells. T,., cells appear also to protect a 
fetus from immunological attack by the pregnant mother. On the 
other hand, these same cells may prove detrimental to our health 
by preventing the immune system from ridding the body of 
tumor cells. The differentiation of T,,, cells requires stimulation 
by the cytokine IL-2 and leads to the expression by the cells of a 
key transcription factor, FOXP3. Mutations in the FOXP3 gene 
result in a fatal disease (IPEX), which is characterized by severe 
autoimmunity in newborn infants. Defects in T,.,, cells are widely 
suspected of playing a key role in the development of most auto- 
immune diseases. Studies of Treg cells have provided direct dem- 
onstration that homeostasis in the immune system requires a 
tight balance between stimulatory and inhibitory influences. 
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1. How does an infected cell in the body reveal its 
condition to a T cell? What is the T cell's response? 


2. What is an APC? What types of cells can act as APCs? 


. Compare and contrast the properties and functions 


of a Ty cell anda CTL. A Ty cell and a Treg cell. 
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17.7. The Modular Structure 
of Antibodies 


Antibodies are proteins produced by B cells and their descendants 
(plasma cells). B cells incorporate antibody molecules into their 
plasma membrane, where they serve as antigen receptors, whereas 
plasma cells secrete these proteins into the blood or other bodily flu- 
ids, where they serve as a molecular arsenal in the body’s war against 
invading pathogens. Interaction between bloodborne antibodies 
and antigens on the surface of a virus or bacterial cell can neutralize 
the pathogen’s ability to infect a host cell and facilitate the pathogen’s 
ingestion and destruction by wandering phagocytes. The immune 
system produces millions of different antibody molecules that, taken 
together, can bind any type of foreign substance to which the body 
may be exposed. Although the immune system exhibits great diversity 
through the antibodies it produces, a single antibody molecule can 
interact with only one or a few closely related antigenic structures. 
Antibodies are globular proteins called immunoglobulins. 
Immunoglobulins are built of two types of polypeptide chains, larger 
heavy chains (molecular mass of 50,000 to 70,000 daltons) and 
smaller light chains (molecular mass of 23,000 daltons). The two 
types of chains are linked to one another in pairs by disulfide bonds. 
Five different classes of immunoglobulin (IgA, IgD, IgE, IgG, and 
IgM) have been identified. The different immunoglobulins appear at 
different times after exposure to a foreign substance and have differ- 
ent biological functions (Table 17.2). IgM molecules are the first 
antibodies secreted by B cells following stimulation by an antigen, 
appearing in the blood after a lag of a few days (FIGURE 17.13). 
IgM molecules have a relatively short half-life (about five days), and 
their appearance is followed by secretion of longer-lived IgG and/or 
IgE molecules. IgG molecules are the predominant antibodies found 


TABLE 17.2 Classes of Human Immunoglobulins 


Molecular 
mass (kDa) 


360-720 


Heavy 
chain 


Light 


Class chain 


IgA 


Properties 


Present in tears, nasal mucus, 
breast milk, intestinal 
secretions 

Present in B-cell plasma 
membranes; function 
uncertain 

Binds to mast cells, releasing 
histamine responsible for 
allergic reactions 

Primary blood-borne soluble 
antibodies; crosses placenta 

Present in B-cell plasma 
membranes; mediates initial 
immune response; activates 
bacteria-killing complement 


(07 Kora 


IgD Korda 


KorA 


KorA 


KorA 


in the blood and lymph during a secondary response to most anti- 
gens (Figure 17.13). IgE molecules are produced at high levels in 
response to many parasitic infections. IgE molecules are also bound 
with high affinity to the surface of mast cells, triggering histamine 
release, which causes inflammation and symptoms of allergy. IgA is 
the predominant antibody in secretions of the respiratory, digestive, 
and urogenital tracts and acts to protect these mucosal linings from 
pathogens. The function of IgD is unclear. 

There are two types of light chains, kappa («) chains and lambda 
(A) chains, both of which are present in the immunoglobulins of all 
five classes. In contrast, each immunoglobulin class has a unique 
heavy chain that defines that class (Table 17.2).? We will focus primar- 
ily on the structure of IgGs. An IgG molecule is composed of two iden- 
tical light chains and two identical heavy chains arranged to form a 
Y-shaped molecule, as shown in FIGURE 17.14a and described below. 

To determine the basis of antibody specificity, it was first neces- 
sary to determine the amino acid sequence of a number of specific 
antibodies. Normally, the first step in amino acid sequencing is to 
purify the particular protein to be studied. Under normal condi- 
tions, however, it is impossible to obtain a purified preparation of a 
specific antibody from the blood because each person produces a 
large number of different antibody molecules that are too similar in 
structure to be separated from one another. The problem was solved 
when it was discovered that the blood of patients suffering from a 
type of lymphoid cancer called multiple myeloma contained large 
quantities of a single antibody molecule. 

As described in Chapter 16, cancer is a monoclonal disease; that 
is, the cells of a tumor arise from the proliferation of a single way- 
ward cell. Because a single lymphocyte normally produces only a 
single species of antibody, a patient with multiple myeloma produces 
large amounts of the specific antibody that is synthesized by the par- 
ticular cell that became malignant. One patient produces one highly 
abundant antibody species, and other patients produce different 
antibodies. As a result, investigators were able to purify substantial 
quantities of several antibodies from a number of patients and 
compare their amino acid sequences. An important pattern was 
soon revealed. It was found that half of each kappa light chain has a 
constant amino acid sequence among all kappa chains, whereas the 


? Humans actually produce four related heavy chains as part of their IgG molecules 
(forming IgG1, IgG2, IgG3, and IgG4) and two related heavy chains as part of their 
IgA molecules (forming IgA1 and IgA2) (see Figure 17.19). These differences will not 
be mentioned in the following discussion. 
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FIGURE 17.13 Primary and secondary antibody responses. A primary 
response, which is elicited by an initial exposure to an antigen, leads first to 
the production of soluble IgM antibody molecules, followed by production 
of soluble IgG antibody molecules. When the antigen is reintroduced at a 
later time, a secondary response is initiated. In contrast to the primary 
response, the secondary response begins with production of IgG (as well as 
IgM) molecules, leads to a much higher antibody level in the blood, and 
occurs with almost no delay. 


other half varies from patient to patient. A similar comparison of 
amino acid sequences of several lambda chains from different 
patients revealed that they too consist of a section of constant 
sequence and a section whose sequence varies from one immuno- 
globulin to the next. The heavy chains of the purified IgGs also con- 
tain a variable (V) and a constant (C) portion. A schematic structure 
of one of these IgG molecules is shown in Figure 17.14b. 

It was further found that, whereas approximately half of each 
light chain consists of a variable region (V,), only one-quarter of 
each heavy chain is variable (V,,) among different patients; the 
remaining three-quarters of the heavy chain (C,, ) is constant for all 
IgGs. The constant portion of the heavy chain can be divided into 
three sections of approximately equal length that are clearly homolo- 
gous to one another. These homologous Ig units are designated C,,1, 
C,,2, and C,,3 in Figure 17.140. It appears that the three sections of 
the C part of the IgG heavy chain (as well as those of heavy chains of 
the other Ig classes and the C portions of both kappa and lambda 
light chains) arose during evolution by the duplication of an ances- 
tral gene that coded for an Ig unit of approximately 110 amino acids. 
The variable regions (V,, or V,) are also thought to have arisen by 
evolution from the same ancestral Ig unit. Structural analysis indi- 
cates that each of the homologous Ig units of a light or heavy chain 
folds independently to form a compact domain that is held together 
by a disulfide bond (Figure 17.15). In an intact IgG molecule, each 
light chain domain associates with a heavy chain domain as shown 
in Figure 17.14a, b. Genetic analysis indicates that each domain is 
encoded by its own exon. 

The specificity of an antibody is determined by the amino acids 
of the antigen-combining sites at the ends of each arm of the Y-shaped 
antibody molecule (Figure 17.14). The two combining sites ofa single 
IgG molecule are identical, and each is formed by the association of 
the variable portion of a light chain with the variable portion of a 
heavy chain (Figure 17.14). Assembly of antibodies from different 
combinations of light and heavy chains allows a person to produce a 
tremendous variety of antibodies from a modest number of different 
polypeptides (Section 17.8). 

A closer look at the polypeptides of immunoglobulins reveals 
that the variable portions of both heavy and light chains contain sub- 
regions that are especially variable, or hypervariable, from one anti- 
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(a) 


Fab fragments 


Hinge region 


Fe fragment 


(b) 


FIGURE 17.14 Antibody structure. (a) Spacefilling model of an IgG molecule. The molecule contains four polypeptide chains: two identical light chains 
and two identical heavy chains. One of the heavy chains is shown in orange, the other in pink, while both light chains are shown in yellow. The domains of 
each chain (two per light chain and four per heavy chain) are evident. (b) Schematic model showing the domain structure of an IgG molecule. The tertiary 
structure of each Ig domain is maintained by a disulfide bond. Domains comprising a constant region of the polypeptide chain are indicated by the letter C; 
domains comprising a variable region are indicated by the letter V. Each heavy chain contains three Cy regions (C441, C42, C43) and one \y region at the 
N-terminus of the polypeptide. Each light chain contains one C, and one \, region at its N-terminus. The variable regions of each light and heavy chain form 
an antigen-combining site. Each Y-shaped IgG molecule contains two identical antigen-combining sites. Each IgG molecule can be fragmented by mild 
proteolytic treatment into two Fab fragments that contain the antigen-combining sites and an Fc fragment, as indicated. 


Source: (a) Courtesy of David Goodsell/RSCB PDB, 2001. 


body to another (labeled Hv in FIGURE 17.15). Light and heavy chains 
both contain three hypervariable stretches that are clustered at the 
ends of each arm on the antibody molecule. As might be expected, 
the hypervariable regions play a prominent role in forming the struc- 
ture of the antigen-combining site, which can range from a deep cleft 
to a narrow groove or a relatively flat pocket. Variations in the amino 
acid sequence of hypervariable regions account for the great diversity 
of antibody specificity, allowing these molecules to bind to antigens 
of every conceivable shape. 

The combining site of an antibody has a complementary stereo- 
chemical structure to a particular portion of the antigen, which is 
called the epitope (or antigenic determinant). Because of their 
close fit, antibodies and antigens form stable complexes, even though 
they are joined only by noncovalent forces that individually are quite 
weak. The precise interaction between a particular antigen and anti- 
body, as determined by X-ray crystallography, is shown in FIGURE 17.16. 
The two hinge regions within the molecule (Figure 17.14) provide the 
flexibility necessary for the antibody to bind to two separate antigen 
molecules or to a single molecule with two identical epitopes. 

Whereas the hypervariable portions of light and heavy chains 
determine an antibody’s combining site specificity, the remaining 
portions of the variable domains provide a scaffold that maintains 
the overall structure of the combining site. The constant portions of 
antibody molecules are also important. Different classes of antibod- 
ies (IgA, IgD, IgE, IgG, and IgM) have different heavy chains, whose 
constant regions differ considerably in length and sequence. These 
differences enable antibodies of various classes to carry out different 
biological (effector) functions. For example, heavy chains of an IgM 
molecule bind and activate one of the proteins of the complement 
system, which leads to the lysis of bacterial cells to which the IgM 
molecules are bound. Heavy chains of IgE molecules play an impor- 
tant role in allergic reactions by binding to specific receptors on the 
surfaces of mast cells, triggering the release of histamine. In contrast, 


heavy chains of an IgG molecule bind specifically to the surface 
receptors of macrophages and neutrophils, inducing these phago- 
cytic cells to ingest the particle to which the antibodies are bound. 
The heavy chains of IgG molecules are also important in allowing 
this class of antibodies to pass from the blood vessels of a mother to 
those of her fetus during pregnancy. While this provides the fetus 
and newborn with passive immunity to infectious organisms, it may 
cause a life-threatening condition called erythroblastosis fetalis. For 
this condition to occur, an Rh” mother (who lacks the Rh protein on 
her red blood cells) must have given birth to a child with an Rh* 
phenotype (Rh*/Rh™ genotype) during a previous pregnancy. The 
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C, Domain 
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FIGURE 17.15 Antibody domains. Schematic drawing of a human lambda 
light chain synthesized by cells from a patient with multiple myeloma. The 
polypeptide undergoes folding so that the constant and variable portions are 
present in separate domains. Thick arrows represent 8 strands, which are 
assembled into B sheets. Each domain has two B sheets, which are distin- 
guished by the red and orange colors. The three hypervariable (Hv) segments 
of the chain are present as loops at one end of the variable domain, which 
forms part of the antigen-combining site of the antibody. 


Source: Reprinted (adapted) with permission from Schiffer et al., 
Biochemistry 12:4628, 1973. Copyright 1973. American Chemical Society. 


FIGURE 17.16 Antigen-antibody interaction. Space-filling model based 
on X-ray crystallography of a complex between lysozyme (green) and Fab 
portions of three different antibody molecules (see Figure 17.14). The 
heavy chain of the antibody is orange; the light chain is yellow. Note that 
each Fab fragment recognizes a different portion of the lysozyme protein. 
Source: Courtesy of David Goodsell/RCSB/PDB, 2001. 


mother is usually exposed to the Rh* fetal antigen during delivery of 
the first child, who is not affected. If, however, the mother should 
have a second Rh” pregnancy, antibodies present in her blood- 
stream can enter the fetal circulation and destroy the red blood cells 
of the fetus. Babies with this condition are given a blood transfusion, 
either before (intrauterine) or after birth, which cleanses the blood 
of maternal antibodies. 
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1. Draw the basic structure of an IgG molecule bound to 
an epitope of an antigen. Label the heavy and light 
chains; the variable and constant regions of each 
chain; the regions that would contain hypervariable 
sequences. 
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17.8 DNA Rearrangements That 
Produce Genes Encoding B- 
and T-Cell Antigen Receptors 


As discussed above, each IgG molecule is composed of two light 
(L) chains and two heavy (H) chains. Both types of polypeptides 
consist of two recognizable parts—a variable (V) portion, whose 
amino acid sequence varies from one antibody species to another, 
and a constant (C) portion, whose amino acid sequence is identical 
among all H or L chains of the same class. What is the genetic basis 
for the synthesis of polypeptides having a combination of shared and 
unique amino acid sequences? 

In 1965, William Dreyer of the California Institute of Technology 
and J. Claude Bennett of the University of Alabama put forward the 


“two gene-one polypeptide” hypothesis to account for antibody 681 


structure. In essence, Dreyer and Bennett proposed that each anti- 
body chain is encoded by two separate genes—a C gene and a 
V gene—that somehow combine to form one continuous “gene” that 
codes for a single light or heavy chain. In 1976, Susumu Tonegawa, 
working at a research institute in Basel, Switzerland, provided clear 
evidence in favor of the DNA rearrangement hypothesis. The basic 
outline of the experiment is shown in FIGURE 17.17. In this experi- 
ment, Tonegawa and his colleagues compared the length of DNA 
between the nucleotide sequences encoding the C and V portions of 
a specific antibody chain in two different types of mouse cells: early 
embryonic cells and malignant, antibody-producing myeloma cells. 
The DNA segments encoding the C and V portions of the antibody 
were widely separated in embryonic DNA but were very close to 
each other in DNA obtained from antibody-producing myeloma 
cells (Figure 17.17). These findings strongly suggested that DNA 
segments encoding the parts of antibody molecules became rear- 
ranged during the formation of antibody-producing cells. 

Subsequent research revealed the precise arrangement of the 
DNA sequences that give rise to antibody genes. To simplify the dis- 
cussion, we will consider only those DNA sequences involved in the 
formation of human kappa light chains, which are located on chro- 
mosome 2. The organization of sequences in germ-line DNA (ie., 
the DNA of a sperm or egg) that are involved in the formation of 
human kappa light chains is shown in the top line (step 1) of 
FIGURE 17.18. In this case, a variety of different V,. genes are located 
in a linear array and separated from a single C, gene by some dis- 
tance. Nucleotide-sequence analysis of these V genes indicated that 
they are shorter than required to encode the V region of the kappa 
light chain. The reason became clear when other segments in the 
region were sequenced. The stretch of nucleotides that encodes the 
13 amino acids at the carboxyl end of a V region is located at some 
distance from the remainder of the V, gene sequence. This small 
portion that encodes the carboxyl end of the V region is termed the 
J segment. As shown in Figure 17.18, there are five distinct J, seg- 
ments of related nucleotide sequence arranged in tandem. The clus- 
ter of J, segments is separated from the C, gene by an additional 
stretch of over 2000 nucleotides. A complete kappa V gene is formed, 
as shown in Figure 17.18 (steps 2-3), as a specific V,. gene is joined 
to one of the J, segments with the intervening DNA excised. This 
process is catalyzed by a protein complex called V(D)J recombinase. 
As indicated in Figure 17.18, the V,. gene sequence generated by this 
DNA rearrangement is still separated from the C, gene by more than 
2000 nucleotides. No further DNA rearrangement occurs in kappa 
gene assembly prior to transcription; the entire genetic region is 
transcribed into a large primary transcript (step 4) from which 
introns are excised by RNA splicing (step 5). 

Rearrangement begins as double-stranded cuts are made in the 
DNA between a V gene and a J gene. The cuts are catalyzed by a pair 
of proteins called RAG1 and RAG2 that are the working parts of the 
V(D)J recombinase. The four free ends that are generated are then 
joined in such a way that the V and J coding segments are linked to 
form an exon that encodes the variable region of the polypeptide 
chain, while the two ends of the intervening DNA are linked to form 
a small circular piece of DNA that is displaced from the chromo- 
some (Figure 17.18, step 3). Joining of the broken DNA ends is 
accomplished by the same basic process used to repair DNA strand 
breaks that was depicted in Figure 13.28. 

The rearrangement of Ig DNA sequences has important conse- 
quences for a lymphocyte. Once a specific V,. sequence is joined to a 
J,. sequence, no other species of kappa chain can be synthesized by 
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FIGURE 17.17 Experimental demonstration that genes encoding 
antibody light chains are formed by DNA rearrangement. DNA is first 
extracted from either embryonic cells or antibody-producing cancer cells 
and fragmented by a restriction endonuclease (step 1) that cleaves both 
strands of the DNA at a specific sequence. The DNA fragments from the 
two preparations are fractionated separately by gel electrophoresis; two 
identical gels are prepared from each DNA sample (step 2). Following 
electrophoresis, each of the gels is incubated with a labeled probe 
containing either the variable (V) or constant (C) gene sequence (step 3). 
The location of the bound, labeled DNA in the gel is revealed by autoradi- 
ography and shown at the bottom of the figure. Whereas the V and C gene 
sequences are located on separate fragments in DNA from embryonic 
cells, the two sequences are located on the same small fragment in DNA 
from the antibody-producing cells. The V and C gene sequences are 
brought together during B-cell development by a process of DNA 
rearrangement. 
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FIGURE 17.18 DNA rearrangements that lead to the formation of a 
functional gene that encodes an immunoglobulin « chain. The organiza- 
tion of the variable (V), joining (J), and constant (C) DNA sequences within 
the genome is shown in step 1. Steps leading to the synthesis of the mature 
mRNA that encodes the « chain polypeptide are described in the text. The 
random union of a V and J segment (steps 2 and 3) determines the amino 
acid sequence of the polypeptide. The space between the “chosen” J 
segment and the C segment (which may contain one or more J segments, 
as shown in the figure) remains as an intron in the gene. The portion of the 
primary transcript (step 4) that corresponds to this intron is removed during 
RNA processing (step 5). DNA rearrangement and subsequent transcription 
of the “stitched” gene occurs on only one allele in each cell, which ensures 
that the cell will only express a single « chain. The other allele on the 
homologous chromosome typically remains unaltered and is not transcribed. 


that cell. It is estimated that the DNA of human germ cells contains 
approximately 40 functional V,. genes. Thus, if we assume that any 
V sequence may join to any J sequence, we expect that a person can 
synthesize approximately 200 different kappa chains (5 J, segments 
x 40 V,. genes). But this is not the only source of diversity among 
these polypeptides. The site at which a J sequence is joined to a 
V sequence can vary somewhat from one rearrangement to another 
so that the same V, and J, genes can be joined in two different cells 
to produce kappa light chains having different amino acid sequences. 
Additional variability is achieved by the enzyme deoxynucleotidyl 
transferase, which inserts nucleotides at the sites of strand breakage. 
These sources of additional variability increase the diversity of kappa 
chains an additional tenfold, bringing the number to at least 
2000 species. The site at which V and J sequences are joined is part 
of one of the hypervariable regions of each antibody polypeptide 


(Figure 17.15). Thus, slight differences at a joining site can have 
important effects on antibody-antigen interaction. 

We have restricted our discussion to kappa light chains for the 
sake of simplicity. Similar types of DNA rearrangement occur during 
the commitment of a cell to the synthesis of a particular lambda light 
chain and to a particular heavy chain. Whereas the variable regions of 
light chains are formed from two distinct segments (V and J seg- 
ments), the variable regions of heavy chains are formed from three 
distinct segments (V, D, and J segments) by similar types of rearrange- 
ment. DNA rearrangement of the heavy chain locus precedes DNA 
rearrangement of the light chain locus. The human genome contains 
51 functional V,, segments, 25 D,, segments, and 6 J,, segments. 
Given the additional diversity stemming from the variability in V,,- 
D, and D,,-J,, joining, a person is able to synthesize at least 100,000 
different heavy chains. The antigen receptors of T cells (TCRs) also 
consist of a type of heavy and light chain, whose variable regions are 
formed by a similar process of DNA rearrangement. 

The formation of antibody genes by DNA rearrangement illus- 
trates the potential of the genome to engage in dynamic activities. 
Because of this rearrangement mechanism, a handful of DNA 
sequences that are present in the germ line can give rise to a remark- 
able diversity of gene products. As discussed above, a person synthe- 
sizes roughly 2000 different species of kappa light chains and 100,000 
different species of heavy chains. If any kappa light chain can com- 
bine with any heavy chain, a person can theoretically produce more 
than 200 million different antibody species from a few hundred 
genetic elements present in the germ line.* 

We have seen how antibody diversity arises from (1) the presence 
of multiple V exons, J exons, and D exons in the DNA of the germ line, 
(2) variability in V-J and V-D-J joining, and (3) the enzymatic inser- 
tion of nucleotides. An additional mechanism for generating antibody 
diversity, referred to as somatic hypermutation, occurs long after DNA 
rearrangement is complete. When a specific antigen is reintroduced 
into an animal following a period of time, antibodies produced during 
the secondary response have a much greater affinity for the antigen 
than those produced during the primary response. Increased affinity 
is due to small changes in amino acid sequence of variable regions of 
the heavy and light antibody chains. These sequence changes result 
from mutations in the genes that encode these polypeptides. It is esti- 
mated that rearranged DNA elements encoding antibody V regions 
have a mutation rate 10° times greater than that of other genetic loci in 
the same cell. Included in the mechanism responsible for this increased 
level of V-region mutation is (1) an enzyme—known as activation- 
induced cytosine deaminase (AID)—that converts cytosine residues 
in DNA into uracil residues leading to U:G mismatches and subse- 
quent mutations during DNA repair and (2) one or more translesion 
DNA polymerases (page 535) that tend to make errors when DNA 
containing uracils is copied or repaired. Persons who carry mutations 
in AID and are unable to generate somatic hypermutation are plagued 
by infections and often die at an early age. 

Somatic hypermutation generates random changes in the 
V regions of Ig genes. Those B cells whose genes produce Ig mole- 
cules with greater antigen affinity are preferentially selected follow- 
ing antigen reexposure. Selected cells proliferate to form clones that 


>A roughly comparable number of antibodies containing lambda light chains can also 
be generated. 


VDJ Cu Cs Cal Cy2 Cy4 Ce Caz 


FIGURE 17.19 Arrangement of the C genes for the various human 
heavy chains. In humans, the heavy chains of IgM, IgD, and IgE are 
encoded by a single gene, whereas those of IgG are encoded by four 
different genes and IgA by two different genes (see footnote, page 679). 


undergo additional rounds of somatic mutation and selection, 
whereas nonselected cells that express low affinity Igs undergo apop- 
tosis. In this way, the antibody response to recurrent or chronic 
infections improves markedly over time. 

Once a B cell is committed to form a specific antibody, it can 
switch the class of Ig it produces (e.g., from IgM to IgG) by changing 
the heavy chain produced in the cell. This process, known as class 
switching, occurs without changing the combining site of the anti- 
bodies synthesized. Recall that there are five different types of heavy 
chains distinguished by their constant regions. The genes that encode 
the constant regions of heavy chains (C,, portions) are clustered 
together in a complex, as shown in FIGURE 17.19. Class switching is 
accomplished by moving a different Cy gene next to the VDJ gene 
that was formed previously by DNA rearrangement. Class switching 
is under the direction of cytokines secreted by helper T cells during 
their interaction with the B cell producing the antibody molecule. 
For example, a helper T cell that secretes IFN-y, induces a switch in 
the adjacent B cell from its initial synthesis of IgM to synthesis of one 
of the IgG classes (Figure 17.13). Class switching allows a lineage of 
B cells to continue to produce antibodies having the same specificity 
but different effector functions (page 680). 


REVIEW | 


1. Draw the basic arrangement of the genes of the germ 
line that are involved in encoding the light and heavy 
chains of an lgG molecule. How does this differ from 
their arrangement in the genome of an antibody- 
producing cell? What steps occur to bring about this 
DNA rearrangement? 


2. Give three different mechanisms that contribute to 


variability in the V regions of antibody chains. 
o______1 


17.9 Membrane-Bound 
Antigen Receptor Complexes 


The recognition of antigen by both B and T lymphocytes occurs at 
the cell surface. An antigen receptor on a B cell (a B-cell receptor, or 
BCR) consists of a membrane-bound immunoglobulin that binds 
selectively to a portion of an intact antigen (ie., the epitope) 
(FIGURE 17.20a). In contrast, the antigen receptor on a T cell 
(a T-cell receptor, or TCR, Figure 17.20b) recognizes and binds to a 
small fragment of an antigen, typically a peptide about 7 to 25 amino 
acids in length, that is held at the surface of another cell (described 
below). Both types of antigen receptors are part of large membrane- 
bound protein complexes that include invariant proteins as depicted 
in Figure 17.20. The invariant polypeptides associated with BCRs 
and TCRs play a key role in transmitting signals to the interior that 
lead to changes in activity of the B cell or T cell. 
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Heavy chain 


(a) (b) 


FIGURE 17.20 Structure of the antigen receptors of a B cell anda 

T cell. (a) The BCR of a B cell is a membrane-bound version of an immuno- 
globulin associated with a pair of invariant chains and a pair of invariant 

B chains. The o and B chains are also members of the Ig superfamily. (b) The 
TCR of a T cell consists of an o and B polypeptide chain linked to one another 
by a disulfide bridge. Each polypeptide contains a variable domain that 
forms the antigen-binding site and a constant domain. The TCR is associ- 
ated with six other invariant polypeptides of the CD3 protein as indicated in 
the illustration. (A small fraction of T cells contain a different type of TCR 
consisting of a y and 6 subunit. These cells are not restricted to recognition 
of MHC-peptide complexes, and their function is not well understood.) 


Each subunit of a TCR contains two Ig-like domains, indicating 
that they share a common ancestry with BCRs. Like the heavy and 
light chains of the immunoglobulins, one of the Ig-like domains of 
each subunit of a TCR has a variable amino acid sequence; the other 
domain has a constant amino acid sequence (Figure 17.20). X-ray 
crystallographic studies have shown that the two types of antigen 
receptors also share a similar three-dimensional shape. 


REVIEW | 


1. Compare and contrast the structure of antigen 
receptors on B and T cells. 


eo  — 


17.10 The Major 
Histocompatibility Complex 


During the first part of the twentieth century, clinical researchers dis- 
covered that blood could be transfused from one person to another, 
as long as the two individuals were compatible for the ABO blood 
group system. The success of blood transfusion led to the proposal 
that skin might also be grafted between individuals. This idea was 
tested during World War II when skin grafts were attempted on pilots 
and other military personnel who had received serious burns. The 
grafts were rapidly and completely rejected. After the war, researchers 
set out to determine the basis for tissue rejection. It was discovered 
that skin could be grafted successfully between mice of the same 
inbred strain, but that grafts between mice of different strains were 
rapidly rejected. Mice of the same inbred strain are like identical 
twins; they are genetically identical. Subsequent studies revealed that 
the genes that governed tissue graft rejection were clustered in a 
region of the genome that was named the major histocompatibil- 
ity complex (MHC). Approximately 20 different MHC genes have 


been characterized, most of which are highly polymorphic: Over 
7,000 different alleles of MHC (i.e., HLA) genes have been identified, 
far more than any other loci in the human genome. It is very unlikely, 
therefore, that two individuals in a population have the same 
combination of MHC alleles. This is the reason that transplanted 
organs are so likely to be rejected and why transplant patients are 
given drugs, such as cyclosporin A, to suppress the immune system 
following surgery. Cyclosporin A is a cyclic peptide produced by a soil 
fungus. Cyclosporin A inhibits a particular phosphatase in the signal- 
ing pathway leading to the production of cytokines required for T-cell 
activation. Although these drugs help prevent graft rejection, they 
make patients susceptible to opportunistic infections similar to those 
that strike people with immunodeficiency diseases such as AIDS. 

It is obvious that proteins encoded by the MHC did not evolve 
to prevent indiscriminate organ transplantation, which raises the 
question of their normal role. Long after their discovery as trans- 
plantation antigens, MHC proteins were shown to be involved in 
antigen presentation. Some of the key experiments that led to our 
current understanding of antigen presentation are discussed in the 
Experimental Pathways (17.5). 

It was noted earlier that T cells are activated by an antigen that 
has been dissected into small peptides and displayed on the surface of 
an antigen-presenting cell (an APC). These small fragments of anti- 
gen are held at the surface of the APC in the grip of MHC proteins. 
Each species of MHC molecule can bind a large number of different 
peptides that share certain structural features that allow them to fit 
into its binding site (see Figure 17.24). For example, all of the peptides 
that are capable of binding to a protein encoded by a particular MHC 


Peptide —® 


(a) 


(b) 


FIGURE 17.21 Human APCs can present large numbers of peptides. 
(a) Schematic model of the variety of class | MHC molecules an individual 
might possess. This class of MHC protein is encoded by a number of 
genes, three of which are represented by a large number of alleles. This 
particular individual is heterozygous at the HLA-A, -B, and -C loci and 
homozygous at the HLA-E, -F, and -G loci, which gives them a total of nine 
different MHC class | molecules. (The difference between class | and Il MHC 
is discussed on page 685.) (b) Schematic model illustrating the variety of 
peptides that can be presented by the protein encoded by a single MHC 
allele. (The term HLA is an acronym for human leukocyte antigen reflecting 
the discovery of these proteins on the surface of leukocytes.) 


allele, such as HLA-B8, may contain a specific amino acid at a certain 
position, which allows it to fit into the peptide-binding groove. 

Given the fact that each individual expresses a number of differ- 
ent MHC proteins (as in FIGURE 17.21a), and each MHC variant may 
be able to bind large numbers of different peptides (as in Figure 
17.21b), a dendritic cell or macrophage should be able to display a 
vast array of peptides. At the same time, not every person is capable 
of presenting every possible peptide in an effective manner, which is 
thought to be a major factor in determining differences in suscepti- 
bility in a population to different infectious diseases, including AIDS. 
For example, the HLA-B*35 allele is associated with rapid progres- 
sion to full-blown AIDS and the HLA-DRB1*1302 allele is correlated 
with resistance to a certain type of malaria and hepatitis B infection. 
The MHC alleles present in a given population have been shaped by 
natural selection. Those persons who possess MHC alleles that are 
best able to present the peptides of a particular infectious agent will 
be the most likely to survive an infection by that agent. Conversely, 
persons who lack these alleles are more likely to die without passing 
on their alleles to offspring. As a result, populations tend to be more 
resistant to diseases to which their ancestors have been routinely 
exposed. This could explain why Native American populations have 
been devastated by certain diseases, such as measles, that produce 
only mild symptoms in persons of European ancestry. 

The entire process of T-cell-mediated immunity rests on the basis 
that small peptides derived from the proteins of a pathogen differ in 
structure from those derived from the proteins of the host. Consequently, 
one or more peptides held at the surface of an APC serve as a small 
representation of the pathogen, providing the cells of the immune sys- 
tem with a “glimpse” of the type of pathogen that is hidden within the 
cytoplasm of the infected cell. Nearly any cell in the body can function 
as an APC. Most cells present antigen as an incidental activity that 
alerts the immune system to the presence of a pathogen, but some 


professional APCs (e.g., dendritic cells, macrophages, and B cells) are 
specialized for this function, as discussed later in the chapter. 

When a T cell interacts with an APC, it does so by having its 
TCRs dock onto the MHC molecules projecting from the APC’s sur- 
face (FIGURE 17.22b). This interaction brings a TCR of a T cell into an 
orientation that allows it to recognize the specific peptide displayed 
within a groove of an MHC molecule (Figure 17.224). The interac- 
tion between MHC proteins and TCRs is strengthened by additional 
contacts that form between cell-surface components, such as occur 
between CD4 or CD8 molecules on a T cell and MHC proteins on an 
APC (Figure 17.22b). This specialized region that develops between 
aT cell and an APC is referred to as an immunological synapse. 

MHC proteins can be subdivided into two major groups, 
MHC class | and MHC class II molecules. MHC class I molecules 
consist of one polypeptide chain encoded by an MHC allele (known 
as the heavy chain) associated noncovalently with a non-MHC 
polypeptide known as B,-microglobulin (see Experimental Pathway, 
Figure la, page 674). Differences in the amino acid sequence of the 
heavy chain are responsible for dramatic changes in the shape of the 
molecule’s peptide-binding groove. MHC class II molecules also 
consist of a heterodimer, but both subunits are encoded by MHC 
alleles. Both classes of MHC molecule as well as B,-microglobulin 
contain Ig-like domains and are thus members of the immunoglobulin 
superfamily. Whereas most cells of the body express MHC class I 
molecules on their surface, MHC class I molecules are expressed 
primarily by professional APCs. 

The two classes of MHC molecules display antigens that origi- 
nate from different sites within a cell, though some overlap can 
occur. MHC class I molecules are predominantly responsible for dis- 
playing antigens that originate within the cytosol of a cell, that is, 
endogenous proteins. In contrast, MHC class II molecules primarily 
display fragments of exogenous antigens that are taken into a cell by 


(a) 


FIGURE 17.22 Interaction between an antigen-presenting cell and a T cell during antigen presentation. (a) Micrograph showing the immunologic 
synapse between an APC and an engineered T cell. The antigen presenting cell is shown in blue and the T cell is shown in red. The localization of ZAP70, a 


kinase that is found in the immunological synapse, is shown in green. (b) Schematic model showing some of the proteins present in the immunologic 


synapse that forms in the region of interaction between an APC and a cytotoxic T lymphocyte (CTL) or helper T (Ty) cell. Antigen recognition occurs as the 
TCR of the T cell recognizes a peptide fragment of the antigen bound to a groove in an MHC molecule of the APC. As discussed in the text, CTLs recognize 


antigen in combination with an MHC class | molecule, whereas Ty cells recognize antigen in combination with an MHC class I] molecule. CD8 and CD4 are 
integral membrane proteins expressed by the two types of T cells that bind MHC class | and class Il molecules, respectively. CD8 and CD4 are described as 


coreceptors. (Numerous other proteins present in these cell-cell interaction zones are not shown; see Nat. Revs. Immunol. 11:672, 2011.) 
Source: (a) From J.R. James et al., Nature 487:64—-69. (b) from L. Chantenoud, Mol. Med. Today 4:26, 1998, with permission from Elsevier. 
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MHC molecules pick up their antigen fragments and display them 
on the plasma membrane are described here and shown in 
FIGURE 17.23. 


e Processing of class IMHC-peptide complexes (Figure 17.23a). 
Antigens located in the cytosol of an APC are degraded into 
short peptides by proteases that are part of the cell’s proteasomes 
(page 510). These proteases cleave cytosolic proteins into 
fragments approximately to 10 residues long that are suitable for 
binding within a groove of an MHC class I molecule (Figure 
17.24a). The peptides are then transported across the membrane 
of the rough endoplasmic reticulum and into its lumen by a 
dimeric protein called TAP (Figure 17.23a). Once in the ER 
lumen, the peptide can bind to a newly synthesized MHC class I 
molecule, which is an integral protein of the ER membrane. The 
MHC-peptide complex moves through the biosynthetic pathway 
(Figure 8.2b) until it reaches the plasma membrane where the 
peptide is displayed. 

e Processing of class II MHC-peptide complexes (Figure 17.23). 
MHC class II molecules are also synthesized as membrane 
proteins of the RER, but they are joined noncovalently to a protein 
called Ii, which blocks the peptide-binding site of the MHC 
molecule (Figure 17.23b). Following its synthesis, the MHC class 
II-Ii complex moves out of the ER along the biosynthetic pathway, 
directed by targeting sequences located within the cytoplasmic 
domain of Ii. MHC class I and II molecules are thought to separate 
from one another in the trans Golgi network (TGN), which is the 
primary sorting compartment along the biosynthetic pathway 
(Section 8.11). An MHC class I-peptide complex is directed 
toward the cell surface, whereas an MHC class H-Ii complex is 
directed into a transport vesicle that fuses with a lysosome. There, 
the Ii protein is digested by acid proteases, leaving only a small 
fragment (CLIP) associated in the peptide-binding groove. CLIP is 
subsequently exchanged with one of the peptides derived from the 
digestion of antigens that were taken into the cell by way of the 
endocytic pathway (Figure 17.23b).4 The MHC class II-peptide 
complex is then moved to the plasma membrane where it is 
displayed, as shown in FIGURE 17.24b. 


Once on the surface of an APC, MHC molecules direct the cell’s 
interaction with different types of T cells (Figure 17.22). Cytotoxic T 
lymphocytes (CTLs) recognize their antigen in association with MHC 
class I molecules; they are said to be MHC class I restricted. Under 
normal circumstances, cells of the body that come into contact with 
CTLs display fragments of their own normal proteins in association 
with their MHC class I molecules. Normal cells displaying normal 
protein fragments are ignored by the body’s T cells because T cells 
capable of binding with high affinity to peptides derived from normal 
cellular proteins are eliminated as they develop in the thymus. In con- 
trast, when a cell is infected, it displays fragments of viral proteins in 
association with its MHC class I molecules. These cells are recognized 
by CTLs bearing TCRs whose binding sites are complementary to the 
viral peptides, and the infected cell is destroyed. The presentation of a 
single foreign peptide on the surface of a cell is probably sufficient to 
invite an attack by a CTL. Because virtually all cells of the body express 
MHC class I molecules on their surface, CTLs can combat an infection 


“Peptides generated in lysosomes and attached to MHC class II molecules tend to be 
longer (10 to 25 residues) than those generated in proteasomes and attached to MHC 
class I molecules (typically 8 to 10 residues). 


regardless of the type of cell affected. CTLs may also recognize and 
destroy cells that display abnormal (mutated) proteins on their sur- 
faces, which could play a role in the elimination of potentially life- 
threatening tumor cells. 

In contrast to CTLs, helper T cells recognize their antigen in 
association with MHC class II molecules; they are said to be MHC 
class II restricted. As a result, helper T cells are activated primarily by 
exogenous (i.e., extracellular) antigens (Figure 17.23b), such as those 
that occur as part of bacterial cell walls or bacterial toxins. MHC 
class II molecules are found predominantly on B cells, dendritic 
cells, and macrophages. These are the lymphoid cells that ingest for- 
eign, extracellular materials and present the fragments to helper 
T cells. Helper T cells that are activated in this way can then stimu- 
late B cells to produce soluble antibodies that bind to the exogenous 
antigen wherever it is located in the body. 


REVIEW | 


1. Describe the steps in processing a cytosolic antigen in 
an APC. What is the role of MHC proteins in this 
process? 


2. Compare and contrast the roles of an MHC class | and 
class Il protein molecule. What types of APCs utilize 
each class of MHC molecule, and what types of cells 
recognize them? 


——— 


17.11 Distinguishing Self 
from Nonself 


T cells gain their identity in the thymus. When a stem cell migrates 
from the bone marrow to the thymus, it lacks the cell-surface pro- 
teins that mediate T-cell function, most notably its TCRs. The stem 
cells proliferate in the thymus to generate a population of T-cell pro- 
genitors. Each of these cells then undergoes the DNA rearrange- 
ments that enable it to produce a specific TCR. These cells are then 
subjected to a complex screening process in the thymus that selects 
for cells having potentially useful T-cell receptors (FIGURE 17.25). 
Studies suggest that epithelial cells of the thymus produce small 
quantities of a great variety of proteins normally restricted to other 
tissues throughout the body. Production of these tissue-specific 
antigens is under the control of a special transcriptional regulator 
(called AIRE) that is present only in the thymus. According to this 
model, the thymus re-creates an environment in which developing 
T cells can sample proteins containing a vast array of the body’s own 
unique epitopes. T cells whose TCRs have a high affinity for peptides 
derived from the body’s own proteins are destroyed (Figure 17.25a). 
This process of negative selection greatly reduces the likelihood that 
the immune system will attack its own tissues. Humans lacking a 
functional AIRE gene suffer from a severe autoimmune disease 
(called APECED), in which numerous organs come under immuno- 
logic attack. 

The generation of T cells requires more than negative selection. 
When a TCR interacts with a foreign peptide on the surface of an 
APC, it must recognize both the peptide and the MHC molecule 
holding that peptide (discussed in Section 17.5). Consequently, 
T cells whose TCRs do not recognize self- MHC molecules are of 
little value. The immune system screens out such cells by requiring 
that T cells recognize self-MHC-self-peptide complexes with low 
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FIGURE 17.23 Classical processing pathways for antigens that become associated with MHC class | and class II molecules. (a) A proposed pathway 
for the assembly of an MHC class I-peptide complex. This pathway occurs in almost all cell types. In step 1, the heavy chain of the MHC protein is synthe- 
sized by a membrane-bound ribosome and translocated into the ER membrane. The MHC heavy chain becomes associated with calnexin (step 2), a 
chaperone in the ER membrane, and the dimeric complex binds the invariant Bm chain (step 3). The MHC complex then becomes associated with another 
ER membrane protein, TAP (step 4). Meanwhile, cytosolic antigens are taken into proteasomes (step A) and degraded into small peptides (step B). The 
peptides are transported into the ER lumen by the TAP protein, where they are trimmed to their final length by an ER peptidase (not shown). These 
peptides then become bound within the groove of the MHC molecule (step 5) with the help of a large complex of chaperones that is labeled PLC in the 
figure. PLC and calnexin dissociate from the MHC complex (step 6), which is transported along the biosynthetic/secretory pathway through the Golgi 
complex (step 7) to the plasma membrane, where it is ready to interact with the TCR of a CTL. (b) A proposed pathway for the assembly of an MHC class 
lI-peptide complex. This pathway occurs in dendritic cells and other professional APCs. In step 1, the MHC protein is synthesized by a membrane-bound 
ribosome and translocated into the ER membrane, where it becomes associated with li (step 2), a trimeric protein that blocks the MHC-peptide-binding 
site. The MHC complex passes through the Golgi complex (step 3) and into a transport vesicle (step 4). Meanwhile, an extracellular protein antigen is taken 
into the APC by endocytosis (step A) and delivered to a lysosome (step B), where the antigen is fragmented into peptides. The lysosome containing 
antigenic fragments fuses with the transport vesicle containing the MHC-li complex (step 5), leading to the degradation of the li protein and association 
between the antigenic peptide fragment and the MHC class II molecule (step 6). The MHC-peptide complex is transported to the plasma membrane 

(step 7) where it is ready to interact with the TCR of a Ty cell. (Note: Not all exogenous antigens follow the classical MHC class I! pathway shown in part b. 
A pathway also exists by which exogenous antigens can be taken into APCs by endocytosis and degraded into peptides, which are then bound and 
displayed by MHC class | molecules. This cross-presentation pathway, as it is called, allows CTLs to become activated by exogenous antigen that would 
otherwise go “unseen.”) 


Source: (a, b) After D. B. Williams et al., Trends Cell Biol. 6:271, 1996. Trends in Cell Biology by Elsevier Ltd. Reproduced with permission of Elsevier Ltd. in 
the format reuse in a book/textbook via Copyright Clearance Center. 
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(a) 


FIGURE 17.24 Peptides produced by antigen processing bind within a groove of the MHC protein molecule. These models illustrate the binding of 
peptides to an MHC class | (a) and MHC class II (b) molecule. The molecular surfaces of the MHC molecules are shown in white and the peptide in the 
peptide-binding site in color. The peptide in a is derived from the influenza virus matrix protein, and the peptide in b is derived from influenza virus 


hemagglutinin protein. The N-terminus of each peptide is on the left. 


Source: (a, b) Courtesy of T. Jardetzky; Trends Biochem. Sci. 22:378, 1997, Fig. 1, with permission from Elsevier Science. 


affinity. T cells whose TCRs are unable to recognize self- MHC com- 
plexes die within the thymus due to a lack of growth signals—a pro- 
cess referred to as “death by neglect” (Figure 17.25b). In contrast, T 
cells whose TCRs exhibit weak (low affinity) recognition toward 
self-MHC complexes are stimulated to remain alive but are not acti- 
vated (Figure 17.25c). This process of selective survival is referred to 
as positive selection. It is estimated that less than 5 percent of thymic 
T cells survive these screening events.” 

Those cells that recognize class I MHC molecules are thought to 
develop into cytotoxic (CD4 CD8*) T lymphocytes, whereas those 
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FIGURE 17.25 Determining the fate of a newly formed T cell in the 
thymus. A screening process takes place in the thymus that selects for 

T cells with appropriate TCRs. (a) Those T cells whose TCR exhibits strong 
affinity for MHC molecules bearing self-peptides are eliminated by 
apoptosis (negative selection). (b) Those T cells whose TCR fails to 
recognize MHC molecules bearing self-peptides also die by apoptosis 
(death by neglect). (c) In contrast, those T cells whose TCR exhibits weak 
affinity for MHC molecules bearing self-peptides survive (positive selection) 
and ultimately leave the thymus to constitute the peripheral T-cell 
population of the body. 


that recognize class II MHC molecules are thought to differentiate 
into helper (CD4* CD8°) T lymphocytes. Both types of T cells leave 
the thymus and circulate for extended periods through the blood 
and lymph. T cells at this stage are described as naive T cells because 
they have not yet encountered the specific antigen to which their 
TCR can bind. As they pass through the lymphoid tissues, naive 
T cells come into contact with various cells that either maintain their 
survival in a resting state or trigger their activation. 

As they percolate through the lymph nodes and other tissues, 
T cells scan the surfaces of cells for the presence of an inappropriate 
peptide bound to an MHC molecule. CD4* T cells are activated by a 
foreign peptide bound to a class II MHC molecule, whereas CD8* 
T cells are activated by foreign peptides bound to a class I MHC mol- 
ecule. CD8* T cells also respond strongly to cells bearing nonself- 
MHC molecules, such as the cells of a transplanted organ from a 
mismatched donor. In this latter case, they initiate a widespread attack 
against the graft cells that can result in organ rejection. Under normal 
physiologic conditions, autoreactive lymphocytes (i.e., lymphocytes 
capable of reacting to the body’s own tissues) are prevented from 
becoming activated by a number of poorly understood mechanisms 
that operate outside of the thymus in the body’s periphery. As dis- 
cussed in the Human Perspective, a breakdown in these mechanisms 
of peripheral tolerance leads to the production of autoantibodies and 
autoreactive T cells that can cause chronic tissue damage. 


REVIEW | 


1. Describe the role of negative selection and positive 


selection in T cell development. 
e 


5B cells are also subjected to selective processes that lead to the death or inactivation 
of cells capable of producing autoreactive antibodies. In some cases, the light chains 
of autoreactive antibodies can be replaced by a new light chain encoded by an Ig gene 
that has been secondarily rearranged in a process called receptor editing. 


17.12 Lymphocytes Are Activated 
by Cell-Surface Signals 


Lymphocytes communicate with other cells through an array of cell- 
surface proteins. As discussed above, T-cell activation requires an 
interaction between the TCR of the T cell and an MHC-peptide 
complex on the surface of another cell. This interaction provides 
specificity, which ensures that only T cells that bind the antigen are 
activated. T-cell activation also requires a second signal, called the 
costimulatory signal, which is delivered through a second type of 
receptor on the surface of a T cell. This receptor is distinct and spa- 
tially separate from the TCR. Unlike the TCR, the receptor that 
delivers the costimulatory signal is not specific for a particular anti- 
gen and does not require an MHC molecule to bind. The best stud- 
ied of these interactions occur between helper T cells and professional 
antigen-presenting cells (e.g., dendritic cells and macrophages). 


Activation of Helper T Cells 
by Professional APCs 


T, cells recognize antigen fragments on the surface of dendritic cells 
and macrophages that are lodged in the binding cleft of MHC class 
II molecules. A costimulatory signal is delivered to a T,; cell as the 
result of an interaction between a protein known as CD28 on the 
surface of the T,, cell and a member of the B7 family of proteins on 
the surface of the APC (FIGURE 17.26a). The B7 protein appears 
on the surface of the APC after the phagocyte ingests foreign anti- 
gen. If the T,, cell does not receive this second signal from the APC, 
rather than becoming activated, the T,, cell either becomes nonre- 
sponsive (anergized) or is stimulated to undergo apoptosis. Because 
professional APCs are the only cells capable of delivering the costim- 
ulatory signal, they are the only cells that can initiate a T,, response. 
As a result, normal cells of the body that bear proteins capable of 
combining with the TCRs of a T cell cannot activate T,, cells. Thus, 
the requirement by a Ty cell for two activation signals protects nor- 
mal body cells from autoimmune attack involving T,, cells. 

Prior to its interaction with an APC, a T,, cell can be described 
as a quiescent cell, that is, one that has withdrawn from the cell cycle 
(a Go cell, page 542). Once it receives the dual activation signals, a 
T,, cell is stimulated to reenter the G, phase of the cell cycle and 
eventually to progress through S phase into mitosis. Thus, interac- 
tion of T cells with a specific antigen leads to the proliferation (clonal 
expansion) of those cells capable of responding to that antigen. In 
addition to triggering cell division, activation ofa T,, cell causes it to 
synthesize and secrete cytokines (most notably IL-2). Cytokines pro- 
duced by activated T,, cells act on other cells of the immune system 
(including B cells and macrophages) and also back on the T,; cells 
that secreted the molecules. The source and function of various 
cytokines were indicated in Table 17.1. 

We have seen in this chapter how immune responses are stimu- 
lated by ligands that activate receptor signaling pathways. But many 
of these events are also influenced by inhibitory stimuli, so that the 
ultimate response by the cell is determined by a balance of positive 
and negative influences. For example, interaction between CD28 
and a B7 protein delivers a positive signal to the T cell leading to its 
activation. Activation of the T cell leads to the trafficking of a protein 
called CTLA4 from intracellular membranes to the cell surface. 
CTLA4 is similar to CD28 in structure and also interacts with 
B7 proteins of the APC. Unlike the CD28-B7 interaction, however, 


contact between CTLA4 and B7 leads to inhibition of the T cell's 
response, rather than activation. CTLA4 is thought to play a role in 
maintaining peripheral T cell tolerance. The need for balance 
between activation and inhibition is most evident in mice that have 
been genetically engineered to lack the gene encoding CTLA4. 
These mice die as a result of massive overproliferation of T cells. As 
noted on page 670, insights into CTLA4 function have recently been 
put to clinical use. 


Activation of B Cells by T,, Cells 


T, cells bind to B cells whose receptors recognize the same antigen. 
The antigen initially binds to the immunoglobulin (BCR) at the 
B-cell surface. Antigens taken up by B cells can be soluble proteins 
from the extracellular medium or proteins bound to the plasma 
membrane of other cells. In the latter case, the B cell obtains the 
antigen by spreading itself over the outer surface of the target cell 
and collecting the BCR-antigen complexes into a central cluster. The 
bound antigen is then taken into the B cell, where it is processed 
enzymatically, and its fragments are displayed in combination with 
MHC class II molecules (Figure 17.26b). The B cell with displayed 
antigenic peptides recruits a T,, cell with the appropriate (cognate) 
TCR. Recognition of the peptide fragment by the TCR leads to acti- 
vation of the T,, cell, which responds by activating the B cell. 
Activation of a B cell follows transmission of several signals from the 
T,, cell to the B cell. Some of these signals are transmitted directly 
from one cell surface to the other through an interaction between 
complementary proteins, such as CD40 and CD40 ligand (CD40L) 
(Figure 17.26b). Binding between CD40 and CD4OL generates sig- 
nals that help move the B cell from the G, resting state back into the 
cell cycle. Other signals are transmitted by cytokines released by the 
T cell into the immunologic synapse separating it from the nearby 
B cell. This process is not unlike the way neurotransmitters act across 
a neural synapse (Section 4.18). Cytokines released by the T cells 
into the immunologic synapse include IL-4, IL-5, IL-6, and IL-10. 
Interleukin 4 is thought to stimulate the B cell to switch from pro- 
ducing the IgM class to producing the IgG or IgE class. Other 
cytokines induce the proliferation, differentiation, and secretory 
activities of B cells. 


REVIEW | 


1. Describe the steps between the stage when a 
bacterium is ingested by a macrophage and the stage 
when plasma cells are producing antibodies that bind 


to the bacterium and neutralize its infectivity. 
@O 


17.13 Signal Transduction 
Pathways in Lymphocyte 
Activation 


We saw in Chapter 15 how hormones, growth factors, and other 
chemical messengers bind to receptors on the outer surfaces of target 
cells to initiate a process of signal transduction, which transmits 
information into a cell’s internal compartments. We saw in that 
chapter how a large variety of extracellular messenger molecules 
transmit information along a small number of shared signal 
transduction pathways. The stimulation of lymphocytes occurs by a 
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FIGURE 17.26 Lymphocyte activation. (a) Schematic drawing depicting the interaction between a professional APC, in this case a mature dendritic cell, 
and a Ty cell. Specificity in this cell-cell interaction derives from recognition by the TCR on the Ty cell of the MHC class Il-peptide complex displayed on the 
surface of the dendritic cell. Interaction between CD28 of the T cell and a B7 protein of the dendritic cell provides a nonspecific costimulatory signal that is 
required for T-cell activation. (Inset) Scanning electron micrograph of a mature dendritic cell (orange) presenting antigen to a number of T cells (green). 

(b) Schematic drawing of the interaction between an activated Ty cell and a B cell. Specificity in this cell-cell interaction derives from recognition by the TCR 
on the Ty cell of the MHC class II-peptide complex displayed on the surface of the B cell. The peptide displayed by the B cell is derived from protein 
molecules that had initially bound to the BCRs at the cell surface. These bound antigens are taken up by endocytosis, fragmented in lysosomes, and bound 


to MHC class II molecules, as in Figure 17.23b. 


Source: (a, b) E. Lindhout et al., Immunol Today 18:574, 1997. Immunology Today by Elsevier Sci Ltd/U. K. Reproduced with permission of Elsevier Sci Ltd/ 
U.K. in the format reuse in a book/textbook via Copyright Clearance Center. Inset photo; From Peter Friedl, Nature Revs. Immunology 5:533, 2005, Fig. 1a, 


middle image; reprinted by permission from Macmillan Publishers Limited. 


similar mechanism and utilizes many of the same components 

employed by hormones and growth factors that act on other cell types. 

When a T cell is activated by a dendritic cell, or a B cell is acti- 
vated by aT, cell, signals are transmitted from plasma membrane to 
cytoplasm by tyrosine kinases, similar to the signals described in 
Chapter 15 for insulin and growth factors. Unlike the receptors for 
insulin and growth factors (page 604), lymphocyte antigen receptors 
lack an inherent tyrosine kinase activity. Instead, ligand binding to 
antigen receptors leads to the recruitment of cytoplasmic tyrosine 
kinase molecules to the inner surface of the plasma membrane. This 
process may be facilitated by the clustering of activated receptors. 
Several different tyrosine kinases have been implicated in signal 
transduction during lymphocyte activation, including members of 
the Src and Tec families. Src was the first tyrosine kinase to be identi- 
fied and is the product of the first cancer-causing oncogene to be 
discovered (page 633). 

Activation of these tyrosine kinases leads to a cascade of events 
and the activation of numerous signal transduction pathways, 
including: 

1. Activation of phospholipase C, which leads to the formation of 
IP; and DAG. As discussed on page 598, IP; causes a marked 
elevation in the levels of cytosolic Ca**, whereas DAG stimulates 
protein kinase C activity. 


2. Activation of Ras, which leads to the activation of the MAP 
kinase cascade (Section 15.11). 


3. Activation of PI3K, which catalyzes the formation of membrane- 
bound lipid messengers having diverse functions in cells (page 612). 


Transmission of signals along these various pathways, and oth- 
ers, leads to the activation of a number of transcription factors (e.g., 
NF-«B and NFAT) and the resulting transcription of dozens of genes 
that are not expressed in the resting T cell or B cell. 

As noted above, one of the most important responses by an acti- 
vated lymphocyte is the production and secretion of cytokines. Some 
of these cytokines may act as part of an autocrine circuit by binding 
to receptors on the cell that released them. Like other extracellular 
signals, cytokines bind to receptors on the outer surface of target 
cells, generating cytoplasmic signals that act on various intracellular 
targets. Cytokines utilize a novel signal transduction pathway 
referred to as the JAK-STAT pathway, which operates without the 
involvement of second messengers. The “JAK” portion of the name 
is an acronym for Janus kinases, a family of cytoplasmic tyrosine 
kinases whose members become activated following the binding ofa 
cytokine to a cell-surface receptor. (Janus is a two-faced Roman god 
who protected entrances and doorways.) “STAT” is an acronym for 
“signal transducers and activators of transcription,” a family of 
transcription factors that become activated when one of their tyro 
sine residues is phosphorylated by a JAK (see Figure 15.19c). Once 
phosphorylated, STAT molecules interact to form dimers that trans- 
locate from the cytoplasm to the nucleus where they bind to specific 
DNA sequences, such as an interferon-stimulated response element 


(ISRE). ISREs are found in the regulatory regions of a dozen or so 
genes that are activated when a cell is exposed to the cytokine inter- 
feron a (IFN-a). 

As with the hormones and growth factors discussed in 
Chapter 15, the specific response of a cell depends on the particular 
cytokine receptor engaged and the particular JAKs and STATs that 
are present in that cell. For example, it was noted earlier that IL-4 
acts to induce Ig class switching in B cells. This response follows the 
IL-4-induced phosphorylation of the transcription factor STAT6, 
which is present in the cytoplasm of activated B cells. Resistance to 


viral infection induced by interferons (page 665) is mediated through 691 


the phosphorylation of STAT1. Phosphorylation of other STATs may 
lead to the progression of the target cell through the cell cycle. 


REVIEW | 


1. Describe the signaling pathway in lymphocytes that is 
triggered in response to cytokine binding. 


————— 
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An illustration of the protein Cas?, the RNA-guided DNA endonuclease 
that is used in CRISPR-based genome editing. Cas9 protein (purple) 
associates with a guide RNA (purple backbone with white bases) and 
scans along genomic DNA looking for a segment of DNA (blue 
backbone with white bases) that matches with the guide RNA. When it 
finds this match, it cleaves the DNA at that point. 


Techniques in Cell 


and Molecular Biology 


THE ASILOMAR CONFERENCE 


Ts rapid advancements in recombinant DNA technology 
in the early 1970s provided researchers with the ability to 
“cut and paste” DNA from different organisms, such as 
viruses and bacteria, together for the first time. Although the 
research potential for such DNA manipulation was apparent, 
many researchers were concerned about the possible risks 
this new technology posed to human health and the envi- 
ronment. In response, a group of researchers led by Stanford 
biochemist Paul Berg called for an international voluntary 
moratorium on recombinant DNA experiments in 1974. The 
following year, Berg and others organized an international 
conference at the Asilomar Conference Center in Pacific 
Grove, California, to address the concerns regarding the use 
of recombinant DNA technology. 


The Asilomar Conference of 1975 brought together 


140 molecular biologists, lawyers, government officials, and 
journalists with the support of the National Institutes of 
Health and the National Academy of Sciences. Over several 
days, the attendees discussed (and argued over) the 
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potential risks associated with different types of experiments 
and methods to best minimize these risks. By the end of the 
meeting, an agreement was reached to lift the moratorium 
and allow research to continue with a set of defined restric- 
tions. The guidelines set forth in the Asilomar meeting were 
adopted by the NIH the next year. 


More recently, the development of new technologies that 
pave the way for human genome editing has caused 
researchers to consider convening a meeting similar to the 
historic Asilomar conference. Of particular ethical concern is 


18.1 The Light Microscope 


Because of the very small size of the subject matter, cell and molecular 
biology is more dependent on the development of new instruments 
and technologies than any other branch of biology. Consequently, it is 
difficult to learn about cell and molecular biology without also learn- 
ing about the technology that is required to collect data. In this chap- 
ter, we will survey the methods used most commonly in the field 
without becoming immersed in the details or the many variations 
that are employed. These are the goals of the present chapter: to 
describe the ways that selected techniques are used and to provide 
examples of the types of information that can be learned by using 
these techniques. We will begin with the instrument that enabled 
biologists to discover the very existence of cells, providing the starting 
point for all of the information that has been presented in this text. 
Microscopes are instruments that produce an enlarged image of 
an object. FIGURE 18.1 shows the most important components of a 
compound light microscope. A light source, which may be external 
to the microscope or built into its base, illuminates the specimen. 
The substage condenser lens gathers the diffuse rays from the light 
source and illuminates the specimen with a small cone of bright light 
that allows very small parts of the specimen to be seen after magnifi- 
cation. The light rays focused on the specimen by the condenser lens 
are then collected by the microscope’s objective lens. From this 


Ocular lens 


Objective lens 
Specimen 


Condenser lens 


Light source 


FIGURE 18.1 Sectional diagram through a compound light microscope; 
that is, a microscope that has both an objective and an ocular lens. 


the potential for genome editing techniques to be used to 
genetically engineer human embryos. In a letter to Science 
published in April 2015, a group of researchers led by 
Jennifer Doudna (University of California, Berkeley) has 
called for a global ban on genome modifications of human 
germline cells and increased discussions with the public 
about genome editing. The letter also calls for a meeting of 
researchers, ethicists, lawyers, and the public to consider the 
merits and risks of genome editing, and to recommend 
policies where they are needed. 


point, we need to consider two sets of light rays that enter the objec- 
tive lens: those that the specimen has altered and those that it hasn't 
(FIGURE 18.2). The latter group consists of light from the condenser 
that passes directly into the objective lens, forming the background 
light of the visual field. The former group of light rays emanates from 
the many parts of the specimen and forms the image of the speci- 
men. These light rays are brought to focus by the objective lens to 
form a real, enlarged image of the object within the column of the 


Plane of focus —— 
of image 


Plane of focus —— 
of light source 


Objective lens 


Focal length 


: of objective lens 
Plane of specimen —— 


Lamp 


(===) Light rays that form the image 
(—= ) Background light of the field 


FIGURE 18.2 The paths taken by light rays that form the image of the 
specimen and those that form the background light of the field. Light 
rays from the specimen are brought to focus on the retina, whereas 
background rays are out of focus, producing a diffuse bright field. As 
discussed later in the text, the resolving power of an objective lens is 
proportional to the sine of the angle a. Lenses with greater resolving power 
have shorter focal lengths, which means that the specimen is situated closer 
to the objective lens when it is brought into focus. 
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used as an object by a second lens system, the ocular lens, to form an 
enlarged and virtual image. A third lens system located in the front 
part of the eye uses the virtual image produced by the ocular lens as 
an object to produce a real image on the retina. When the focusing 
knob of the light microscope is turned, the relative distance between 
the specimen and the objective lens changes, allowing the final image 
to become focused precisely on the plane of the retina. The total 
magnification attained by the microscope is the product of the mag- 
nifications produced by the objective lens and the ocular lens. 


Resolution 


Up to this point we have considered only the magnification of an 
object without paying any attention to the quality of the image pro- 
duced, that is, the extent to which the detail of the specimen is 
retained in the image. Suppose you are looking at a structure in the 
microscope using a relatively high-power objective lens (for example, 
63 x) and an ocular lens that magnifies the image of the objective lens 
another fivefold (a5 ocular). Suppose the field is composed of chro- 
mosomes and it is important to determine the number present, but 
some of them are very close together and cannot be distinguished as 
separate structures (FIGURE 18.3a). One solution to the problem 
might be to change oculars to increase the size of the object being 
viewed. If you were to switch from the 5x to a10 x ocular, you would 
most likely increase your ability to determine the number of chromo- 
somes present (Figure 18.3b) because you have now spread the image 
of the chromosomes produced by the objective lens over a greater 
part of your retina. The more photoreceptors that provide informa- 
tion about the image, the more detail that can be seen (FIGURE 18.4). 
If, however, you switch to a 20x ocular, you are not likely to see 
additional detail, although the image is larger (Figure 18.3c), that is, 
occupies more retinal surface. The change of ocular fails to provide 
additional information because the image produced by the objective 
lens does not possess any further detail to be enhanced by the 
increased power of the ocular lens. The second switch in oculars pro- 
vides only empty magnification (as in Figure 18.3c). 

The optical quality of an objective lens is measured by the extent 
to which the fine detail present in a specimen can be discriminated, 
or resolved. The resolution attained by a microscope is limited by 
diffraction. Because of diffraction, light emanating from a point in 
the specimen can never be seen as a point in the image, but only as a 
small disk. If the disks produced by two nearby points overlap, the 
points cannot be distinguished in the image. Thus the resolving 
power of a microscope can be defined in terms of the ability to see 
two neighboring points in the visual field as two distinct entities. 
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FIGURE 18.3 Magnification versus resolution. The transition from (a) to 
(b) provides the observer with increased magnification and resolution, 

whereas the transition from (b) to (c) provides only increased magnification 
(empty magnification). In fact, the quality of the image actually deteriorates 


(a) (b) 


as empty magnification increases. 
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FIGURE 18.4 The resolving power of the eye. A highly schematic 
illustration of the relationship between the stimulation of individual 
photoreceptors (left) and the resulting scene one would perceive (right). 
The diagram illustrates the value of having the image fall over a sufficiently 
large area of the retina. 


The resolving power of a microscope is limited by the wavelength of 
the illumination according to the equation 


_ 0.614 


nsin a 


d 


where d is the minimum distance that two points in the specimen 
must be separated to be resolved, J is the wavelength of light (527 nm 
is used for white light), and n is the refractive index of the medium 
present between the specimen and objective lens. Alpha (a) is equal 
to half the angle of the cone of light entering the objective lens as 
shown in Figure 18.2. Alpha is a measure of the light-gathering abil- 
ity of the lens and is directly related to its aperture. 

The denominator of the equation in column 1 is called the 
numerical aperture (N. A.). The numerical aperture is a constant for 
each lens, a measure of its light-gathering qualities. For an objective 
that is designed for use in air, the maximum possible N.A. is 1.0 
because the sine of the maximum possible angle of o, 90°, is 1, and 
the refractive index of air is 1.0. For an objective designed to be 
immersed in oil, the maximum possible N.A. is approximately 1.5. 
As a common rule, the maximum useful magnification of a light 
microscope ranges between 500 and 1000 times the numerical aper- 
ture of the objective lens being used. Attempts to enlarge the image 
beyond this point result in empty magnification, and the quality of 
the image deteriorates. High numerical aperture is achieved by using 
lenses with a short focal length, which requires the lens to be placed 
very close to the specimen. 

If we substitute the minimum possible wavelength of illumina- 
tion and the greatest possible numerical aperture in the preceding 
equation, we can determine the limit of resolution of the conven- 
tional light microscope. When these substitutions are made, a value 
of slightly less than 0.2 um (or 200 nm) is obtained, which is 
sufficient to resolve larger cellular organelles, such as nuclei and 
mitochondria. In contrast, the limit of resolution of the naked eye, 
which has a numerical aperture of about 0.004, is approximately 
0.1 mm. 

In addition to these theoretical factors, resolving power is also 
affected by optical flaws, or aberrations. There are seven important 
aberrations, and they are the handicaps that lensmakers must over- 
come to produce objective lenses whose actual resolving power 
approaches that of the theoretical limits. Objective lenses are made 
of a complex series of lenses, rather than a single convergent lens, to 
eliminate these aberrations. One lens unit typically affords the 
required magnification, while the others compensate for errors in 
the first lens to provide a corrected overall image. 


Visibility 

On the more practical side of microscopy from that of limits of reso- 
lution is the subject of visibility, which is concerned with factors that 
allow an object actually to be observed. This may seem like a trivial 
matter; if an object is there, it should be seen. Consider the case of a 
clear glass bead. Under most conditions, against most backgrounds, 
the bead is clearly visible. If, however, a glass bead is dropped into a 
beaker of immersion oil having the same refractive index as glass, 
the bead disappears from view because it no longer affects light in 
any obvious manner that is different from the background fluid. 
Anyone who has spent any time searching for an amoeba can appre- 
ciate the problem of visibility when using the light microscope. 

What we see through a window or through a microscope are 
those objects that affect the light differently from their background. 
Another term for visibility in this sense of the word is contrast, or 
the difference in appearance between adjacent parts of an object or 
between an object and its background. The need for contrast can be 
appreciated by considering the stars. Whereas a clear night sky can 
be filled with stars, the same sky during the day appears devoid of 
celestial bodies. The stars have disappeared from view, but they 
haven't disappeared from the sky. They are no longer visible against 
the much brighter background. 

In the macroscopic world, we examine objects by having the 
light fall on them and then observing the light that is reflected back 
to our eyes. In contrast, when we use a microscope, we place the 
specimen between the light source and our eyes and view the light 
that is transmitted through the object (or more properly, diffracted 
by the object). If you take an object and go into a room with one light 
source and hold the object between the light source and your eye, 
you can appreciate part of the difficulty in such illumination; it 
requires that the object being examined be nearly transparent, that 
is, translucent. Therein lies another aspect of the problem: Objects 
that are “nearly transparent” can be difficult to see. 

One of the best ways to make a thin, translucent specimen visible 
under the microscope is to stain it with a dye, which absorbs only cer- 
tain wavelengths within the visible spectrum. Those wavelengths that 
are not absorbed are transmitted to the eye, causing the stained object 
to appear colored. Different dyes bind to different types of biological 
molecules, and therefore, not only do these procedures heighten visi- 
bility of the specimen, they can also indicate where in the cells or tis- 
sues different types of substances are found. A good example is the 
Feulgen stain, which is specific for DNA, causing the chromosomes to 
appear colored under the microscope (FIGURE 18.5). A problem with 
stains is that they generally cannot be used with living cells; they are 
usually toxic, or the staining conditions are toxic, or the stains do not 
penetrate the plasma membrane. The Feulgen stain, for example, 
requires that the tissue be hydrolyzed in acid before the stain is applied. 


18.2 Bright-Field and Phase- 
Contrast Microscopy 


Different types of light microscopes use different types of illumina- 
tion. In a bright-field microscope, the cone of light that illumi- 
nates the specimen is seen as a bright background against which the 
image of the specimen must be contrasted. Bright-field microscopy 
is ideally suited for specimens of high contrast, such as stained 
sections of tissues, but it may not provide optimal visibility for other 


FIGURE 18.5 The Feulgen stain. This staining procedure is specific for 
DNA as indicated by the localization of the dye to the chromosomes of this 
onion root tip cell that was in metaphase of mitosis at the time it was fixed. 


Source: Ed Reschke. 


specimens. In the following sections, we will consider various means 
of making specimens more visible in a light microscope. 


Bright-Field Light Microscopy 


Specimens to be observed with the light microscope are broadly 
divided into two categories: whole mounts and sections. A whole 
mount is an intact object, either living or dead, that can consist of 
an entire microscopic organism such as a protozoan or a small part 
of a larger organism. Most tissues of plants and animals are much 
too opaque for microscopic analysis unless examined as a very thin 
slice, or section. To prepare a section, the cells are first killed by 
immersing the tissue in a chemical solution, called a fixative. A 
good fixative rapidly penetrates the cell membrane and immobilizes 
all of its macromolecular material so that the structure of the cell is 
maintained as close as possible to that of the living state. The most 
common fixatives for the light microscope are solutions of formalde- 
hyde, alcohol, or acetic acid. 

After fixation, the tissue is dehydrated by transfer through a 
series of alcohols and then usually embedded in paraffin (wax), 
which provides mechanical support during sectioning. Paraffin is 
used as an embedding medium because it is readily dissolved by 
organic solvents. Slides containing adherent paraffin sections are 
immersed in toluene, which dissolves the wax, leaving the thin slice 
of tissue attached to the slide, where it can be stained or treated with 
antibodies or other agents. After staining, a coverslip is mounted 
over the tissue using a mounting medium that has the same refrac- 
tive index as the glass slide and coverslip. 


Phase-Contrast Microscopy 


Small, unstained specimens such as a living cell can be very difficult 
to see with a bright-field microscope (FIGURE 18.6a). The phase- 
contrast microscope makes highly transparent objects more 
visible (Figure 18.6). We can distinguish different parts of an object 
because they affect light differently from one another. One basis for 
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(a) 


(b) 


(c) 


FIGURE 18.6 A comparison of cells seen with different types of light 
microscopes. Light micrographs of a ciliated protist as observed under 
bright-field (a), phase-contrast (b), and differential interference contrast 
(DIC) (or Nomarski) optics (c). The organism is barely visible under bright- 
field illumination but clearly seen under phase-contrast and DIC microscopy. 


Source: (a—c) Micrographs by M. |. Walker/Photo Researchers, Inc. 


such differences is refractive index. Cell organelles are made up of 
different proportions of various molecules: DNA, RNA, protein, lipid, 
carbohydrate, salts, and water. Regions of different composition are 
likely to have different refractive indices. Normally such differences 
cannot be detected by our eyes. However, the phase-contrast micro- 
scope converts differences in refractive index into differences in 
intensity (relative brightness and darkness), which are visible to the 
eye. Phase-contrast microscopes accomplish this result by (1) separat- 
ing the direct light that enters the objective lens from the diffracted 
light emanating from the specimen and (2) causing light rays from 
these two sources to interfere with one another. The relative bright- 
ness or darkness of each part of the image reflects the way in which 
the light from that part of the specimen interferes with the direct light. 

Phase-contrast microscopes are most useful for examining 
intracellular components of living cells at relatively high resolu- 
tion. For example, the dynamic motility of mitochondria, mitotic 


chromosomes, and vacuoles can be followed and recorded with 
these optics. Simply watching the way tiny particles and vacuoles of 
cells are bumped around in a random manner in a living cell conveys 
an excitement about life that is unattainable by observing stained, 
dead cells. The greatest benefit derived from the invention of the 
phase-contrast microscope has not been in the discovery of new 
structures, but in its everyday use in research and teaching labs for 
observing cells in a more revealing way. 

The phase-contrast microscope has optical handicaps that 
result in loss of resolution, and the image suffers from interfering 
halos and shading produced where sharp changes in refractive 
index occur. The phase-contrast microscope is a type of interference 
microscope. Other types of interference microscopes minimize these 
optical artifacts by achieving a complete separation of direct and 
diffracted beams using complex light paths and prisms. Another 
type of interference system, termed differential interference contrast 
(DIC), or sometimes Nomarski interference after its developer, 
delivers an image that has an apparent three-dimensional quality 
(Figure 18.6c). Contrast in DIC microscopy depends on the rate of 
change of refractive index across a specimen. As a consequence, the 
edges of structures, where the refractive index varies markedly over 
a relatively small distance, are seen with especially good contrast. 


18.3 Fluorescence Microscopy 
(and Related Fluorescence-Based 
Techniques) 


Over the past several decades, the light microscope has been trans- 
formed from an instrument designed primarily to examine sections 
of fixed tissues to one capable of observing the dynamic events 
occurring at the molecular level in living cells. These advances in 
live-cell imaging have been made possible to a large extent by inno- 
vations in fluorescence microscopy. The fluorescence microscope 
allows viewers to observe the location of certain molecules (called 
fluorophores). Fluorophores absorb photons of a specific wavelength 
and release a portion of the energy in longer wavelengths, a phe- 
nomenon called fluorescence. The light source in a fluorescence 
microscope produces a beam of light that travels through a filter, 
which blocks all wavelengths except those capable of exciting the 
fluorophore. The beam of monochromatic light is focused on the 
specimen containing the fluorophore, which becomes excited and 
emits light of a visible wavelength that is focused by the objective 
lens into an image that can be seen by the viewer or digitally cap- 
tured using a charge-coupled device (CCD). 

There are many different ways that fluorescent molecules can be 
used in cell and molecular biology. In one of its most common and 
earliest applications, a relatively small organic fluorophore (such as 
rhodamine or fluorescein) is covalently linked (conjugated) to an 
antibody to produce a fluorescent antibody that can be used to 
determine the location of a specific protein within the cell. This 
technique is called immunofluorescence and is illustrated by the 
micrograph in Figure 9.5. Immunofluorescence is discussed further 
in Section 18.26. Fluorescently labeled proteins can also be used to 
study dynamic processes as they occur in a living cell. For example, 
a fluorophore can be linked to a cellular protein, such as actin or 
tubulin, and the fluorescently labeled protein injected into a living 
cell (as in Figure 9.20). Fluorophores can also be used to locate 


DNA or RNA molecules that contain specific nucleotide sequences 
as illustrated in Figure 10.20. In other examples, fluorophores have 
been used to study the sizes of molecules that can pass between cells 
(see Figure 7.30), as indicators of transmembrane potentials (see 
Figure 5.21), or as probes to determine the free Ca** concentration 
in the cytosol (see Figure 15.27). The use of calcium-sensitive fluo- 
rophores is discussed on page 614. 

In all of the examples discussed in the previous paragraph, the 
biomolecules have been made fluorescent by chemically conjugating 
them with a synthetic fluorophore. But there are also naturally fluo- 
rescent molecules. Humans have probably wondered for thousands 
of years why jellyfish and certain other marine organisms glow in the 
dark. It wasn't until the early 1960s that Osamu Shimomura discov- 
ered that a certain species of jellyfish (Aequorea victoria) owes its 
luminescent character to the presence of fluorescent proteins, such 
as aequorin and the green fluorescent protein (GFP), that he was 
able to purify and analyze. In the early 1990s, Douglas Prasher, 
Martin Chalfie, and colleagues cloned the gene that encodes GFP 
and showed that the fluorescent protein could be genetically incor- 
porated and expressed in another organism. This study set the stage 
for the use of GFP to study the spatial and temporal distribution of 
proteins in living cells. Unlike most other fluorescent proteins, GFP 
does not require an additional cofactor to absorb and emit light; 
instead, the light-absorbing/emitting chromophore is formed by 
self-modification (i.e., by an autocatalytic reaction) of three of the 
amino acids that make up the primary structure of the GFP polypep- 
tide. In most studies employing GFP, a recombinant DNA is con- 
structed in which the coding region of the GFP gene is joined to the 
coding region of the gene for the protein under study. This recombi- 
nant DNA is used to transfect cells, which then synthesize a chimeric 
protein containing fluorescent GFP fused to the protein under study. 
Use of GFP to study membrane trafficking is discussed in Section 8.2. 
In these various experimental protocols, the labeled proteins partici- 
pate in the normal activities of the cell, and their location can be 
followed microscopically to reveal the dynamic activities in which 
the protein participates (see Figures 8.4 and 9.22). 

Live-cell imaging studies can often be made more informative by 
the simultaneous use of GFP variants that exhibit different spectral 
properties. Variants of GFP that fluoresce in shades of blue (BFP), 
yellow (YFP), and cyan (CFP) were generated by Roger Tsien of the 
University of California, San Diego, through directed mutagenesis of 
the GFP gene. In addition, a distantly related red fluorescent tetra- 
meric protein (DsRed) has been isolated from a sea anemone. 
Monomeric variants of DsRed (e.g., mBanana, mTangerine, and 
mOrange), which fluoresce in a variety of distinguishable colors, 
have also been generated by mutagenesis experiments in Tsien’s lab. 
The type of information that can be obtained using this colorful “pal- 
ette” of GFP variants is illustrated by the study depicted in FIGURE 18.7, 
in which researchers generated strains of mice whose neurons con- 
tained differently colored fluorescent proteins. When a muscle of one 
of these mice was exposed surgically, the investigators could observe 
the dynamic interactions between the variously colored neurons and 
the neuromuscular junctions being innervated (see Figure 4.56 
for a drawing of this type of junction). They watched, for example, 
as branches from a CFP-colored neuron competed with branches 
from a YFP-colored neuron for synaptic contact with the muscle tis- 
sue. In each case they found that, when two neurons compete for 
innervation of different muscle fibers, all of the “winning” branches 
belong to one of the neurons, while all of the “losing” branches belong 
to the other neuron (Figure 18.7b). The micrographs in FIGURE 18.8 


(a) (b) 


FIGURE 18.7 Use of GFP variants to follow the dynamic interactions 
between neurons and their target cells in vivo. (a) A portion of the brain 
of a mouse with two differently colored, fluorescent neurons. These mice 
are generated by mating transgenic animals whose neurons are labeled 
with one or the other fluorescent protein. (6) Fluorescence micrograph of 
portions of two neurons, one labeled with YFP and the other with CFP. The 
arrows indicate two different neuromuscular junctions on two different 
muscle fibers in which the YFP-labeled axonal branch has outcompeted the 
CFP-labeled branch. The third junction is innervated by the CFP-labeled 
axon in the absence of competition. 

Source: (a) Courtesy N. Kasthuri & J. W. Lichtman, Washington University 
School of Medicine; (b) From N. Kasthuri and Jeff W. Lichtman, Nature 424: 
429, 2003, Fig. 4A. Reprinted by permission from Macmillan Publishers Ltd. 


provide another example of how much can be learned about the spa- 
tial and temporal dynamics of cellular events using two (or more) 
spectrally distinct fluorescent proteins. In this case, the dual-label 
strategy has allowed investigators to follow the simultaneous move- 
ments of two different proteins in real time as they occur within the 
boundaries of a single cellular organelle. Another example is described 
in the Experimental Pathways in Section 8.18. 

GFP variants have also been useful in a technique, called fluores- 
cence (or Forster) resonance energy transfer (FRET), which can meas- 
ure distances between fluorophores in the nanoscale range. FRET is 
typically employed to measure changes in distance between two 
parts of a protein (or between two separate proteins within a larger 
structure). FRET can be used to study such changes as they occur in 
vitro or within a living cell. FRET is based on the fact that excitation 
energy can be transferred from one fluorescent group (a donor) to 
another fluorescent group (an acceptor), as long as the two groups 
are in very close proximity (1 to 10nm). This transfer of energy 
reduces the fluorescence intensity of the donor and increases the 
fluorescence intensity of the acceptor. The efficiency of transfer 
between two fluorescent groups that are bound to strategic sites on a 
protein decreases sharply as the distance between the two groups 
increases. As a result, determination of changes in fluorescence of 
the donor and acceptor groups that occur during a cellular process 
provides a measure of changes in the distance between them at vari- 
ous stages in the process. This technique is illustrated in FIGURE 18.9, 
where two different GFP variants (labeled ECFP and EYFP) have 
been covalently linked to two different parts of a cGMP-dependent 
protein kinase (PKG). In the absence of bound cGMP, the two fluo- 
rophores are too far apart for energy transfer to occur. Binding of 
cGMP induces a conformational change in the protein that brings 
the two fluorophores into close enough proximity for FRET to occur. 
FRET can also be used to follow many other processes including 
protein folding or the association and dissociation of components 
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dynamic events within cellular organelles of living cells. 
The Golgi cisternae of a budding yeast cell are not 
organized into distinct stacks as in most eukaryotic cells 
but are dispersed within the cytoplasm. Each of the 
brightly colored oval structures is an individual cisterna 
whose color is due to the localization of fluorescently 
labeled protein molecules. A cisterna that appears green 
contains a GFP-labeled protein (Vrg4) that is involved in 
early Golgi activities, whereas a cisterna that appears red 
contains a DsRed-labeled protein (Sec7) that is involved in 
late Golgi activities. This series of micrographs reveals the 
protein composition of individual cisternae over a period 
of about 13 minutes (elapsed time is indicated in the 
lower left corner). The white arrowhead and arrow point 
out two of these cisternae over time. These two cisternae 
are shown imaged by themselves in the lower rows of 
micrographs. It can be seen from this series that the 
protein composition of an individual cisterna changes 
over time from one containing “early” Golgi proteins 
(green) to one containing “late” Golgi proteins (red). 
These findings provide direct visual support for the 
cisternal maturation model discussed in Section 8.8. 
Source: From Eugene Losev et al., Courtesy of Benjamin 
S. Glick, Nature 441:1004, 2006, Fig. 3A. Reprinted by 
permission from Macmillan Publishers Ltd. 


within a membrane. The separation of the cytoplasmic tails of integ- 
rin subunits following activation by talin (Figure 7.13) is an example 
of an event that has been studied by FRET. Figure 15.8 shows an early 
example where FRET was used to reveal the changes in cAMP levels 
following binding of a neurotransmitter with its surface receptor. 
After a period of excitation by a light source, most fluorophores 
reach a state where they are no longer able to fluoresce—that is, they 


FIGURE 18.9 Fluorescence resonance energy transfer (FRET). This 
schematic diagram shows an example of the use of FRET technology to 
follow the change in conformation of a protein (PKG) following cGMP 
binding. The two small, barrel-shaped fluorescent proteins—enhanced CFP 
(ECFP) and enhanced YFP (EYFP)—are shown in their fluorescent color. In 
the absence of cGMP, excitation of ECFP with light of 440 nm leads to 
emission of light of 480 nm from the fluorescent protein. Following cGMP 
binding, a conformational change in PKG brings the two fluorescent 
proteins into close enough proximity for FRET to occur. As a result, 
excitation of the ECFP donor with light of 440nm leads to energy transfer to 
the EYFP acceptor and emission of light of 535nm from the acceptor. The 
enhanced versions of these proteins have greater fluorescence intensity 
and tend to be more stable than the original protein molecules. 


Source: From Moritoshi Sato and Yoshio Umezawa, Analytical Chemistry 
72:5924, 2000. © 2000 American Chemical Society. 


undergo photobleaching. While this is often considered a detriment 
for many experiments, photobleaching has proven useful for a tech- 
nique known as fluorescence recovery after photobleaching (FRAP). 
FRAP is used to study the rate of protein turnover in living cells, and 
was discussed in conjunction with the movement of membrane pro- 
teins (Figure 4.27). FRAP has also been used extensively to study 
cytoskeletal protein turnover. To carry out a FRAP experiment 
focusing specifically on tubulin, a cell is either injected with tubulin 
coupled to a fluorescent dye or induced to express GFP-tubulin. 
Using a microscope equipped with a laser that can be focused on a 
small region of the cell, the fluorescence of the tubulin in that region 
is bleached by the laser beam (FIGURE 18.10). It is then possible to 
follow the specimen over time and measure the recovery of the fluo- 
rescent signal into the bleached zone (Figure 18.10). 

Another innovation in fluorescence microscopy has been the 
development of software that can be used to examine large numbers 
of images and score them for various characteristics. These auto- 
mated technologies have been particularly useful for screening the 
phenotypes of cells that have been subjected to a library of different 
siRNAs, looking for genes encoding proteins that are involved in a 
particular cellular process, such as membrane _ trafficking 
(Section 8.2) or mitosis (Figure 18.51). In addition, software has also 
been developed to track the movement of fluorescent proteins in liv- 
ing cells by automatically identifying and “following” a specific fluo- 
rescent spot that appears over numerous frames in time series. 

Another recent innovation in fluorescence microscopy falls 
under the heading of multiphoton microscopy, in which fluoro- 
phores present within cells are excited by the simultaneous 
absorption of two or more photons of longer wavelength. The longer 
the wavelength of the photon, the less its energy (which makes it less 
destructive to the illuminated cells and to the absorbing fluoro- 
phore) and the greater its penetrating power. Using multiphoton 
microscopy, researchers can track the movements of fluorescent 
proteins that are present at least 200 um deep within a living tissue. 
An example of the use of this approach can be seen in Figure 17.11, 
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where fluorescently labeled immune cells are followed as they move 
around within an excised lymph node. 


Laser Scanning Confocal Microscopy 


The use of digital cameras and imaging and computer processing has 
led to a renaissance in light microscopy over the past couple of dec- 
ades. So too has the development of a new type of light microscope. 
When a whole cell or a section of an organ is examined under a 
standard light microscope, the observer views the specimen at dif- 
ferent depths by changing the position of the objective lens by rotat- 
ing the focusing knob. As this is done, different parts of the specimen 
go in and out of focus. But the fact that the specimen contains differ- 
ent levels of focus reduces the ability to form a crisp image because 
those parts of the specimen above and below the plane of focus inter- 
fere with the light rays from that part that is in the plane of focus. 
In the late 1950s, Marvin Minsky of the Massachusetts Institute of 
Technology invented a revolutionary new instrument called a con- 
focal microscope that produces an image of a thin plane situated 
within a much thicker specimen. A schematic diagram of the optical 
components and light paths within a modern version of a laser scan- 
ning confocal microscope is shown in FIGURE 18.11a. In this type of 
microscope, the specimen is illuminated by a finely focused laser 
beam that rapidly scans across the specimen at a single depth, thus 
illuminating only a thin plane (or “optical section”) within the 
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FIGURE 18.10 The study of the cytoskeleton using FRAP. (a) A cell 
expressing GFP-tubulin incorporates the fluorescent tubulin into its 
microtubule array. When a laser is focused on a box on the array, the 
fluorescence is bleached (t = 0 sec). Over time the fluorescence is recovered 
in the bleached zone. (b) The fluorescence recovery can come from 
different properties of microtubules. The dynamic properties of the 
microtubules are very likely to contribute to fluorescence recovery. 
Alternatively, if new fluorescent microtubules grow from the centrosome, 
they can grow into the bleached zone. Finally, it is possible, that a 
fluorescent translocating microtubule can move through the bleached zone 
and be seen there at the time of observation. (c) Micrographs from a FRAP 
experiment performed on interphase (non-dividing) cells expressing GFP- 
tubulin. Two side-by-side cells were bleached in the boxed region with a 
laser, and the cells were then imaged over time. The fluorescence signal in 
the bleached region recovers over the time course of the experiment. 


Source: Courtesy of Claire Walczaw and Rania Rizk. 


specimen. As indicated in Figure 18.11a, confocal microscopes are 
used with fluorescence optics. As described earlier, short-wavelength 
incident light is absorbed by the fluorophores in a specimen and ree- 
mitted at longer wavelength. Light emitted from the specimen is 
brought to focus at a site within the microscope that contains a pin- 
hole aperture. Thus, the aperture and the illuminated plane in the 
specimen are confocal. Light rays emitted from the illuminated plane 
of the specimen can pass through the aperture, whereas any light 
rays that might emanate from above or below this plane are pre- 
vented from participating in image formation. As a result, out-of- 
focus points in the specimen become invisible. 

Figure 18.11b shows micrographs of a single cell nucleus taken 
at three different planes within the specimen. It is evident that 
objects out of the plane of focus have little effect on the quality of the 
image of each section. If desired, the images from separate optical 
sections can be stored in a computer and merged with one another 
to reconstruct a three-dimensional model of the entire object. 


Super-Resolution Fluorescence 
Microscopy 
In 2014, Eric Betzig, WE. Moerner, and Stefan Hell were awarded the 


Nobel Prize in Chemistry for the development of super-resolution 
fluorescence microscopy. It was pointed out in Section 18.1 that the 
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FIGURE 18.11 Laser scanning confocal fluorescence microscopy. (a) The light 
paths in a confocal fluorescence microscope. Light of short (blue) wavelength is 
emitted by a laser source, passes through a tiny aperture, and is reflected by a dichroic 
mirror (a type of mirror that reflects certain wavelengths and transmits others) into an 
objective lens and focused onto a spot in the plane of the specimen. Fluorophores in 
the specimen absorb the incident light and emit light of longer wavelength, which is 
able to pass through the dichroic mirror and come to focus in a plane that contains a 
pinhole aperture. The light then passes into a photomultiplier tube that amplifies the 
signal and is transmitted to a computer that forms a processed, digitized image. Any 
light rays that are emitted from above or below the optical plane in the specimen are 
prevented from passing through the pinhole aperture and thus do not contribute to 
formation of the image. This diagram shows the illumination of a single spot in the 
specimen. Different sites within this specimen plane are illuminated by means of a 
laser scanning process. The diameter of the pinhole aperture is adjustable. The smaller 
the aperture, the thinner the optical section and the greater the resolution, but the less 
intense the signal. (b) Confocal fluorescence micrographs of three separate optical 
sections, each 0.3 pm thick, of a yeast nucleus stained with two different fluorescently 
labeled antibodies. The red fluorescent antibody has stained the DNA within the 
nucleus, and the green fluorescent antibody has stained a telomere-binding protein 


Specimen e that is localized at the periphery of the nucleus. 
Source: (a, b) From Thierry Laroche and Susan M. Gasser, Cell 75:543, 1993, Cell by 
Focal plane Cell Press. Reproduced with permission of Cell Press in the format reuse in a book/ 
(a) textbook via Copyright Clearance Center and with permission from Elsevier. 


limit of resolution of the light microscope is approximately 200 nm 
owing to the diffraction properties of light. Over the past decade, a 
number of complex optical techniques have been developed that 
allow researchers to localize fluorescently labeled proteins at resolu- 
tions in the tens of nanometers. A number of these super-resolution 
techniques are based on the discovery that a certain mutation of the 
GFP polypeptide converts the protein into a photoactivatable mole- 
cule (PA-GFP). This mutant GFP remains essentially nonfluorescent 
until it is activated by violet light. Subsequent studies have expanded 
the number of available photoactivatable fluorescent proteins and 
have led to the development of photoswitchable proteins, whose 
fluorescence emission at a particular wavelength can be switched on 
and off with pulses of light. We will briefly consider only one of the 
super-resolution techniques, called STORM (stochastic optical 
reconstruction microscopy), that utilizes photoswitchable fluores- 
cent proteins. STORM allows investigators to localize a single fluo- 
rescent molecule within a resolution of less than 20 nm. In this 
technique, a specimen containing one or more photoswitchable 
fluorophores is illuminated with pulses of light of appropriate wave- 
lengths, which has the effect of switching the fluorescence of the 
labeled molecules on and off. During each cycle of illumination, 
most of the labeled molecules remain dark, but a small fraction of 
them are randomly activated. These individual activated fluorescent 
molecules appear as spots that, because of the diffraction properties 
of light, are several hundred nanometers wide. The center of each spot, 
which represents the actual location of the single fluorophore 


responsible for emitting the light, can be determined with high 
accuracy. This process is then repeated for a number of imaging 
cycles, producing a large number of coordinates that represent the 
locations of many of the fluorophores in the specimen. Analysis 
of these coordinates allows the researchers to reconstruct a super- 
resolution image of the field of view. The marked increase in 
resolution that can be attained with the super-resolution STORM 
technique versus that of a conventional fluorescence technique is 
evident by comparing the image of FIGURE 18.12a with those of 
Figures 18.12b and c. 


Light Sheet Fluorescence Microscopy 


In recent years, light sheet fluorescence microscopy has gained popu- 
larity as a technique that allows for noninvasive imaging of large, 
three-dimensional samples—often whole embryos or organisms such 
as zebrafish—over extended periods of time. In this method, an excita- 
tion laser is focused into a thin, flat sheet that illuminates the sample 
along a plane perpendicular to the angle of observation. This allows 
for the excitation of fluorophores only at the observed depth over an 
entire plane (rather than just at a point, as in confocal imaging), result- 
ing in decreased photodamage and increased speed of image 
acquisition. 

In 2014, Eric Betzig and others announced a new technique, 
called lattice light sheet microscopy, that illuminates samples using 


(a) (b) 


(c) 


FIGURE 18.12 Breaking the light microscope’s limit of resolution. (a) A conventional fluorescence micrograph of a portion of a cultured mammalian cell 
with microtubules labeled in green and clathrin-coated pits in red. At this high level of magnification, the image appears pixelated. Moreover, the overlap 
between the two fluorescent labels produces an orange color, which suggests that the two structures are interconnected. (b) ASTORM “super-resolution” 
micrograph of a similar field showing the microtubules and clathrin-coated pits clearly resolved and spatially separated from one another. (c) A magnified 
view of a portion of part b. (Discussion of this and other super-resolution techniques can be found in Nature 459:638, 2009; Ann. Rev. Biochem. 78:993, 
2009; Cell 143:1047, 2010; J. Cell Biol. 190:165, 2010; and J. Cell Sci. 124:1607, 2011.) 


Source: From Mark Bates et al., Courtesy of Xiaowei Zhuang, Science 317:1752, 2007; © 2007. Reprinted with permission from AAAS. 


multiple ultrathin light sheets, allowing for the generation of 3D 
images of a live biological specimen over extended periods of time at 
high spatiotemporal resolution with minimal photobleaching and 
photodamage (FIGURE 18.13). This technique allows researchers to 
visualize the subcellular localization of a protein of interest in a fly or 
worm embryo at subsecond intervals, for example. The ability to col- 
lect high-resolution images in 4D (three dimensions and time) has 
already resulted in numerous novel observations in a variety of sys- 
tems. With continued development and more widespread use, it is 
widely anticipated that lattice light sheet microscopy will provide 
numerous insights into how dynamic molecular processes occur in 
living cells. 


FIGURE 18.13 Live neutrophil-like cell imaged by lattice light sheet 
microscopy and digitally rendered. 


Source: Courtesy of Lillian Fritz-Laylin, Thomas Goddard, Graham Johnson, 
and R. Dyche Mullins, UCSF. 


18.4 Transmission Electron 
Microscopy 


The electron micrographs shown in this text were taken by two dif- 
ferent types of electron microscopes. Transmission electron micro- 
scopes (TEMs) form images using electrons that are transmitted 
through a specimen, whereas scanning electron microscopes (SEMs) 
utilize electrons that have bounced off the surface of the specimen. 
All of the comments in this section of the chapter concern the use of 
the TEM; the SEM is discussed separately in Section 18.6. 

The transmission electron microscope can provide vastly greater 
resolution than the light microscope. This is readily illustrated by 
comparing the two photos in FIGURE 18.14, which show images of 
adjacent sections of the same tissue at the same magnification using 
alight or an electron microscope. Although the photo in Figure 18.14a 
is near the limit of resolution of the light microscope, the photo in 
Figure 18.14b is a very low-power electron micrograph. The great 
resolving power of the electron microscope derives from the wave 
properties of electrons. As indicated in the equation in Section 18.1, 
the limit of resolution of a microscope is directly proportional to the 
wavelength of the illuminating light: the longer the wavelength, 
the poorer the resolution. Unlike a photon of light, which has a 
constant wavelength, the wavelength of an electron varies with the 
speed at which the particle is traveling, which in turn depends on the 
accelerating voltage applied in the microscope. This relationship is 
defined by the equation 


A=V150/V 


where A is the wavelength in angstroms and V is the accelerating 
voltage in volts. Standard TEMs operate with a voltage range from 
10,000 to 100,000 V. At 60,000 V, the wavelength of an electron is 
approximately 0.05 A. If this wavelength and the numerical aperture 
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(a) 


FIGURE 18.14 A comparison between the information contained in 
images taken by a light and electron microscope at a comparable 
magnification of 4500 times actual size. (a) A photo of skeletal muscle 
tissue that had been embedded in plastic, sectioned at 1 1m, and 
photographed with a light microscope under an oil immersion objective 
lens. (b) An adjacent section to that used for part a that was cut at 0.025 um 
and examined under the electron microscope at comparable magnification 
to that in a. The resulting image displays a 100- to 200-fold increase in 
resolution. Note the difference in the details of the muscle myofibrils, 
mitochondria, and the capillary containing a red blood cell. Whereas the 
light microscope cannot provide any additional information to that of a, the 
electron microscope can provide much more information, producing 
images, for example, of the structure of the individual membranes within a 
small portion of one of the mitochondria (as in Figure 5.22). 


Source: (a) and (6) Courtesy of Douglas E. Kelly and M. A. Cahill. 


attainable with the light microscope were substituted into the equa- 
tion on page 701, the limit of resolution would be about 0.03 A. In 
actual fact, the resolution attainable with a standard transmission 
electron microscope is about two orders of magnitude less than its 
theoretical limit. This is due to serious spherical aberration of elec- 
tron-focusing lenses, which requires that the numerical aperture of 
the lens be made very small (generally between 0.01 and 0.001). The 
practical limit of resolution of standard TEMs is in the range of 3 to 
5 A. The actual limit when observing cellular structure is more typically 
in the range of 10 to 15 A. 

Electron microscopes consist largely of a tall, hollow cylindrical 
column through which the electron beam passes and a console con- 
taining a panel of dials that electronically control the operation in 
the column. The top of the column contains the cathode, a tungsten 
wire filament that is heated to provide a source of electrons. Electrons 
are drawn from the hot filament and accelerated as a fine beam by 
the high voltage applied between the cathode and anode. Air is 
pumped out of the column prior to operation, producing a vacuum 
through which the electrons travel. If the air were not removed, 


electrons would be prematurely scattered by collision with gas 
molecules. 

Just as a beam of light rays can be focused by a ground glass lens, 
a beam of negatively charged electrons can be focused by electro- 
magnetic lenses, which are located in the wall of the column. The 
strength of the magnets is controlled by the current provided them, 
which is determined by the positions of the various dials of the con- 
sole. A comparison of the lens systems of a light and electron micro- 
scope is shown in FIGURE 18.15. The condenser lenses of an electron 
microscope are placed between the electron source and the speci- 
men, and they focus the electron beam on the specimen. The speci- 
men is supported on a small, thin metal grid (3 mm diameter) that is 
inserted with tweezers into a grid holder, which in turn is inserted 
into the column of the microscope. 

Because the focal lengths of the lenses of an electron micro- 
scope vary depending on the current supplied to them, one objective 
lens provides the entire range of magnification delivered by the 
instrument. As in the light microscope, the image from the objective 
lens serves as the object for an additional lens system. The image 
provided by the objective lens of an electron microscope is only 
magnified about 100 times, but unlike the light microscope, there is 
sufficient detail present in this image to magnify it an additional 
10,000 times. By altering the current applied to the various lenses of 
the microscope, magnifications can vary from about 1000 times to 
250,000 times. Electrons that have passed through the specimen are 
brought to focus on a phosphorescent screen situated at the bottom 
of the column. Electrons striking the screen excite a coating of fluo- 
rescent crystals, which emit their own visible light that is perceived 
by the eye as an image of the specimen. 
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FIGURE 18.15 A comparison of the lens systems of a light and 
electron microscope. 

Source: Reprinted from Alan W. Agar, Principles and Practice of Electron 
Microscope Operation, Elsevier/North-Holland, 1974. Used with permission 
from Elsevier Ltd. 


Image formation in the electron microscope depends on differ- 
ential scattering of electrons by parts of the specimen. Consider a 
beam of electrons emitted by the filament and focused on the screen. 
If no specimen were present in the column, the screen would be 
evenly illuminated by the beam of electrons, producing an image 
that is uniformly bright. By contrast, if a specimen is placed in the 
path of the beam, some of the electrons strike atoms in the specimen 
and are scattered away. Electrons that bounce off the specimen can- 
not pass through the very small aperture at the back focal plane of 
the objective lens and are, therefore, lost as participants in the for- 
mation of the image. 

The scattering of electrons by a part of the specimen is propor- 
tional to the size of the nuclei of the atoms that make up the specimen. 
Because the insoluble material of cells consists of atoms of relatively 
low atomic number—carbon, oxygen, nitrogen, and hydrogen— 
biological material possesses very little intrinsic capability to scatter 
electrons. To increase electron scattering and obtain required con- 
trast, tissues are fixed and stained with solutions of heavy metals 
(described below). These metals penetrate into the structure of the 
cells and become selectively complexed with different parts of the 
organelles. Those parts of a cell that bind the greatest number of 
metal atoms allow passage of the least number of electrons. The fewer 
electrons that are focused on the screen at a given spot, the darker 
that spot. Photographs of the image are made by lifting the viewing 
screen out of the way and allowing the electrons to strike a photo- 
graphic plate in position beneath the screen. Because photographic 
emulsions are directly sensitive to electrons, much as they are to 
light, an image of the specimen can be recorded directly on film. In 
recent years, film has largely been replaced with digital imaging 
techniques. Most modern TEMs make use of indirect electron detec- 
tion, where electrons are converted to photons which can then be 
detected by a CCD sensor, creating a digital image. 


18.5 Specimen Preparation 
for Electron Microscopy 


As with the light microscope, tissues to be examined in the electron 
microscope must be fixed, embedded, and sectioned. Fixation of tis- 
sue for electron microscopy (FIGURE 18.16) is much more critical 
than for light microscopy because the sections are subjected to much 
greater scrutiny. A fixative must stop the life of the cell without sig- 
nificantly altering the structure of that cell. At the level of resolution 
of the electron microscope, relatively minor damage, such as swollen 
mitochondria or ruptured endoplasmic reticulum, becomes very 
apparent. To obtain the most rapid fixation and the least cellular 
damage, very small pieces of tissue (less than 1.0 mm’) are fixed and 
embedded. Fixatives are chemicals that denature and precipitate cel- 
lular macromolecules. Chemicals having such action may cause the 
coagulation or precipitation of materials that had no structure in the 
living cell, leading to the formation of an artifact. The best argu- 
ment that a particular structure is not an artifact is the demonstra- 
tion of its existence in cells fixed in a variety of different ways or, 
even better, not fixed at all. To view cells that have not been fixed, the 
tissue is rapidly frozen, and special techniques are utilized to reveal 
its structure (see cryofixation and freeze-fracture replication, 
described later). The most common fixatives for electron micros- 
copy are glutaraldehyde and osmium tetroxide. Glutaraldehyde is 
a 5-carbon compound with an aldehyde group at each end of the 


molecule. The aldehyde groups react with amino groups in proteins 
and cross-link the proteins into an insoluble network. Osmium is a 
heavy metal that reacts primarily with fatty acids leading to the pres- 
ervation of cellular membranes. 

Once the tissue has been fixed, the water is removed by dehy- 
dration in alcohol, and the tissue spaces are filled with a material that 
supports tissue sectioning. The demands of electron microscopy 
require the sections to be very thin. The wax sections cut for light 
microscopy are rarely thinner than about 5 um, whereas sections for 
conventional electron microscopy are best when cut at less than 
0.1 um (equivalent in thickness to about four ribosomes). 

Tissues to be sectioned for electron microscopy are usually 
embedded in epoxy resins. Sections are cut by slowly bringing the 
plastic block down across an extremely sharp cutting edge 
(Figure 18.16) made of cut glass or a finely polished diamond face. 
The sections coming off the knife edge float onto the surface of a 
trough of water that is contained just behind the knife edge. The sec- 
tions are then picked up with the metal specimen grid and dried 
onto its surface. The tissue is stained by floating the grids on drops 
of heavy metal solutions, primarily uranyl acetate and lead citrate. 
These heavy metal atoms bind to macromolecules and provide the 
atomic density required to scatter the electron beam. In addition to 
the standard stains, tissue sections can be treated with metal-tagged 
antibodies or other materials that react with specific molecules in 
the tissue section. Studies with antibodies are usually carried out on 
tissues embedded in acrylic resins, which are more permeable to 
large molecules than epoxy resins. 


Cryofixation and the Use 
of Frozen Specimens 


Cells and tissues do not have to be fixed with chemicals and embed- 
ded in plastic resins in order to be observed with the electron micro- 
scope. Alternatively, cells and tissues can be rapidly frozen. Just as a 
chemical fixative stops metabolic processes and preserves biological 
structure, so too does rapid freezing, which is called cryofixation. 
Because cryofixation accomplishes these goals without altering the 
cell’s macromolecules, it is less likely to lead to the formation of arti- 
facts. The major difficulty with cryofixation is the formation of ice 
crystals, which grow outward from sites where nucleation occurs. As 
an ice crystal grows, it destroys the fragile contents of the cell in 
which it develops. The best way to avoid ice crystal formation is to 
freeze the specimen so rapidly that crystals don't have time to grow. 
It is as if the water is frozen yet remains in its liquid state. Water in 
this state is said to be “vitrified” There are several techniques 
employed to achieve such ultrarapid freezing rates. Smaller speci- 
mens are typically plunged into a liquid of very low temperature 
(such as liquid propane, boiling point of -42°C). Larger specimens 
are best treated by high-pressure freezing. In this technique, the 
specimen is subjected to high hydrostatic pressure and sprayed with 
jets of liquid nitrogen. High pressure lowers the freezing point of 
water, reducing the rate of ice crystal growth. 

You might not think that a frozen block of tissue would be of 
much use to a microscopist, but a surprising number of approaches 
can be taken to visualize frozen cellular structure in the light or elec- 
tron microscope. For example, after suitable preparation, a frozen 
block of tissue can be sectioned with a special microtome in a 
manner similar to that of a paraffin or plastic block of tissue. Frozen 
sections (cryosections) can be prepared for examination under 
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the bottom edge of the block 


Drop of heavy 
metal stain 


Sections are stained 


FIGURE 18.16 Preparation of a specimen for observation in the electron microscope. 


either a light or an electron microscope. Frozen sections are particu- 
larly useful for studies on enzymes, whose activities tend to be dena- 
tured by chemical fixatives. Because frozen sections can be prepared 
much more rapidly than paraffin or plastic sections, they are often 
employed by pathologists to examine the light microscopic structure 
of tissues removed during surgery. As a result, the determination as 
to whether a tumor is malignant can be made while the patient is still 
on the operating table. 

Frozen cells do not have to be sectioned to reveal internal struc- 
ture. Figure 1.11 shows an image of the thin, peripheral region of an 
intact cell that had been crawling over the surface of an electron 
microscope grid an instant before it was rapidly frozen. Unlike the 
standard electron micrograph, the image in Figure 1.11 has a three 
dimensionality—like the specimen itself—because it was generated 
by a computer rather than directly by a camera. To obtain the image, 


the computer aligns a large number of two-dimensional digital 
images of the cell that are captured as the specimen is tilted at defined 
angles relative to the axis of the electron beam. The three-dimen- 
sional, computational reconstruction is called a tomogram, and the 
technique is called cryoelectron tomography (cryo-ET).' Cryo-ET, 
which was developed by Wolfgang Baumeister of the Max-Planck 
Institute in Germany, has revolutionized the way in which nanosized. 
intracellular structures can be studied in unfixed, fully hydrated, 
flash-frozen cells. Cryo-ET can also be utilized to examine the 


'This technique is similar in principle to computerized axial tomography (CAT 
scans), which uses a multitude of X-ray images taken at different angles to the body to 
generate a three-dimensional image. Fortunately, the machinery used in radiological 
tomography allows the X-ray source and detector to rotate while the patient can 
remain stationary. 


three-dimensional organization of structures present in vitro, as 
exemplified by the reconstruction of an isolated polysome in the act 
of translation that is shown in Figure 11.50b. Cryo-ET, which can 
deliver a resolution of a few nms, provides an important bridge 
between the cellular and molecular worlds. Two other approaches 
that require the electron microscopic analysis of frozen specimens— 
freeze fracture replication and single particle analysis—are discussed 
later in this section and in Section 18.16. 


Negative Staining 


The electron microscope is also well suited for examining very small 
particulate materials, such as viruses, ribosomes, multisubunit 
enzymes, cytoskeletal elements, and protein complexes. The shapes 
of individual proteins and nucleic acids can also be resolved as long 
as they are made to have sufficient contrast from their surroundings. 
One of the best ways to make such substances visible is to employ 
negative staining procedures in which heavy-metal deposits are 
collected everywhere on a specimen grid except where the particles 
are present. As a result, the specimen stands out by its relative bright- 
ness on the viewing screen. Examples of negatively stained speci- 
mens are shown in Figures 2.41a and 18.17a. 


(a) 


(b) 


FIGURE 18.17 Examples of negatively stained and metal-shadowed 
specimens. Electron micrographs of a tobacco rattle virus after negative 
staining with potassium phosphotungstate (a) or shadow casting with 
chromium (b). 


Source: (a-b) Courtesy of M. K. Corbett. 


Shadow Casting 


Another technique to visualize isolated particles is to have the objects 
cast shadows. The technique is described in FIGURE 18.18. The grids 
containing the specimens are placed in a sealed chamber, which is 
then evacuated by vacuum pump. The chamber contains a filament 
composed of a heavy metal (usually platinum) together with carbon. 
The filament is heated to high temperature, causing it to evaporate 
and deposit a metallic coat over accessible surfaces within the cham- 
ber. As a result, the metal is deposited on those surfaces facing the 
filament, while the opposite surfaces of the particles and the grid 
space in their shadow remain uncoated. When the grid is viewed in 
the electron microscope, the areas in shadow appear bright on the 
viewing screen, whereas the metal-coated regions appear dark. This 
relationship is reversed on the photographic plate, which is a negative 
of the image. The convention for illustrating shadowed specimens is 
to print a negative image in which the particle appears illuminated by 
a bright, white light (corresponding to the coated surface) with a dark 
shadow cast by the particle (Figure 18.17b). The technique provides 
excellent contrast and produces a three-dimensional effect. 


Freeze-Fracture Replication 
and Freeze Etching 


As noted above, a number of electron microscopic techniques have 
been adapted to work with frozen tissues. The ultrastructure of frozen 
cells is often viewed using the technique of freeze-fracture replica- 
tion, which is illustrated in FIGURE 18.19 and demonstrated in the 
Experimental Walkthrough for Chapter 7. Small pieces of tissue are 
placed on a small metal disk and rapidly frozen. The disk is then 
mounted on a cooled stage within a vacuum chamber, and the frozen 
tissue block is struck by a knife edge. The resulting fracture plane 
spreads out from the point of contact, splitting the tissue into two pieces, 
not unlike the way that an axe blade splits a piece of wood in two. 
Consider what might happen as a fracture plane spreads through 
a cell containing a variety of organelles of different composition. 


Evaporation 
of metal from 
platinum wire 


Evacuated 
chamber 


FIGURE 18.18 The procedure used for shadow casting as a means to 
provide contrast in the electron microscope. This procedure is often used 
to visualize small particles, such as the viruses shown in the previous figure. 
DNA and RNA molecules are often made visible by a modification of this 
procedure known as rotary shadowing in which the metal is evaporated at a 
very low angle while the specimen is rotated. 
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These structures tend to cause deviations in the fracture plane, either 
upward or downward, giving the fracture face elevations, depressions, 
and ridges that reflect the contours of the protoplasm traversed. 
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FIGURE 18.19 Procedure for the formation of freeze-fracture replicas 
as described in the text. Freeze etching is an optional step in which a thin 
layer of covering ice is evaporated to reveal additional information about 
the structure of the fractured specimen. 


Consequently, the surfaces exposed by the fracture contain informa- 
tion about the contents of the cell. The goal is to make this informa- 
tion visible. The replication process accomplishes this by using the 
fractured surface as a template on which a heavy-metal layer is 
deposited. The heavy metal is deposited onto the newly exposed sur- 
face of the frozen tissue in the same chamber where the tissue was 
fractured. The metal is deposited at an angle to provide shadows that 
accentuate local topography (FIGURE 18.20), as described in the pre- 
vious section on shadow casting. 

A carbon layer is then deposited on top of the metal layer to 
cement the patches of metal into a solid surface. Once this cast has 
been made, the tissue that provided the template can be thawed, 
removed, and discarded; it is the metal-carbon replica that is 
placed on the specimen grid and viewed in the electron microscope. 
Variations in thickness of the metal in different parts of the replica 
cause variations in the numbers of penetrating electrons to reach 
the viewing screen, producing the necessary contrast in the image. 
As discussed in Chapter 4, fracture planes take the path of least 
resistance through the frozen block, which often carries them 
through the center of cellular membranes. As a result, this tech- 
nique is particularly well suited for examining the distribution of 
integral membrane proteins as they span the lipid bilayer (as in 
Figures 4.15, 7.27, 7.28, and 7.29). Such studies carried out by 
Daniel Branton and others played an important role in the formula- 
tion of the fluid mosaic structure of cellular membranes in the early 
1970s (page 117). 

Freeze-fracture replication by itself is an extremely valuable 
technique, but it can be made even more informative by including a 
step called freeze etching (Figure 18.19). In this step, the frozen, 
fractured specimen, while still in place within the cold chamber, is 
exposed to a vacuum at an elevated temperature for one to a few 
minutes, during which a layer of ice can evaporate (sublime) from 
the exposed surface. Once some of the ice has been removed, the 


FIGURE 18.20 Replica of a freeze-fractured onion root cell showing 
the nuclear envelope (N.E.) with its pores (N.P.), the Golgi complex (G), a 
cytoplasmic vacuole (V), and the cell wall (C.W.). 


Source: Courtesy of Daniel Branton, Harvard University. 


surface of the structure can be coated with heavy metal and carbon 
to create a metallic replica that reveals both the external surface and 
internal structure of cellular membranes. The development by John 
Heuser of deep-etching techniques, in which greater amounts of 
surface ice are removed, led to a fascinating look at cellular orga- 
nelles. Examples of specimens prepared by this technique are shown 
in FIGURES 18.21, 8.38, and 9.38, where it can be seen that the indi- 
vidual parts of the cell stand out in deep relief against the back- 
ground. The technique delivers very high resolution and can be used 
to reveal the structure and the distribution of macromolecular com- 
plexes, such as those of the cytoskeleton, as they are presumed to 
exist within the living cell. 


18.6 Scanning Electron 
Microscopy 


The TEM has been exploited most widely in the examination of the 
internal structure of cells. In contrast, the scanning electron micro- 
scope (SEM) is utilized primarily to examine the surfaces of objects 
ranging in size from a virus to an animal head (FIGURE 18.22). The 
construction and operation of the SEM are very different from that 
of the TEM. The goal of specimen preparation for the SEM is to 
produce an object that has the same shape and surface properties as 
the living state, but is devoid of fluid, as required for observing the 
specimen under vacuum. Because water constitutes such a high per- 
centage of the weight of living cells and is present in association 
with virtually every macromolecule, its removal can have a very 
destructive effect on cell structure. When cells are simply air dried, 


(b) 


FIGURE 18.21 Deep etching. Electron micrograph of ciliary axonemes from 
the protozoan Tetrahymena. The axonemes were fixed, frozen, and fractured, 
and the frozen water at the surface of the fractured block was evaporated 
away, leaving a portion of the axonemes standing out in relief, as visualized in 
this metal replica. The arrow indicates a distinct row of outer dynein arms. 
Source: From Ursula W. Goodenough and John E. Heuser, J. Cell Biol. 
95:800, 1982, Fig. 3, Right Photo. Reproduced with permission of the 
Rockefeller University Press. 


FIGURE 18.22 Scanning electron microscopy. Scanning electron micrographs 
of (a) a T4 bacteriophage (x275,000) and (b) the head of an insect (x40). 

Source: (a) From A. N. Broers, B. J. Panessa, and J. F. Gennaro, Science 189:635, 
1975; © 1975. Reprinted with permission from AAAS; (b) courtesy of H. F. Howden 
and L. E. C. Ling. 
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faces. Specimens to be examined in the SEM are fixed, passed 
through a series of alcohols, and then dried by a process of critical- 
point drying. Critical-point drying takes advantage of the fact that a 
critical temperature and pressure exist for each solvent at which the 
density of the vapor is equal to the density of the liquid. At this point, 
there is no surface tension between the gas and the liquid. The sol- 
vent of the cells is replaced with a liquid transitional fluid (generally 
carbon dioxide), which is vaporized under pressure so that the cells 
are not exposed to any surface tension that might distort their three- 
dimensional configuration. 

Once the specimen is dried, it is coated with a thin layer of 
metal, which makes it suitable as a target for an electron beam. In the 
TEM, the electron beam is focused by the condenser lenses to simul- 
taneously illuminate the entire viewing field. In the SEM, electrons 
are accelerated as a fine beam that scans the specimen. In the TEM, 
electrons pass through the specimen to form the image. In the SEM, 
the image is formed by electrons that are reflected back from the 
specimen (backscattered) or by secondary electrons given off by 
the specimen after being struck by the primary electron beam. These 
electrons strike a detector that is located near the surface of the 
specimen. 

Image formation in the SEM is indirect. In addition to the beam 
that scans the surface of the specimen, another electron beam syn- 
chronously scans the face of a cathode-ray tube, producing an image 
similar to that seen on a television screen. The electrons that bounce 
off the specimen and reach the detector control the strength of 
the beam in the cathode-ray tube. The more electrons collected 
from the specimen at a given spot, the stronger the signal to the tube 
and the greater the intensity of the beam on the screen at the corre- 
sponding spot. The result is an image on the screen that reflects the 
surface topology of the specimen because it is this topology (the 
crevices, hills, and pits) that determines the number of electrons col- 
lected from the various parts of the surface. 

As evident in the micrographs of Figure 18.22, an SEM can 
provide a great range of magnification (from about 15 to 150,000 
times for a standard instrument). Resolving power of an SEM is 
related to the diameter of the electron beam. Newer models are 
capable of delivering resolutions of less than 5nm, which can be 
used to localize gold-labeled antibodies bound to a cell’s surface. 
The SEM also provides remarkable depth of focus, approximately 
500 times that of the light microscope at a corresponding magnifi- 
cation. This property gives SEM images their three-dimensional 
quality. At the cellular level, the SEM allows the observer to appre- 
ciate the structure of the outer cell surface and all of the various 
processes, extensions, and extracellular materials that interact with 
the environment. 


18.7 Atomic Force Microscopy 


Although it is not an electron microscope, the atomic force micro- 
scope (AFM) is a high-resolution scanning instrument that is 
becoming increasingly important in nanotechnology and molecular 
biology. Several AFM-derived images can be found in the text, 
including Figures 4.24a, 5.25, 7.29c, and 9.44. The AFM operates by 
scanning a sharp, microsized tip (probe) over the surface of the spec- 
imen. In one type of AFM, the probe is attached to a tiny oscillating 


beam (or cantilever), whose frequency of oscillations changes as the 
tip encounters variations in the topography of the specimen. These 
changes in oscillation of the beam can be converted into a three- 
dimensional topographic image of the surface of the specimen. 
Unlike other techniques for the determination of molecular struc- 
ture, such as X-ray crystallography and cryo-EM, which average the 
structure of many individual molecules, AFM provides an image of 
each individual molecule as it is oriented in the field (see Figure 5.25). 
Another limitation of electron microscopic and X-ray crystallo- 
graphic technologies is that they are only able to provide static 
images or “snapshots.” The development of high-speed AFM (HS- 
AFM), which is shown schematically in FIGURE 18.23, now allows 
researchers to obtain rapid sequential images of a macromolecule so 
that its activity can be followed over real time. This technique is ele- 
gantly illustrated in the micrographs of Figure 9.44, which depict the 
steps taken by a single myosin V molecule as it moves along an actin 
filament. The opportunity to watch a “movie” of an individual pro- 
tein as it carries on its normal activities is a remarkable achievement 
in a field that has been studying the structure of these molecules for 
more than half a century. 

The probe of an AFM can be used as more than a monitoring 
device; it can also be employed as a “nanomanipulator” to push or 
pull on the specimen in an attempt to measure various mechanical 
properties. This capability was used to show that a single intermedi- 
ate filament can be stretched to several times its normal length. In 
a different protocol, the AFM tip can be coated with ligands for a 


AFM position 
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FIGURE 18.23 High-speed atomic force microscopy. The basic 
elements of an HS-AFM are shown in this schematic drawing. The sample 
is mounted on a component (the piezo actuator) that is attached to the 
microscope stage (not shown). Signals from the actuator cause the 
cantilever to oscillate up and down so that the AFM tip intermittently 
contacts the sample as it scans over the sample’s surface. Forces that 
develop between the sample and AFM tip cause deflections of the tip, 
which are detected by a laser beam that is reflected off of the back of the 
AFM tip. Movements in the position of the laser beam, which reflect 
topographical differences along the sample surface, are recognized by a 
position detector and this information is used to construct an image of 
the specimen. The AFM-based movie of the movement of a myosin V 
molecule shown in Figure 9.44. was “filmed” at approximately 7 frames 
per second. 

Source: C. Veigel and C. F. Schmidt, Nature Revs. Mol. Cell Biol. 12:166, 
2011; copyright 2011. Nature Reviews Molecular Cell Biology by 

Nature Publishing Group. Reproduced with permission of Nature 
Publishing Group in the format reuse in a book/textbook via Copyright 
Clearance Center. 


particular receptor, and measurements can be made of the affinity 
of that receptor for the ligand in question. 


18.8 The Use of Radioisotopes 


A tracer is a substance that reveals its presence in one way or another and 
thus can be localized or monitored during the course of an experi- 
ment. Depending on the substance and the type of experiment, a 
tracer might be fluorescently labeled, spin labeled, density labeled, 
or radioactively labeled. In each case, a labeled group enables a mol- 
ecule to be detected without affecting the specificity of its interac- 
tions. Radioactive molecules, for example, participate in the same 
reactions as nonradioactive species, but their location can be fol- 
lowed and the amount present can be measured. 

The identity of an atom (whether it be an iron atom, a chlorine 
atom, or some other type), and thus its chemical properties, is deter- 
mined by the number of positively charged protons in its nucleus. 
All hydrogen atoms have a single proton, all helium atoms have two 
protons, all lithium atoms have three protons, and so forth. Not all 
hydrogen, helium, or lithium atoms, however, have the same num- 
ber of neutrons. Atoms having the same number of protons and dif- 
ferent numbers of neutrons are said to be isotopes of one another. 
Even hydrogen, the simplest element, can exist as three different iso- 
topes, depending on whether the atom has 0, 1, or 2 neutrons in its 
nucleus. Of these three isotopes of hydrogen, only the one contain- 
ing two neutrons is radioactive; it is tritium CH). 

Isotopes are radioactive when they contain an unstable combi- 
nation of protons and neutrons. Atoms that are unstable tend to break 
apart, or disintegrate, thus achieving a more stable configuration. 
When an atom disintegrates, it releases particles or electromagnetic 
radiation that can be monitored by appropriate instruments. 
Radioactive isotopes occur throughout the periodic table of ele- 
ments, and they can be produced from nonradioactive elements in 
the laboratory. Many biological molecules can be purchased in a 
radioactive state, that is, containing one or more radioactive atoms 
as part of their structure. 

The half-life (t,,.) of a radioisotope is a measure of its instabil- 
ity. The more unstable a particular isotope, the greater the likelihood 
that a given atom will disintegrate in a given amount of time. If one 
starts with one Curie? of tritium, half that amount of radioactive 
material will be left after approximately 12 years (which is the half- 
life of this radioisotope). In the early years of research into photosyn- 
thesis and other metabolic pathways, the only available radioisotope 
of carbon was ''C, which has a half-life of approximately 20 min- 
utes. Experiments with ''C were literally carried out on the run so 
that the amount of incorporated isotope could be measured before 
the substance disappeared. The availability in the 1950s of “C, 
a radioisotope having a half-life of 5700 years, was greeted with great 
celebration. The radioisotopes of greatest importance in cell biologi- 
cal research are listed in Table 18.1, along with information on their 
half-lives and the nature of their radiation. 

Three main forms of radiation can be released by atoms during 
their disintegration. The atom may release an alpha particle, which 
consists of two protons and two neutrons and is equivalent to the 


2A Curie is that amount of radioactivity required to yield 3.7 x10" disintegrations 
per second. 


TABLE 18.1 Properties of a Variety of Radioisotopes 
Used in Biological Research 


Type of particle(s) 
emitted 


Symbol and 
atomic weight 


Half-life 


3H 12.3 yr 
20 min 
5700 yr 
15.1 hr 
14.3 d 
87.1d 
12.4 hr 
152d 
45d 
5.3 yr 
12.8 hr 
250d 
8.0d 


Beta 
Beta 
Beta 
Beta, Gamma 
Beta 
Beta 
Beta, Gamma 
Beta 
Beta, Gamma 
Beta, Gamma 
Beta, Gamma 
Beta, Gamma 
Beta, Gamma 


nucleus of a helium atom; a beta particle, which is equivalent to an 
electron; and/or gamma radiation, which consists of electromagnetic 
radiation or photons. The most commonly used isotopes are beta 
emitters, which are monitored by either of two different methodolo- 
gies: liquid scintillation spectrometry or autoradiography. 

Liquid scintillation spectrometry is used to measure the amount 
of radioactivity in a given sample. The technique is based on the 
property of certain molecules, termed phosphors or scintillants, to 
absorb some of the energy of an emitted particle and release that 
energy in the form of light. In preparing a sample for liquid scintilla- 
tion counting, one mixes the sample with a solution of the phosphor 
in a glass or plastic scintillation vial. This brings the phosphor and 
radioactive isotope into very close contact so that radiation from 
even the weakest beta emitters can be efficiently measured. Once 
mixed, the vial is placed into the counting instrument, where it is 
lowered into a well whose walls contain an extremely sensitive pho- 
todetector. As radioactive atoms within the vial disintegrate, the 
emitted particles activate the scintillants, which emit flashes of light. 
The light is detected by a photocell, and the signal is amplified by a 
photomultiplier tube within the counter. After correcting for back- 
ground noise, the amount of radioactivity present in each vial is dis- 
played on a printout. 

Autoradiography is a broad-based technique used to deter- 
mine where a particular isotope is located, whether in a cell, in a 
polyacrylamide gel, or on a nitrocellulose filter. The importance of 
autoradiography in early discoveries on the synthetic activities of 
cells was described in the pulse-chase experiments on page 260. 
Autoradiography takes advantage of the ability of a particle emitted 
from a radioactive atom to activate a photographic emulsion, much 
like light or X-rays activate the emulsion that coats a piece of film. If 
the photographic emulsion is brought into close contact with a radi- 
oactive source, the particles emitted by the source leave tiny, black 
silver grains in the emulsion after photographic development. 
Autoradiography is used to localize radioisotopes within tissue sec- 
tions that have been immobilized on a slide or TEM grid. The steps 
involved in the preparation of a light microscopic autoradiograph 
are as follows. The emulsion is applied to the sections on the slide or 
grid as a very thin overlying layer, and the specimen is put into a 
lightproof container to allow the emulsion to be exposed by the 
emissions. The longer the specimen is left before development, the 
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oped slide or grid is examined in the microscope, the location of the 
silver grains in the layer of emulsion just above the tissue indicates 
the location of the radioactivity in the underlying cells. An electron 
microscopic micrograph using this technique is shown in Figure 8.3a. 


18.9 Cell Culture 


Throughout this book we have stressed the approach in cell biol- 
ogy that attempts to understand particular processes by their anal- 
ysis in a simplified, controlled, in vitro system. The same approach 
can be applied to the study of cells themselves because they too 
can be removed from the influences they are normally subject to 
within a complex multicellular organism. Learning to grow cells 
outside the organism, that is, in cell culture, has proved to be one 
of the most valuable technical achievements in the entire study of 
biology. A quick glance through any journal in cell biology reveals 
that the majority of the articles describe research carried out on 
cultured cells. The reasons for this are numerous: Cultured cells 
can be obtained in large quantity; most cultures contain only a 
single type of cell; many different cellular activities, including 
endocytosis, cell movement, cell division, membrane trafficking, 
and macromolecular synthesis, can be studied in cell culture; cells 
can differentiate in culture; and cultured cells respond to treat- 
ment with drugs, hormones, growth factors, and other active 
substances. 

The early tissue culturists employed media containing a great 
variety of unknown substances. Cell growth was accomplished by 
adding fluids obtained from living systems, such as lymph, blood 
serum, or embryo homogenates. It was found that cells required a 
considerable variety of nutrients, hormones, growth factors, and 
cofactors to remain healthy and proliferate. Even today, most culture 
media contain large amounts of serum. The importance of serum (or 
the growth factors it contains) on the proliferation of cultured cells is 
shown in the cell-growth curves in Figure 16.4. 

One of the primary goals of cell culturists has been to develop 
defined, serum-free media that supports the growth of cells. Using a 
pragmatic approach in which combinations of various ingredients 
are tested for their ability to support cell growth and proliferation, 
a growing number of cell types have been successfully cultured in 
“artificial” media that lack serum or other natural fluids. As would 
be expected, the composition of these chemically defined media is 
relatively complex; they consist of a mixture of nutrients and vita- 
mins, together with a variety of purified proteins, including insulin, 
epidermal growth factor, and transferrin (which provides the cells 
with iron). 

Because they are so rich in nutrients, tissue culture media are a 
very inviting habitat for the growth of microorganisms. To prevent 
bacteria from contaminating cell cultures, tissue culturists must go 
to great lengths to maintain sterile conditions within their working 
space. This is accomplished by using sterile gloves, sterilizing all 
supplies and instruments, employing low levels of antibiotics in the 
media, and conducting activities within a sterile hood. 

The first step in cell culture is to obtain the cells. In most cases, 
one need only remove a vial of frozen, previously cultured cells from 
a tank of liquid nitrogen, thaw the vial, and transfer the cells to the 
waiting medium. A culture of this type is referred to as a secondary 
culture because the cells are derived from a previous culture. In a 


primary culture, on the other hand, the cells are obtained directly 
from the organism. Most primary cultures of animal cells are 
obtained from embryos, whose tissues are more readily dissociated 
into single cells than those of adults. Dissociation is accomplished 
with the aid of a proteolytic enzyme, such as trypsin, which digests 
the extracellular domains of proteins that mediate cell adhesion 
(Chapter 7). The tissue is then washed free of the enzyme and usu- 
ally suspended in a saline solution that lacks Ca’* ions and contains 
a substance, such as ethylenediamine tetraacetate (EDTA), that 
binds (chelates) calcium ions. As discussed in Chapter 7, calcium 
ions play a key role in cell-cell adhesion, and their removal from tis- 
sues greatly facilitates the separation of cells. 

Normal (nonmalignant) cells can divide a limited number of 
times (typically 50 to 100) before they undergo senescence and death 
(page 475). Because of this, many of the cells that are commonly 
used in tissue culture studies have undergone genetic modifications 
that allow them to be grown indefinitely. Cells of this type are 
referred to as a cell line, and they typically grow into malignant 
tumors when injected into susceptible laboratory animals. The fre- 
quency with which a normal cell growing in culture spontaneously 
transforms into a cell line depends on the organism from which it 
was derived. Mouse cells, for example, transform at a relatively high 
frequency; human cells transform only rarely, if ever. Human cell 
lines (e.g., HeLa cells, discussed in the opener of Chapter 16) are 
typically derived from human tumors or from cells treated with 
cancer-causing viruses or chemicals. 

A number of laboratories are moving away from traditional two- 
dimensional culture systems, where cells are grown on the flat sur- 
face of a culture dish, to three-dimensional cultures, in which cells are 
grown in a three-dimensional matrix consisting of synthetic and/or 
natural extracellular materials. These materials can be purchased as 
products containing proteins and other components derived from 
natural basement membranes (Figures 7.4 and 7.8). Cells within the 
body do not live on flat, hard-plastic surfaces, and, consequently, it is 
thought that three-dimensional matrices provide a much more natu- 
ral environment for cultured cells. Consequently, the morphology 
and behavior of cells in these three-dimensional environments are 
more like those of cells observed within the tissues of the body. This 
is reflected in differences in the cytoskeletal organization, types of 
cell adhesions, signaling activities, and states of differentiation of 
cells in the two types of culture systems. In addition, 3D culture 
systems are also better suited to study cell-cell interactions because 
they allow cells to come into contact with one another at any point 
along their surface. FIGURE 18.24 shows images of human fibroblasts 
grown on either a flat surface or within a three-dimensional 
matrix. The cell in Figure 18.24a has pressed itself against the 
substratum, creating a highly unnatural upper (dorsal) and lower 
(ventral) surface with unusually broad and flattened lamellipodia. In 
contrast, the same type of cell grown in 3D culture (Figure 18.24b) 
has a typical spindle-shaped structure without any obvious surface 
differentiation. 

Many different types of plant cells can also grow in culture. In 
one approach, plant cells are treated with the enzyme cellulase, 
which digests away the surrounding cell wall, releasing the naked 
cell, or protoplast. Protoplasts can then be grown in a chemically 
defined medium that promotes their growth and division. Under 
suitable conditions, the cells can grow into an undifferentiated 
clump of cells, called a callus, which can be induced to develop 
shoots from which a plant can regenerate. In an alternate approach, 
the cells in leaf tissue can be induced by hormone treatment to lose 


(a) 


(b) 


FIGURE 18.24 A comparison of the morphology of cells growing in 2D versus 3D cultures, (a) A human fibroblast growing on a flat, fibronectin-coated 
substratum in 2D culture assumes a highly flattened shape with broad lamellipodia. Integrin-containing adhesions appear white. (b) The same type of cell 
growing in a 3D matrix assumes a nonflattened, spindle-shaped morphology. The extracellular fibronectin matrix appears in blue. Bar equals 10 um. 

Source: (a—b) From Edna Cukierman, Cell 130:603, 2007, Fig. 1, reprinted with permission from Elsevier. Courtesy of Kenneth M. Yamada, National Institute 


of Dental and Craniofacial Research, NIH. 


their differentiated properties and develop into callus material. The 
callus can then be transferred to liquid media to start a cell culture. 


18.10 The Fractionation of 
a Cell’'s Contents by Differential 
Centrifugation 


Most cells contain a variety of different organelles. If one sets out to 
study a particular function of mitochondria or to isolate a particular 
enzyme from the Golgi complex, it is useful to first isolate the rele- 
vant organelle in a purified state. Isolation of a particular organelle 
in bulk quantity is generally accomplished by the technique of dif- 
ferential centrifugation, which depends on the principle that, as 
long as they are more dense than the surrounding medium, particles 
of different size and shape travel toward the bottom of a centrifuge 
tube at different rates when placed in a centrifugal field. 

To carry out this technique, cells are first broken open by 
mechanical disruption, typically using a mechanical homogenizer. 
Cells are homogenized in an isotonic buffered solution (often con- 
taining sucrose), which prevents the rupture of membrane vesicles 
due to osmosis. The homogenate is then subjected to a series of 
sequential centrifugations at increasing centrifugal forces. The steps 
in this technique were discussed in Chapter 8 and illustrated in 
Figure 8.5. Initially, the homogenate is subjected to low centrifugal 
forces for a short period of time so that only the largest cellular 
organelles, such as nuclei (and any remaining whole cells), are sedi- 
mented into a pellet. At greater centrifugal forces, relatively large 
cytoplasmic organelles (mitochondria, chloroplasts, lysosomes, 
and peroxisomes) can be spun out of suspension (Figure 8.5). In 


subsequent steps, microsomes (the fragments of vacuolar and retic- 
ular membranes of the cytosol) and ribosomes are removed from 
suspension. This last step requires the ultracentrifuge, which can 
generate speeds of 75,000 revolutions per minute, producing forces 
equivalent to 500,000 times that of gravity. Once ribosomes have 
been removed, the supernatant consists of the cell’s soluble phase 
and those particles too small to be removed by sedimentation. 

The initial steps of differential centrifugation do not yield pure 
preparations of a particular organelle, so that further steps are usu- 
ally required. In many cases, further purification is accomplished by 
centrifugation of one of the fractions through a density gradient, as 
shown in FIGURE 18.25, which distributes the contents of the sample 
into various layers according to their density. The composition of 
various fractions can be determined by microscopic examination, or 


Resuspended 
material from 
20,000 g pellet 


Lysosomes filled 


1 molar — | with Triton WR1339 
sucrose 65,000 g/2 hr (1.12 gram/ml) 
Mitochondria 
Sucrose (1.18 gram/ml) 
density 
2 molar — gradient Peroxisomes 
sucrose (1.23 gram/ml) 


FIGURE 18.25 Purification of subcellular fractions by density-gradient 
equilibrium centrifugation. In this particular example, the medium is 
composed of a continuous sucrose-density gradient, and the different 
organelles sediment until they reach a place in the tube equal to their own 
density, where they form bands. The 20,000 g pellet is obtained as shown 
in Figure 8.5. 
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specific for particular organelles. 

Cellular organelles isolated by differential centrifugation retain 
a remarkably high level of normal activity, as long as they are not 
exposed to denaturing conditions during their isolation. Organelles 
isolated by this procedure can be used in cell-free systems to study a 
wide variety of cellular activities, including the synthesis of mem- 
brane-bound proteins (page 263), the formation of transport vesicles 
(page 279), DNA synthesis (page 527), and the transport of solutes 
and development of ionic gradients (see Figure 5.23). 


18.11 Purification and 
Characterization of Proteins by 
Liquid Column Chromatography 


During the course of this book, we have considered the properties of 
many different proteins. Before information about the structure or 
function ofa particular protein can be obtained, that protein must be 
isolated in a relatively pure state. Because most cells contain thou- 
sands of different proteins, the purification of a single species can be 
a challenging mission, particularly if the protein is present in the cell 
at low concentration. In this section, we will briefly survey a few of 
the techniques used to purify proteins. 

Purification of a protein is generally performed by the stepwise 
removal of contaminants. Two proteins may be very similar in one 
property, such as overall charge, and very different in another prop- 
erty, such as molecular size or shape. Consequently, the complete 
purification of a given protein usually requires the use of successive 
techniques that take advantage of different properties of the proteins 
being separated. Purification is measured as an increase in specific 
activity, which is the ratio of the amount of that protein to the total 
amount of protein present in the sample. Some identifiable feature of 
the specific protein must be utilized as an assay to determine the 
relative amount of that protein in the sample. If the protein is an 
enzyme, its catalytic activity may be used as an assay to monitor puri- 
fication. Alternatively, assays can be based on immunologic, electro- 
phoretic, electron microscopic, or other criteria. Measurements of 
total protein in a sample can be made using various properties, includ- 
ing total nitrogen, which can be very accurately measured and is quite 
constant at about 16 percent of the dry weight for all proteins. 

Chromatography is a term for a variety of techniques in 
which a mixture of dissolved components is fractionated as it moves 
through some type of porous matrix. In liquid chromatographic 
techniques, components in a mixture can become associated with 
one of two alternative phases: a mobile phase, consisting of a moving 
solvent, and an immobile phase, consisting of the matrix through 
which the solvent is moving.’ In the chromatographic procedures 
described below, the immobile phase consists of materials that are 
packed into a column. The proteins to be fractionated are dissolved 
in a solvent and then passed through the column. The materials that 
make up the immobile phase contain sites to which the proteins 
in solution can bind. As individual protein molecules interact with 
the materials of the matrix, their progress through the column is 
retarded. Thus the greater the affinity of a particular protein for the 


Liquid chromatography is distinguished from gas chromatography in which the 
mobile phase is represented by an inert gas. 


matrix material, the slower its passage through the column. Because 
different proteins in the mixture have different affinity for the 
matrix, they are retarded to different degrees. As the solvent passes 
through the column and drips out the bottom, it is collected as frac- 
tions in a series of tubes. Those proteins in the mixture with the least 
affinity for the column appear in the first fractions to emerge from 
the column. The resolution of many chromatographic procedures 
has been improved with the development of high performance liquid 
chromatography (HPLC), in which long, narrow columns are used, 
and the mobile phase is forced under high pressure through a tightly 
packed noncompressible matrix composed of exceptionally small 
(e.g., 5 um diameter) particles. 


lon-Exchange Chromatography 


Proteins are large, polyvalent electrolytes, and it is unlikely that many 
proteins in a partially purified preparation have the same overall 
charge. The overall charge of a protein is a summation of all the indi- 
vidual charges of its component amino acids. Because the charge of 
each amino acid depends on the pH of the medium (see Figure 2.27), 
the charge of each protein also depends on the pH. As the pH is low- 
ered, negatively charged groups become neutralized and positively 
charged groups become more numerous. The opposite occurs as the 
pH is increased. A pH exists for each protein at which the total num- 
ber of negative charges equals the total number of positive charges. 
This pH is the isoelectric point, at which the protein is neutral. 
The isoelectric point of most proteins is below pH 7. 

Ionic charge is used as a basis for purification in a variety of tech- 
niques, including ion-exchange chromatography. Ion-exchange 
chromatography depends on the ionic bonding of proteins to an 
inert matrix material, such as cellulose, containing covalently linked 
charged groups. Two of the most commonly employed ion-exchange 
resins are diethylaminoethyl (DEAE) cellulose and carboxymethyl 
(CM) cellulose. DEAE-cellulose is positively charged and therefore 
binds negatively charged molecules; it is an anion exchanger. CM- 
cellulose is negatively charged and acts as a cation exchanger. The 
resin is packed into a column, and the protein solution is allowed to 
percolate through the column in a buffer whose composition pro- 
motes the binding of some or all of the proteins to the resin. Proteins 
are bound to the resin reversibly and can be displaced by increasing 
the ionic strength of the buffer (which adds small ions to compete 
with the charged groups of the macromolecules for sites on the resin) 
and/or changing its pH. Proteins are eluted from the column in order 
from the least strongly bound to the most strongly bound. FIGURE 18.26 
shows a schematic representation of the separation of two protein 
species by stepwise elution from an ion-exchange column. 


Gel Filtration Chromatography 


Gel filtration separates proteins (or nucleic acids) primarily on the 
basis of their effective size (hydrodynamic radius). Like ion-exchange 
chromatography, the separation material consists of tiny beads that 
are packed into a column through which the protein solution slowly 
passes. The materials used in gel filtration are composed of cross- 
linked polysaccharides (dextrans or agarose) of different porosity, 
which allow proteins to diffuse in and out of the beads. The tech- 
nique is best illustrated by example (FIGURE 18.27). 

Suppose one is attempting to purify a globular protein having a 
molecular mass of 125,000 daltons. This protein is present in solution 
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FIGURE 18.26 lon-exchange chromatography. The separation of two 
proteins by DEAE-cellulose. In this case, a positively charged ion-exchange 
resin is used to bind the negatively charged protein. 


with two contaminating proteins of similar shape, one much larger 
at 250,000 daltons and the other much smaller at 75,000 daltons. 
One way the protein might be purified is to pass the mixture through 
a column of Sephadex G-150 beads, which allows entry to globular 
proteins that are less than about 200 kDa. When the protein mixture 
passes through the column bed, the 250 kDa protein is unable to 
enter the beads and remains dissolved in the moving solvent phase. 
As a result, the 250 kDa protein is eluted as soon as the preexisting 
solvent in the column (the bed volume) has dripped out. In contrast, 
the other two proteins can diffuse into the interstices within the 
beads and are retarded in their passage through the column. As 
more and more solvent moves through the column, these proteins 
move down its length and out the bottom, but they do so at different 
rates. Among those proteins that enter the beads, smaller species are 
retarded to a greater extent than larger ones. Consequently, the 125- 
kDa protein is eluted in a purified state, while the 75-kDa protein 
remains in the column. 


Mixture of 
3 proteins 


Porous 
beads 


FIGURE 18.27 Gel filtration chromatography. The separation of three 
globular proteins having different molecular mass, as described in the text. 
Among proteins of similar basic shape, larger molecules are eluted before 
smaller molecules. 


Affinity Chromatography 


Techniques described up to this point utilize the bulk properties of a 
protein to effect purification or fractionation. Another purification 
technique called affinity chromatography takes advantage of 
the unique structural properties of a protein, allowing one protein 
species to be specifically withdrawn from solution while all others 
remain behind in solution (FIGURE 18.28). Proteins interact with 
specific substances: enzymes with substrates, receptors with ligands, 
antigens with antibodies, and so forth. Each of these types of pro- 
teins can be removed from solution by passing a mixture of proteins 
through a column in which the specific interacting molecule (sub- 
strate, ligand, antibody, etc.) is covalently linked to an inert, immo- 
bilized material (the matrix, Figure 18.284). If, for example, an 
impure preparation of an acetylcholine receptor is passed through a 
column containing agarose beads to which an acetylcholine ana- 
logue is attached, the receptor binds specifically to the beads as long 
as the conditions in the column are suitable to promote the interac- 
tion (page 148). Once all the contaminating proteins have passed 
through the column and out the bottom, the acetylcholine receptor 
molecules can be displaced from their binding sites on the matrix by 
changing the ionic composition and/or pH of the solvent in the col- 
umn. Thus, unlike the other chromatographic procedures that sepa- 
rate proteins on the basis of size or charge, affinity chromatography 
can achieve a near-total purification of the desired molecule in a 
single step. In recent years, affinity chromatography has been widely 
used to purify proteins that have been genetically modified to 
include a specific protein “tag? on its N- or C-terminus. Common 
tags include TAP tags (discussed in Section 2.15), glutathione-S- 
transferase (GST) tags or a His tag (composed of a string of six histi- 
dines). His-tagged proteins are purified using a resin onto which 
nickel ions have been immobilized, while GST-tagged proteins are 
purified based on their affinity for glutathione-immobilized resin. 
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FIGURE 18.28 Affinity chromatography. (a) Schematic representation of 
the coated agarose beads to which only a specific protein can combine. 
(b) Steps in the chromatographic procedure. 
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Protein Interactions 


One of the ways to learn about the function of a protein is to iden- 
tify the proteins with which it interacts. Several techniques are 
available for determining which proteins in a cell might be capable 
of interacting with a given protein that has already been identified. 
One of these techniques was just discussed: affinity chromatogra- 
phy. Another technique uses antibodies. Consider, for example, that 
protein A, which has already been identified and purified, is part of 
a complex with two other proteins in the cytoplasm, proteins B and 
C. Once protein A has been purified, an antibody against this 
protein can be obtained and used as a probe to bind and remove 
protein A from solution. If a cell extract is prepared that contains 
the A-B-C protein complex, and the extract is incubated with the 
anti-A antibody, then binding of the antibody to the A protein will 
result in the coprecipitation of other proteins that are bound to A, in 
this case proteins B and C, which can then be identified. 
Coprecipitation of DNA fragments using the ChIP technique was 
illustrated in Figure 12.46. 

One technique that has been widely used to search for pro- 
tein—protein interactions is the yeast two-hybrid system, which 
was invented in 1989 by Stanley Fields and Ok-kyu Song at the 
State University of New York in Stony Brook. The technique is 
illustrated in FIGURE 18.29 and depends on the expression of a 
reporter gene, such as [-galactosidase (lacZ), whose activity is 
readily monitored by a test that detects a color change when the 
enzyme is present in a population of yeast cells. Expression of the 
lacZ gene in this system is activated by a particular protein—a tran- 
scription factor—that contains two domains, a DNA-binding 
domain and an activation domain (Figure 18.29a). The DNA-bind- 
ing domain mediates binding to the promoter, and the activation 
domain mediates interaction with other proteins involved in the 
activation of gene expression. Both domains must be present for 
transcription to occur. To employ the technique, two different 
types of recombinant DNA molecules are prepared. One DNA 
molecule contains a segment of DNA encoding the DNA-binding 
domain of the transcription factor linked to a segment of DNA 
encoding the “bait” protein (X). The bait protein is the protein that 
has been characterized and the one for which potential binding 
partners are being sought. When this recombinant DNA is 
expressed in a yeast cell, a hybrid protein such as that depicted in 
Figure 18.29b is produced in the cell. The other DNA molecule 
contains a portion of the transcription factor encoding the activa- 
tion domain linked to DNA encoding an unknown protein (Y). 
Such DNAs (or cDNAs as they are called) are prepared from 
mRNAs by reverse transcriptase as described on page 731. Let’s 
assume that Y is a protein capable of binding to the bait protein. 
When a recombinant DNA encoding Y is expressed in a yeast cell, 
a hybrid protein such as that depicted in Figure 18.29c is produced 
in the cell. When produced in a cell alone, neither the X- nor Y- 
containing hybrid protein is capable of activating transcription of 
the lacZ gene (Figure 18.29b,c). However, if both of these particular 
recombinant DNA molecules are introduced into the same yeast 
cell (as in Figure 18.29d), the X and Y proteins can interact with 
one another to reconstitute a functional transcription factor, 
an event that can be detected by the cell’s ability to produce 
B-galactosidase. Using this technique, researchers are able to “fish” 
for proteins encoded by unknown genes that are capable of inter- 
acting with the “bait” protein. The Y2H methodology is particu- 
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FIGURE 18.29 Use of the yeast two-hybrid system. This test for 
protein-protein interaction depends on a cell being able to put together 
two parts ofa transcription factor. (a) The two parts of the transcription 
factor—the DNA-binding domain and the activation domain—are seen 
here as the transcription factor binds to the promoter of a gene (lacZ) 
encoding B-galactosidase. (b) In this case, a yeast cell has synthesized the 
DNA-binding domain of the transcription factor linked to a known “bait” 
protein X. This complex cannot activate transcription. (c) In this case, a 
yeast cell has synthesized the activation domain of the transcription factor 
linked to an unknown “fish” protein Y. This complex cannot activate 
transcription. (d) In this case, a yeast cell has synthesized both X and Y 
protein constructs, which reconstitutes a complete transcription factor, 
allowing lacZ expression, which is readily detected. (e) If the second DNA 
had encoded a protein, for example, Z, that could not bind to X, no 
expression of the reporter gene would have been detected. 


larly well-suited for screening large numbers of proteins and its use 
in large-scale (high-throughput) proteomic studies is discussed in 
Section 2.15. In recent years, the two-hybrid approach pioneered in 


yeast has been adapted for use in mammalian cells. To date, studies 
using mammalian two-hybrid systems tend to be focused on the 
interactions of specific proteins rather than screening of large-scale 
protein libraries. 


18.13 Characterization of 
Proteins by Polyacrylamide Gel 
Electrophoresis 


Another powerful technique that is widely used to separate proteins 
is electrophoresis. Electrophoresis depends on the ability of 
charged molecules to migrate when placed in an electric field. The 
electrophoretic separation of proteins is usually accomplished using 
polyacrylamide gel electrophoresis (PAGE), in which the pro- 
teins are driven by an applied current through a gelated matrix. The 
matrix is composed of polymers of a small organic molecule (acryla- 
mide) that is cross-linked to form a molecular sieve. A polyacryla- 
mide gel may be formed as a thin slab between two glass plates or as 
a cylinder within a glass tube. Once the gel has polymerized, the slab 
(or tube) is suspended between two compartments containing 
buffer in which opposing electrodes are immersed. In a slab gel, the 
concentrated, protein-containing sample is layered in slots along the 
top of the gel, as shown in step 1 of FIGURE 18.30. The protein sam- 
ple is prepared in a solution containing sucrose or glycerol, whose 
density prevents the sample from mixing with the buffer in the 
upper compartment. A voltage is then applied between the buffer 
compartments, and current flows across the slab, causing the pro- 
teins to move toward the oppositely charged electrode (step 2). 
Separations are typically carried out using alkaline buffers, which 
make the proteins negatively charged and cause them to migrate 
toward the positively charged anode at the opposite end of the gel. 
Following electrophoresis, the slab is removed from the glass plates 
and stained (step 3). 

The relative movement of proteins through a polyacrylamide 
gel depends on the charge density (charge per unit of mass) of the 
molecules. The greater the charge density, the more forcefully the 
protein is driven through the gel, and thus the more rapid its rate of 
migration. But charge density is only one important factor in PAGE 
fractionation; size and shape also play a role. Polyacrylamide forms 
a cross-linked molecular sieve that entangles proteins passing 
through the gel. The larger the protein, the more it becomes entan- 
gled, and the slower it migrates. Shape is also a factor because com- 
pact globular proteins move more rapidly than elongated fibrous 
proteins of comparable molecular mass. The concentration of 
acrylamide (and cross-linking agent) used in making the gel is also 
an important factor. The lower the concentration of acrylamide, the 
less the gel becomes cross-linked, and the more rapidly a given pro- 
tein molecule migrates. A gel containing 5 percent acrylamide might 
be useful for separating proteins of 60 to 250 kDa, whereas a gel of 15 
percent acrylamide might be useful for separating proteins of 10 
to 50 kDa. 

The progress of electrophoresis is followed by watching the 
migration of a charged tracking dye that moves just ahead of the 
fastest proteins (step 2, Figure 18.30). After the tracking dye has 
moved to the desired location, the current is turned off, and the gel 
is removed from its container. Gels are typically stained with 
Coomassie Blue or silver stain to reveal the locations of the pro- 
teins. If the proteins are radioactively labeled, their locations can be 


Sample with tracking [| 
dye being 7 4 
loaded into well -_— = = of 


Polyacrylamide gel slab 
positioned between two 


glass plates 
o | | 


Upper reservoir with buffer | 


Negative electrode 
(cathode) 


Tracking dye 
shows extent 
of electrophoretic 
movement 


Positive electrode 
(anode) 


@ Lower reservoir with buffer 


Larger / / 
Ss / 
«~ »& / / 
~ = SS a / 
~N 
— x SS / 
= > / 
SS = = i 
Sac as ae - = 
> ~ ~ = 
Smaller ~ \ Re, es 2 
Se Ss ys SN 
~N x ae 
~ ~ A 
eS 
~ 
ae 
Staining solution in tray e Gel 


FIGURE 18.30 Polyacrylamide gel electrophoresis. The protein samples 
are typically dissolved in a sucrose solution whose density prevents the 
sample from mixing with the buffer and then loaded into the wells with a 
fine pipette as shown in step 1. In step 2, a direct current is applied across 
the gel, which causes the proteins to move into the polyacrylamide along 
parallel lanes. When carried out in the detergent SDS, which is usually the 
case, the proteins move as bands at rates that are inversely proportional to 
their molecular mass. Once electrophoresis is completed, the gel is 
removed from the glass frame and stained in a tray (step 3). 


determined by pressing the gel against a piece of X-ray film to 
produce an autoradiograph, or the gel can be sliced into fractions 
and individual proteins isolated. Alternatively, the proteins in the 
gel can be transferred by a second electrophoretic procedure to a 
nitrocellulose membrane to form a blot (page 722). Proteins 
become absorbed onto the surface of the membrane in the same 
relative positions they occupied in the gel. In a Western blot, the 
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SDS-PAGE 


Polyacrylamide gel electrophoresis (PAGE) is usually carried out in 
the presence of the negatively charged detergent sodium dodecyl sul- 
fate (SDS), which binds in large numbers to all types of protein mol- 
ecules (page 126). The electrostatic repulsion between the bound SDS 
molecules causes the proteins to unfold into a similar rod-like shape, 
thus eliminating differences in shape as a factor in separation. The 
number of SDS molecules that bind to a protein is roughly propor- 
tional to the protein’s molecular mass (about 1.4 g SDS/ g protein). 
Consequently, each protein species, regardless of its size, has an 
equivalent charge density and is driven through the gel with the same 
force. However, because the polyacrylamide is highly cross-linked, 
larger proteins are held up to a greater degree than smaller proteins. 
As a result, proteins become separated by SDS-PAGE on the basis of 
a single property—their molecular mass. In addition to separating 
the proteins in a mixture, SDS-PAGE can be used to determine the 
molecular mass of the various proteins by comparing the positions of 
the bands to those produced by proteins of known size. Examples of 
SDS-PAGE are shown on Figure 4.32c and on page 149. 


Two-Dimensional Gel Electrophoresis 


In 1975, a technique called two-dimensional gel electrophoresis was 
developed by Patrick O’Farrell at the University of California, San 
Francisco, to separate complex mixtures of proteins using two differ- 
ent properties of the molecules. Proteins are first separated in a 
tubular gel according to their isoelectric point (page 712) by a tech- 
nique called isoelectric focusing. After separation, the gel is removed 
and placed on top of a slab of SDS-saturated polyacrylamide and 


pH 
7.0 6.8 6.55 6.3 6.1 


7.45 7.3 7.2 7.1 


6.0 5.9 


Molecular weight x 10-% 


_ 15 


FIGURE 18.31 Two-dimensional gel electrophoresis. A two-dimensional 
polyacrylamide gel of HeLa cell nonhistone chromosomal proteins labeled 
with [35S]methionine. Over a thousand different proteins can be resolved by 
this technique. 

Source: From J. L. Peterson and E. H. McConkey, J. Biol. Chem. 251:550, 
1976. © 1976 the American Society for Biochemistry and Molecular Biology. 


subjected to SDS-PAGE. The proteins move into the slab gel and 
become separated according to their molecular mass (FIGURE 18.31). 
Once separated, individual proteins can be removed from the gel 
and digested into peptide fragments that can be analyzed by mass 
spectrometry. The resolution of the technique is sufficiently high to 
distinguish most of the proteins in a cell. Because of its great resolving 
power, two-dimensional gel electrophoresis is ideally suited to detect 
changes in the proteins present in a cell under different conditions, at 
different stages in development or the cell cycle, or in different organ- 
isms. The technique, however, is not suitable for distinguishing 
among proteins that have high molecular mass, that are highly 
hydrophobic, or that are present at very low copy numbers per cell. 


18.14 Characterization of Proteins 
by Spectrometry 


One of the simplest and most widely used methods to determine the 
amount of protein (or nucleic acid) present in a given solution is to 
measure the amount of light of a specific wavelength that is absorbed 
by that solution. The instrument used for this measurement is a 
spectrophotometer. To make this type of measurement, the solu- 
tion is placed in a special, flat-sided, quartz container (quartz is used 
because, unlike glass, it does not absorb ultraviolet light), termed a 
cuvette, which is then placed in the light beam of the spectropho- 
tometer. The amount of light that passes through the solution unab- 
sorbed (i.e., the transmitted light) is measured by photocells on the 
other side of the cuvette. 

Of the 20 amino acids incorporated into proteins, two of them, 
tyrosine and phenylalanine, absorb light in the ultraviolet range with 
an absorbance maximum at about 280 nm. If the proteins being stud- 
ied have a typical percentage of these amino acids, then the absorb- 
ance of the solution at this wavelength provides a measure of protein 
concentration. Alternatively, one can use a variety of chemical assays, 
such as the Lowry or Biuret technique, in which the protein in solu- 
tion is engaged in a reaction that produces a colored product whose 
concentration is proportional to the concentration of protein. 


18.15 Characterization of Proteins 
by Mass Spectrometry 


As discussed in Section 2.15, the emerging field of proteomics depends 
heavily on the analysis of proteins by mass spectrometry. Mass spec- 
trometers are analytical instruments used primarily to measure the 
masses of molecules, determine chemical formulas and molecular 
structure, and identify unknown substances. Mass spectrometers 
accomplish these feats by converting the substances in a sample into 
positively charged, gaseous ions, which are accelerated through a 
curved tube toward a negatively charged plate (FIGURE 18.32). As the 
ions pass through the tube, they are subjected to a magnetic field that 
causes them to separate from one another according to their molecu- 
lar mass [or more precisely according to their mass-to-charge (m/z) 
ratio]. The ions strike an electronic detector located at the end of the 
tube. Smaller ions travel faster and strike the detector more rapidly 
than larger ions. The input to the detector is converted into a series of 
peaks of ascending m/z ratio (as in Figure 2.47). 
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FIGURE 18.32 Principles of operation of a mass spectrometer. 
Source: From J. E. Brady, J. Russell, and J. R. Holum, Chemistry, 3d ed.; 
copyright © 2000, John Wiley and Sons, Inc. Reprinted with permission of 
John Wiley and Sons, Inc. 


Mass spectrometers have been a favorite instrument of chemists 
for many years, but it has only been in the past two decades that 
biologists have discovered their remarkable analytic powers. Now, 
using mass spectrometry (MS), protein biochemists are able to rap- 
idly identify the proteins present in a particular type of cell, orga- 
nelle, or protein complex. To carry out this analysis, the protein 
sample is generally digested with trypsin, fractionated by liquid 
chromatography, and the peptides introduced into a mass spec- 
trometer where they are gently ionized and made gaseous by one of 
two procedures. Development of these peptide ionization tech- 
niques has been the key in adapting MS to the study of proteins. In 
one procedure, called matrix-assisted laser desorption ionization 
(MALDI), the protein sample is applied as part of a crystalline 
matrix, which is irradiated by a laser pulse. The energy of the laser 
excites the matrix and the absorbed energy converts the peptides 
into gaseous ions. In an alternate procedure, called electrospray ioni- 
zation (ESI), an electric potential is applied to a peptide solution, 
causing the peptides to ionize and the liquid to spray as a fine mist 
of charged particles that enter the spectrometer. Because it acts on 
molecules in solution, ESI is well suited to ionize peptides prepared 
from proteins fractionated by a widely employed liquid chromatog- 
raphy technique. 

Once the molecular masses of the peptides in the sample have 
been determined, the proteins can be identified by a database search 
as discussed in Section 2.15. If each of the proteins cannot be identi- 
fied unambiguously, one or more of the peptides generated by tryp- 
tic digestion can be fragmented* in a second step and subjected to 
another round of mass spectrometry. This two-step procedure 
(called tandem MS, or MS/MS) yields the amino acid sequence of 
the peptides, which can then be assembled into the set of proteins 
from which they are derived. MS/MS is so powerful that complex 
mixtures of hundreds of unknown proteins can be digested and sub- 
jected to mass spectrometry and the identity of each of the proteins 
in the mixture determined. MS/MS can also be used to identify the 
specific posttranslational modifications that are present on a par- 
ticular protein under a particular set of physiological conditions. 


“Fragmentation is accomplished within the mass spectrometer by collision of the 
peptides with an inert gas. The energy of collision breaks peptide bonds to produce a 
random collection of fragments of the original peptide. The amino acid sequence of 
each fragment, and hence that of the original peptide, can be determined by searching 
a database containing the masses of theoretical fragments having every possible 
sequence of amino acids that can be formed from the proteins encoded by that 
genome. 


18.16 Determining the Structure 
of Proteins and Multisubunit 
Complexes 


X-ray crystallography (or X-ray diffraction) utilizes protein crystals, 
which are bombarded with a fine beam of X-rays (FIGURE 18.33). 
The radiation that is scattered (diffracted) by the electrons of the 
protein’s atoms strikes an electron-sensitive detector placed behind 
the crystal. The diffraction pattern produced by a crystal is deter- 
mined by the structure within the protein; the large number of mol- 
ecules in the crystal reinforces the reflections, causing it to behave as 
if it were one giant molecule. The positions and intensities of the 
reflections, such as those on the photographic plate of Figure 2.33, 
can be related mathematically to the electron densities within the 
protein, because it is the electrons of the atoms that produced them. 
The resolution obtained by X-ray diffraction depends on the num- 
ber of spots that are analyzed. 

Myoglobin was the first protein whose structure was determined 
by X-ray diffraction. The protein was analyzed successively at 6, 2, 
and 1.4 A, with years elapsing between each completed determina- 
tion. Considering that covalent bonds are between 1 and 1.5 A in 
length and noncovalent bonds between 2.8 and 4 A in length, the 
information gathered for a protein depends on the resolution 
achieved. This is illustrated by a comparison of the electron density 
of a small organic molecule at four levels of resolution (FIGURE 18.34). 
In myoglobin, a resolution of 6 A was sufficient to show the manner 
in which the polypeptide chain is folded and the location of the heme 
moiety, but it was not sufficient to show structure within the chain. 
Ata resolution of 2 A, groups of atoms could be separated from one 
another, whereas at 1.4 A, the positions of individual atoms were 
determined. To date, the structures of several hundred proteins have 
been determined at atomic resolution <1.2 A and a fewas lowas 0.66 A. 

Over the years, X-ray diffraction technology has improved 
greatly. It took Max Perutz 22 years to solve the structure of hemo- 
globin (Figure 2.40), a task that today might take a few weeks. In 
most current studies: (1) Intense, highly focused X-ray beams are 
generated by synchrotrons (see page 96), which are high-energy par- 
ticle accelerators that produce X-rays as a by-product, and (2) highly 
sensitive electronic detectors (charge-coupled devices, or CCDs) 
that provide a digital readout of the diffraction data have replaced 
photographic plates. Use of these instruments in conjunction with 
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FIGURE 18.33 X-ray diffraction analysis. Schematic diagram of the 
diffraction of X-rays by atoms of one plane of a crystal onto a photographic 
plate. The ordered array of the atoms within the crystal produces a 
repeating series of overlapping circular waves that spread out and intersect 
the film. As with diffraction of visible light, the waves form an interference 
pattern, reinforcing each other at some points on the film and canceling 
one another at other points. 
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FIGURE 18.34 Electron density distribution of a small organic molecule (diketopiperazine) calculated at several levels of resolution. At the lowest 
resolution (a), only the ring nature of the molecule can be distinguished, whereas at the highest resolution (d), the electron density around each atom 


(indicated by the circular contour lines) is revealed. 


Source: After D. Hodgkin. Reprinted with permission from Nature 188:445, 1960; copyright 1960, Nature by Nature Publishing Group. Reproduced with 
permission of Nature Publishing Group in the format reuse in a book/textbook via Copyright Clearance Center. 


increasingly powerful computers allows researchers to collect and 
analyze sufficient data to determine the tertiary structure of most 
proteins in a matter of hours. As a result of these advances, 
X-ray crystallography has been applied to the analysis of larger and 
larger molecular structures. This is probably best illustrated by the 
success attained in the determination of the structure of the ribo- 
some, which is discussed in Chapter 11. In most cases, as was true in 
the study of the ribosome, the greatest challenge in this field is 
obtaining usable crystals. 

X-ray crystallography is ideally suited for determining the 
structure of soluble proteins that lend themselves to crystallization, 
but can be very challenging for the study of complex multisubunit 
structures, such as ribosomes or proteasomes, or for membrane pro- 
teins, where it is difficult to obtain the three-dimensional crystals 
required for analysis. Structural analysis of these types of specimens 
is often conducted using an alternate technique that takes advantage 
of the tremendous resolving power of the electron microscope and 
computer-based image processing techniques. 

There are two general approaches to the study of single particles 
with the electron microscope. In one approach, the particles are 
placed on an electron microscope grid, negatively stained, and then 
air-dried (as discussed on page 705). In the alternate approach, 
which is referred to as electron cryomicroscopy, or cryo-EM, the par- 
ticles are placed on a grid and rapidly frozen in a hydrated state in 
liquid nitrogen without being fixed or stained (FIGURE 18.35). For 
higher resolution studies, the use of negatively stained specimens 
has largely been replaced by frozen-hydrated particles, which are 
much less likely to generate artifacts and are much better suited for 
the study of internal features within the particle's structure. In either 
case, the grids are placed in the column of the microscope and 
photographs of the particles are taken. Each photograph is a two- 
dimensional image of an individual particle in the orientation that it 
happened to assume as it rested on the grid. When two-dimensional 
images of tens of thousands of different specimens in every conceiv- 
able orientation are averaged by high-powered computer analysis, a 
three-dimensional reconstruction of the particle at a resolution of 
~10 A can be generated. At this resolution, investigators can trace 
the polypeptide chain of a protein and even identify the location of 
bulky amino acid side chains. A model of a eukaryotic ribosome 
based on cryo-EM is illustrated in Figure 2.57, a model of a clathrin- 
coated vesicle in Figure 8.40c, and a model of a U1 snRNP particle in 
Figure 11.31. This technique, which is referred to as single-particle 
reconstruction, is also useful for capturing images of a structure, 
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FIGURE 18.35 Overview of single particle cryo-EM reconstruction. 
After a sample is prepared, a small volume of the sample is placed on a 
carbon grid, which is gently blotted to remove excess liquid and rapidly 
plunged into liquid ethane. The grids are then viewed through a 
transmission electron microscope, preferably fitted with a direct electron 
detector. From the images obtained, many thousands of particles are 
selected and placed into classes that can represent different sample 
orientations or states. The data is processed computationally, and a 3D 
density map is constructed, which reveals the structure of the protein. With 
recent advancements in technology and software, atomic-level structures 
have been solved using cryo-EM. 

Source: From Zhang Laboratory. Photos: Inga Spence/Science Source 
Images; Courtesy of Peter Shen and Adam Frost. 


such as a ribosome, at different stages during a dynamic process, 
such as the elongation step of protein synthesis. Using this approach, 


several of the major conformational changes that occur during each 
step of translation have been revealed. 

In addition, atomic-resolution structures determined by X-ray 
crystallography can be fitted into the lower resolution electron 
microscopic reconstructions to show how the individual molecules 
that make up a multisubunit complex interact and how they might 
work together to carry out a particular activity. FIGURE 18.36 shows 
the tertiary structure of a filament composed of actin and ADF (a 
member of the cofilin family, Section 9.13). The structures of the 
two proteins were determined by separate X-ray crystallographic 
studies and then fitted into an electron microscopic model of an 
actin-ADF filament. The reconstruction shown in this figure served 
as the basis for a proposed mechanism by which cofilin family pro- 
teins can induce severing and depolymerization of an actin filament 
(Section 9.10). 

Electron microscopic analysis of frozen specimens can also be 
utilized in the study of membrane proteins, such as the nicotinic ace- 
tylcholine receptor (see the Experimental Pathway for Chapter 4). 
This type of analysis requires that the membrane proteins be closely 
packed at very low temperatures (e.g., -195°C) into two-dimensional 
crystalline arrays within the plane of the membrane. This technique 
offers an advantage over X-ray crystallographic studies of membrane 
proteins in that the protein remains during the entire process within 
its own natural membrane rather than being extracted in detergent 
and crystallized in a nonmembranous environment. The structures 
illustrated in Section 4.12 were determined from combined, high- 
resolution, electron-microscopic images of many different protein 
molecules taken at various angles relative to the electron beam. This 
technique is known as electron crystallography. 


FIGURE 18.36 Combining data from electron microscopy and X-ray 
crystallography provides information on protein-protein interactions and 
the structure of multisubunit complexes. The electron microscopic 
reconstruction of an actin-ADF filament is shown in grey. The high-resolution 
X-ray crystal structures of individual actin monomers (red) and ADF 
molecules (green) have been fitted into the lower resolution EM structure. 


Source: Courtesy of Edward H. Egelman, University of Virginia, J. Cell Biol. 
163:1059, 2003 Fig. 2a. Reproduced with permission of the Rockefeller 
University Press. 


Just in the past few years, development of new electron detector 719 


hardware and image-processing software has resulted in near- 
atomic level structures being solved using cryo-EM. With the con- 
tinued improvement of technology and sample preparation, 
researchers predict that cryo-EM techniques will soon allow for the 
resolution of atomic-level structures from a relatively small number 
of images. 


18.17 Fractionation of 
Nucleic Acids 


Any systematic fractionation method must exploit differences between 
members of a mixture to effect separation. Nucleic acid molecules 
can differ from one another in overall size, base composition, topology, 
and nucleotide sequence. Accordingly, fractionation methods for 
nucleic acids are based on these features. 


Separation of DNAs by 
Gel Electrophoresis 


Of the various techniques used in the fractionation of proteins dis- 
cussed earlier, one of them, gel electrophoresis, is also widely used to 
separate nucleic acids of different molecular mass (i.e., nucleotide 
length). Small RNA or DNA molecules of a few hundred nucleotides 
or less are generally separated by polyacrylamide gel electrophoresis. 
Larger molecules have trouble making their way through the 
cross-linked polyacrylamide and are generally fractionated on aga- 
rose gels, which have greater porosity. Agarose is a polysaccharide 
extracted from seaweed; it is dissolved in hot buffer, poured into a 
mold, and caused to gelate simply by lowering the temperature. 
Separation of DNA molecules greater than about 25 kb is generally 
accomplished using the technique of pulsed-field electrophoresis in 
which the direction of the electric field in the gel is periodically 
changed, which causes the DNA molecules to reorient themselves 
during their migration (FIGURE 18.37a). 

Following electrophoresis, the DNA fragments in the gel are 
visualized by soaking the gel in a solution of stain such as ethidium 
bromide. Ethidium bromide intercalates into the double helix and 
causes the DNA bands to appear fluorescent when viewed under 
ultraviolet light (Figure 18.37b). The sensitivity of gel electrophore- 
sis is so great that DNA or RNA molecules that differ by only a single 
nucleotide can be separated from one another, a feature that gave 
rise to an invaluable method for DNA sequencing (Section 18.22). 
Because the rate of migration through a gel can also be affected by 
the shape of the molecule, electrophoresis can be used to separate 
molecules with different conformations, such as circular and linear 
or relaxed and supercoiled forms (see Figure 10.13). 


Separation of Nucleic Acids 
by Ultracentrifugation 


Common experience tells us that the stability of a solution (or sus- 
pension) depends on the components. Cream floats to the top of raw 
milk, a fine precipitate gradually settles to the bottom of a container, 
and a solution of sodium chloride remains stable indefinitely. 
Numerous factors determine whether a given component will settle 
through a liquid medium, including the size, shape, and density of 
the substance and the density and viscosity of the medium. If a 
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FIGURE 18.37 Separation of DNA restriction fragments by gel 
electrophoresis. (a) DNA is incubated with a restriction enzyme, which cuts 
it into fragments (page 723). The mixture of fragments is introduced into a 
slot, or well, in a slab of agarose and an electric current is applied. The 
negatively charged DNA molecules migrate toward the positive electrode 
and separate by size. (b) All of the DNA fragments that are present in a gel 
can be revealed by immersing the gel in a solution of ethidium bromide 
and then viewing the gel under an ultraviolet light. 


Source: (b) Phillipe Plailly/Science Photo Library/Photo Researchers, Inc. 


component in a solution or suspension is denser than its medium, 
then centrifugal force causes it to become concentrated toward the 
bottom of a centrifuge tube. Larger particles sediment more rapidly 
than smaller particles of similar shape and density. The tendency for 
molecules to become concentrated during centrifugation is counter- 
acted by the effects of diffusion, which causes the molecules to 
become redistributed in a more uniform (random) arrangement. 
With the development of ultracentrifuges, it has become possible to 


generate centrifugal forces greater than 500,000 times the force of 
gravity, which is sufficient to overcome the effects of diffusion and 
cause macromolecules to sediment toward the bottom of a centri- 
fuge tube. Centrifugation proceeds in a near vacuum to minimize 
frictional resistance. DNA (and RNA) molecules have been exten- 
sively analyzed by techniques utilizing the ultracentifuge. For the 
present purpose, we will consider two centrifugation techniques 
used in studies of nucleic acids that have been discussed in this text. 
These techniques are illustrated in FIGURE 18.38. 


VELOCITY SEDIMENTATION The rate at which a given 
molecule moves in response to centrifugal force in a centrifuge 
is known as its sedimentation velocity. Since the sedimentation 
velocity changes as the centrifugal force changes, a given molecule is 
characterized by a sedimentation coefficient, which is its sedimenta- 
tion velocity divided by the force. Throughout this book we have 
referred to various macromolecules and their complexes as having a 
particular S value. The unit S (or Svedberg, after the inventor of the 
ultracentrifuge) is equivalent to a sedimentation coefficient of 
107" sec. Because the velocity at which a particle moves through a 
liquid column depends on a number of factors, including shape, the 
determination of sedimentation coefficients does not by itself pro- 
vide the molecular mass. However, as long as one is dealing with the 
same type of molecule, the S value provides a good measure of rela- 
tive size. For example, the three ribosomal RNAs of E. coli, the 5S, 
16S, and 23S molecules, have lengths of 120, 1600, and 3200 nucleo- 
tides, respectively. 

In velocity (or rate-zonal) sedimentation, nucleic acid molecules 
are separated according to nucleotide length (Figure 18.38a). The 
sample containing the mixture of nucleic acid molecules is carefully 
layered over a solution containing an increasing concentration of 
sucrose (or other suitable substance). This preformed gradient 
increases in density (and viscosity) from the top to the bottom. 
When subjected to high centrifugal forces, the molecules move 
through the gradient at a rate determined by their sedimentation 
coefficient. The greater the sedimentation coefficient, the farther a 
molecule moves in a given period of centrifugation. Because the 
density of the medium is less than that of the nucleic acid molecules, 
even at the bottom of the tube (approximately 1.2g/ml for the 
sucrose solution and 1.7 g/ml for the nucleic acid), these molecules 
continue to sediment as long as the tube is centrifuged. In other 
words, centrifugation never reaches equilibrium. After a prescribed 
period, the tube is removed from the centrifuge, its contents are frac- 
tionated (as shown in Figure 18.38c), and the relative positions of the 
various molecules are determined. The presence of the viscous 
sucrose prevents the contents of the tube from becoming mixed due 
to either convection or handling, allowing molecules of identical S 
value to remain in place in the form of a band. If marker molecules 
of known sedimentation coefficient are present, the S values of 
unknown components can be determined. Experimental results 
obtained by sucrose-density gradient centrifugation are shown in 
Figures 11.11 and 11.15. 


EQUILIBRIUM CENTRIFUGATION In the other type of 
centrifugation technique, equilibrium centrifugation (Figure 18.38b), 
nucleic acid molecules are separated on the basis of their buoyant 
density. In this procedure, one generally employs a highly concen- 
trated solution of the salt of the heavy-metal cesium. The analysis is 
begun by mixing the DNA with the solution of cesium chloride or 
cesium sulfate in the centrifuge tube and then subjecting the tube to 
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extended centrifugation (e.g., two to three days at high forces). 
During the centrifugation, the heavy cesium ions are slowly driven 
toward the bottom of the tube, forming a continuous density gradi- 
ent through the liquid column. After a time, the tendency for cesium 
ions to be concentrated toward the bottom of the tube is counterbal- 
anced by the opposing tendency for them to become redistributed by 
diffusion, and the gradient becomes stabilized. As the cesium gradi- 
ent is forming, individual DNA molecules are driven downward or 


FIGURE 18.38 Techniques of nucleic acid sedimentation. (a) Separation 721 


of different-sized DNA molecules by velocity sedimentation. The sucrose 
density gradient is formed within the tube (step 1) by allowing a sucrose 
solution of increasing concentration to drain along the wall of the tube. 
Once the gradient is formed, the sample is carefully layered over the top of 
the gradient (steps 2 and 3), and the tube is subjected to centrifugation 
(e.g., 50,000 rem for 5 hours) as illustrated in step 4. The DNA molecules 
are separated on the basis of their size (step 5). (b) Separation of DNA 
molecules by equilibrium sedimentation on the basis of differences in base 
composition. The DNA sample is mixed with the CsCl solution (step 1) and 
subjected to extended centrifugation (e.g., 50,000 rom for 72 hours). The 
CsCl gradient forms during the centrifugation (step 2), and the DNA 
molecules band in regions of equivalent density (step 3). (c) The tube from 
the experiment of b is punctured and the contents are allowed to drip into 
successive tubes, thereby fractionating the tube’s contents. The absorbance 
of the solution in each fraction is measured and plotted as shown. 


move buoyantly upward in the tube until they reach a position that 
has a buoyant density equivalent to their own, at which point they are 
no longer subject to further movement. Molecules of equivalent den- 
sity form narrow bands within the tube. This technique is sensitive 
enough to separate DNA molecules having different base composi- 
tion (as illustrated in Figure 18.38b) or ones containing different iso- 
topes of nitrogen (‘°N vs. “N, as shown in Figure 13.30). 


18.18 Nucleic Acid Hybridization 


Nucleic acid hybridization includes a variety of related tech- 
niques that are based on the observation that two single-stranded 
nucleic acid molecules of complementary base sequence can form 
a double-stranded hybrid. Consider a situation in which one has 
a mixture of hundreds of fragments of DNA of identical length and 
overall base composition that differ from one another solely in their 
base sequence. Assume, for example, that one of the DNA fragments 
constitutes a portion of the B-globin gene and all the other fragments 
contain unrelated genes. The only way to distinguish between the 
fragment encoding the B-globin polypeptide and all the others is to 
carry out a molecular hybridization experiment using complemen- 
tary molecules as probes. 

In the present example, incubation of the mixture of denatured 
DNA fragments with an excess number of B-globin mRNAs would 
drive the globin fragments to form double-stranded DNA-RNA 
hybrids, leaving the other DNA fragments single-stranded. There 
are a number of ways one could separate the DNA-RNA hybrids 
from the single-stranded fragments. For example, the mixture could 
be passed through a column of hydroxylapatite under ionic condi- 
tions in which the hybrids would bind to the calcium phosphate salts 
in the column, while the nonhybridized DNA molecules would pass 
through unbound. The hybrids could then be released from the col- 
umn by increasing the concentration of the elution buffer. 

Experiments using nucleic acid hybridization require the incu- 
bation of two populations of complementary single-stranded nucleic 
acids under conditions (ionic strength, temperature, etc.) that pro- 
mote formation of double-stranded molecules. Depending on the 
type of experiment being conducted, the two populations of reacting 
molecules may both be present in solution, or one of the populations 
may be immobilized, for example, by localization within a chromo- 
some (as in Figure 10.20). 
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FIGURE 18.39 Determining the location of specific DNA fragments in a gel by a Southern blot. As described in the figure, the fractionated DNA 
fragments are washed out of the gel and trapped onto a nitrocellulose membrane, which is incubated with radioactively (or fluorescently) labeled DNA (or 
RNA) probes. The location of the hybridized fragments is determined autoradiographically (or microscopically if the probes are fluorescently labeled). During 
the blotting procedure, capillary action draws the buffer upward into the paper towels. As the buffer moves through the electrophoretic gel, it dissolves the 


DNA fragments and transfers them to the surface of the adjacent membrane. 


In many cases, one of the populations of single-stranded nucleic 
acids to be employed in a hybridization experiment is present within 
a gel. Consider a population of DNA fragments that have been pre- 
pared from genomic DNA and fractionated by gel electrophoresis 
(FIGURE 18.39). To carry out the hybridization, the gel is treated to 
render the DNA single-stranded. The single-stranded DNA is trans- 
ferred from the gel to a nitrocellulose membrane and fixed onto the 
membrane by heating it to 80°C in a vacuum. The procedure by 
which DNA is transferred to a membrane is termed blotting. Once 
the DNA is attached, the membrane is incubated with a labeled, 
single-stranded DNA (or RNA) probe capable of hybridizing to a 
complementary group of fragments. Unbound probe is then washed 
away, and the location of the bound probe is determined autoradio- 
graphically as shown in Figure 18.39. The experiment just described 
and depicted in Figure 18.39 using a radioactive probe is called a 
Southern blot (named after Edwin Southern, its developer). One or a 
few DNA restriction fragments that contain a particular nucleotide 
sequence can be identified in a Southern blot, even if there are 
thousands of unrelated fragments present in the gel. An example of 
a Southern blot is shown in Figure 10.19. RNA molecules can also be 
separated by electrophoresis and identified with a labeled DNA 
probe after being blotted onto a membrane. An example of this pro- 
cedure, which is called a Northern blot, is shown in Figure 11.33. 

DNA probes can be labeled in a variety of ways. A radioactive 
probe incorporates a radioactive isotope (such as **P) at one or 
more locations in the molecule. The presence of the probe is detected 
by autoradiography as shown in Figure 18.39. Probes can also be 
labeled with fluorophores and detected by fluorescence. Another 
commonly used label is biotin, a small organic molecule that can be 
covalently linked to the DNA backbone. Biotin is detected by the 
protein avidin (or streptavidin), which binds tightly to it. The avidin 


itself must be labeled for detection, such as with a fluorophore as 
shown in Figures 10.20 and 10.21. 

Nucleic acid hybridization can also provide a measure of the 
similarity in nucleotide sequence between two samples of DNA, as 
might be obtained, for example, from two different organisms. The 
more distant the evolutionary relationship between the two species, 
the greater the divergence of their DNA sequences. If purified DNA 
from species A and B is mixed together, denatured, and allowed to 
reanneal, a percentage of the DNA duplexes are formed by DNA 
strands from the two species. Because they contain mismatched 
bases, such duplexes are less stable than those formed by DNA 
strands of the same species, and this instability is reflected in the 
lower temperature at which they melt. When DNAs from different 
species are allowed to reanneal in different combinations, the melt- 
ing temperature (T_,, Section 10.8) of the hybrid duplexes provides a 
measure of the evolutionary distance between the organisms. Two 
other important types of nucleic acid hybridization protocols are 
discussed in detail in the text: in situ hybridization on page 386 and 
hybridization to cDNA microarrays on page 485. 


18.19 Chemical Synthesis of DNA 


Hybridization analysis requires single-stranded nucleic acid mole- 
cules for use as probes. Other fundamental techniques for the 
manipulation and analysis of DNA in the laboratory also require 
short single-stranded nucleic acid molecules, or oligonucleotides. 
Chemical synthesis of DNA and RNA is therefore a key supporting 
technology for many procedures. 

The development of chemical techniques to synthesize polynu- 
cleotides having a specific base sequence was begun by H. Gobind 


Khorana in the early 1960s as part of an attempt to decipher the 
genetic code. Khorana and co-workers continued to refine their 
techniques, and a decade after their initial work on the code, they 
succeeded in synthesizing a complete bacterial tyrosine tRNA gene, 
including the nontranscribed promoter region. The gene totaling 
126 base pairs was put together from over 20 segments, each of 
which was individually synthesized and later joined enzymatically. 
This artificial gene was then introduced into bacterial cells carrying 
mutations for this tRNA, and the synthetic DNA was able to replace 
the previously deficient function. The first chemically synthesized 
gene encoding an average-sized protein, human interferon, was pre- 
pared in 1981, an effort that required the synthesis and assembly of 
67 different fragments to produce a single duplex of 514 base pairs 
containing initiation and termination signals recognized by the bac- 
terial RNA polymerase. 

The chemical reactions that link nucleotides have been 
automated, and oligonucleotide synthesis is now carried out by 
computer-controlled machines hooked to reservoirs of reagents. The 
operator enters the desired nucleotide sequence into the computer 
and keeps the instrument supplied with materials. The oligonucleo- 
tide is assembled one nucleotide at a time from the 3’ to the 5’ end 
of the molecule, up to a total of about 200 nucleotides. Modifications 
such as biotin and fluorophores can be incorporated into the mole- 
cules. If a double-stranded molecule is needed, it is synthesized as 
two complementary single strands that can be hybridized together. 
Longer synthetic molecules are made in segments that are joined 
together as illustrated by the experiment discussed on page 22. 


18.20 Recombinant DNA 
Technology 


Over the past 30 years, tremendous advances have been made in the 
analysis of eukaryotic genomes. This progress began as molecular 
biologists learned to construct recombinant DNA molecules, 
which are molecules containing DNA sequences derived from more 
than one source. Recombinant DNAs can be used in myriad ways. 
We will begin by considering one of the most important applica- 
tions: the isolation from the genome of a particular segment of DNA 
that encodes a particular polypeptide. First, however, it is necessary 
to consider a class of enzymes whose discovery and use has made the 
formation of recombinant DNA molecules possible. 


Restriction Endonucleases 


During the 1970s, it was found that bacteria contained nucleases 
that would recognize short nucleotide sequences within duplex 
DNA and cleave the DNA backbone at specific sites on both strands 
of the duplex. These enzymes are called type II restriction endo- 
nucleases, or simply restriction enzymes. They were given this 
name because they function in bacteria to destroy viral DNAs that 
might enter the cell, thereby restricting the growth of the viruses. The 
bacterium protects its own DNA from nucleolytic attack by methyl- 
ating the bases at susceptible sites, a chemical modification that 
blocks the action of the enzyme. 

Enzymes from several hundred different prokaryotic organisms 
have been isolated that, together, recognize over 100 different nucle- 
otide sequences. The sequences recognized by most of these enzymes 
are four to six nucleotides long and are characterized by a particular 


type of internal symmetry. Consider the particular sequence recog- 723 


nized by the enzyme EcoR1: 


i 
3’—CTTAAG—5’ 
5 Sa —3' 


This segment of DNA is said to have twofold rotational symmetry 
because it can be rotated 180°C without change in base sequence. 
Thus, if one reads the sequence in the same direction (3’ to 5’ or 5’ 
to 3’) on either strand, the same order of bases is observed. 
A sequence with this type of symmetry is called a palindrome. When 
the enzyme EcoR1 attacks this palindrome, it breaks each strand at 
the same site in the sequence, which is indicated by the arrows 
between the A and G residues. The red dots indicate the methylated 
bases in this sequence that protect the host DNA from enzymatic 
attack. Some restriction enzymes cleave bonds directly opposite one 
another on the two strands producing blunt ends, whereas others, 
such as EcoR1, make staggered cuts. 

The discovery and purification of restriction enzymes have 
been invaluable in the advances made by molecular biologists in 
recent years. Because a particular sequence of four to six nucleotides 
occurs quite frequently simply by chance, any type of DNA is sus- 
ceptible to fragmentation by these enzymes. The use of restriction 
enzymes allows the DNA of the human genome, or that of any other 
organism, to be dissected into a precisely defined set of specific frag- 
ments. Once the DNA from a particular individual is digested with 
one of these enzymes, the fragments generated can be fractionated 
on the basis of length by gel electrophoresis (as in FIGURE 18.40a). 
Different enzymes cleave the same preparation of DNA into differ- 
ent sets of fragments, and the sites within the genome that are cleaved 
by various enzymes can be identified and ordered into a restriction 
map such as that depicted in Figure 18.40b. 


Formation of Recombinant DNAs 


Recombinant DNAs can be formed in a variety of ways. In the 
method shown in Figure 18.41, DNA molecules from two different 
sources are treated with a restriction enzyme that makes staggered 
cuts in the DNA duplex. Staggered cuts leave short, single-stranded 
tails that act as “sticky ends” that can bind to a complementary sin- 
gle-stranded tail on another DNA molecule to restore a double- 
stranded molecule. In the example depicted in FIGURE 18.41, one of 
the DNA fragments that will make up the recombinant molecule is a 
bacterial plasmid. Plasmids are small, circular, double-stranded 
DNA molecules that are separate from the main bacterial chromo- 
some. The other DNA fragment in Figure 18.41 is obtained from 
human cells following treatment with the same restriction enzyme 
used to open the plasmid. When the human DNA fragments and the 
treated plasmid are incubated together in the presence of DNA 
ligase, the two types of DNAs become hydrogen bonded to one 
another by their sticky ends and are then ligated to form circular 
DNA recombinants, as in Figure 18.41. The first recombinant DNA 
molecules were formed by this basic method in 1972 by Paul Berg, 
Herbert Boyer, Annie Chang, and Stanley Cohen of Stanford 
University and the University of California, San Francisco, marking 
the birth of modern genetic engineering. 

By following the procedure just described, a large number 
of different recombinant molecules are produced, each of which 
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724 FIGURE 18.40 The construction of a restriction map rs —— ORIGIN ————E —ORIGIN 
of the small circular genome of the DNA tumor virus ' Ba ac 
polyoma. (a) Autoradiographs of 525 labeled DNA 
fragments that have been subjected to gel electrophore- 
sis. The gel on the left shows the pattern of DNA 77 _—_ Hpail-1 
fragments obtained after a complete digestion of the cll -2 
polyoma genome with the enzyme Hpall. To determine : 
how these eight fragments are pieced together to make __— Hpall - 3 _ 
up the intact genome, it is necessary to treat the DNA in _— Hpail-4 
such a way that overlapping fragments are generated. * 
Overlapping fragments can be produced by treating the A 
intact genome with a second enzyme that cleaves the A— 
molecule at different sites, or by treating the genome ___ Hpalll - 5 
with the same enzyme under conditions where the DNA _ ___ Hpall-6 Cries —_— 
is not fully digested as it was in the left gel. The two gels 
on the right represent examples of partial digests of the D—, — 
polyoma genome with Hpall. The middle gel shows the ee Heall-7 E a _—Hpall-1+F 
fragments generated by partial digestion of the Hpall - ee 
superhelical circular DNA, and the gel on the right shows HH ss a St 
the Hpall fragments formed after the circular genome is Hpall - 2 a me — K 
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that makes only one cut in the circle). (b) The restriction sat ‘ J 
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shown in their ordered arrangement below the map. -—— BLUE 
(Fragments L and M migrate to the bottom of the gel in 
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Source: From Beverly E. Griffin, Mike Fried, and Allison 
Cowie, Proc. Nat'l. Acad. Sci. U.S.A. 71:2078, 1974. (a) 
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contains a bacterial plasmid with a segment of human DNA incor- 
porated into its circular structure (see Figure 18.42). Suppose you 
were interested in isolating a single gene from the human genome, 
for example, the gene that codes for insulin. Because your goal is to 
obtain a purified preparation of the one type of recombinant DNA 
that contains the insulin-coding fragment, you must separate this 
one fragment from all of the others. This is done by a process called 
DNA cloning. We will return to the search for the insulin gene after 
describing the basic methodology behind DNA cloning. 


DNA Cloning 


DNA cloning is a technique to produce large quantities of a specific 
DNA segment. The DNA segment to be cloned is first linked to a 
vector DNA, which is a vehicle for carrying foreign DNA into a 


suitable host cell, such as the bacterium E. coli. The vector contains 
sequences that allow it to be replicated within the host cell. Two 
types of vectors are commonly used to clone DNAs within bacterial 
hosts. In one approach, the DNA segment to be cloned is introduced 
into the bacterial cell by joining it to a plasmid, as described previ- 
ously, and then causing the bacterial cells to take up the plasmid 
from the medium. In an alternate approach, the DNA segment is 
joined to a portion of the genome of the bacterial virus lambda (1), 
which is then allowed to infect a culture of bacterial cells, producing 
large numbers of viral progeny, each of which contains the foreign 
DNA segment. Either way, once the DNA segment is inside a bacte- 
rium, it is replicated along with the bacterial (or viral) DNA and 
partitioned to the daughter cells (or progeny viral particles). Thus, 
the number of recombinant DNA molecules increases in proportion 
to the number of bacterial cells (or viral progeny) that are formed. 
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FIGURE 18.41 Formation of a recombinant DNA molecule. In this 
example, a preparation of bacterial plasmids is treated with a restriction 
enzyme that makes a single cut within each bacterial plasmid. This same 
restriction enzyme is used to fragment a preparation of human genomic 
DNA into small fragments. Because they have been treated with the same 
restriction enzyme, the cleaved plasmid DNA and the human DNA 
fragments will have sticky ends. When these two populations are incubated 
together, the two DNA molecules become noncovalently joined to each 
other and are then covalently sealed by DNA ligase, forming a recombinant 
DNA molecule. 


From a single recombinant plasmid or viral genome inside a single 
bacterial cell, millions of copies of the DNA can be formed within a 
short period of time. Once the amount of DNA has been sufficiently 
amplified, the recombinant DNA can be purified and used in other 
procedures. In addition to providing a means to amplify the amount 
of a particular DNA sequence, cloning can also be used as a tech- 
nique to isolate a pure form of any specific DNA fragment among a 
large, heterogeneous population of DNA molecules. We will begin 
by discussing DNA cloning using bacterial plasmids. 


CLONING EUKARYOTIC DNAS IN BACTERIAL 
PLASMIDS The foreign DNA to be cloned is first inserted into 
the plasmid to form a recombinant DNA molecule. Plasmids used 
for DNA cloning are modified versions of those found in bacterial 
cells. Like the natural counterparts from which they are derived, these 
plasmids contain an origin of replication and one or more genes that 
make the recipient cell resistant to one or more antibiotics. Antibiotic 


resistance allows investigators to select for those cells that contain 725 


the recombinant plasmid. 

As first demonstrated by Avery, Macleod, and McCarty (page 379), 
bacterial cells can take up DNA from their medium. This phenomenon 
forms the basis for cloning plasmids in bacterial cells (FIGURE 18.42). 
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FIGURE 18.42 An example of DNA cloning using bacterial plasmids. 
DNA is extracted from human cells, fragmented with EcoR1, and the 
fragments are inserted into a population of bacterial plasmids. Techniques 
are available to prevent the formation of plasmids that lack a foreign DNA 
insert. Once a bacterial cell has picked up a recombinant plasmid from the 
medium, the cell gives rise to a colony of cells containing the recombinant 
DNA molecule. In this example, most of the bacteria contain eukaryotic 
DNAs of unknown function (labeled as ?), while one contains a portion 

of the DNA encoding ribosomal RNA, and another contains DNA 
encoding insulin. 
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are added to a bacterial culture that has been pretreated with cal- 
cium ions. When subjected to a brief heat shock, such bacteria are 
stimulated to take up DNA from their surrounding medium. Usually 
only a small percentage of cells are competent to pick up and retain 
one of the recombinant plasmid molecules. Once it is taken up, the 
plasmid replicates autonomously within the recipient and is passed 
on to the progeny during cell division. Those bacteria containing a 
recombinant plasmid can be selected from the others by growing the 
cells in the presence of the antibiotic to which resistance is conferred 
by one or more genes on the plasmid. 

We began this discussion with the goal of isolating a small DNA 
fragment that contains the sequence for the insulin gene. To this 
point, we have formed a population of bacteria containing many dif- 
ferent recombinant plasmids, very few of which contain the gene 
being sought (as in Figure 18.42). One of the great benefits of DNA 
cloning is that, in addition to producing large quantities of particular 
DNAs, it allows one to separate different DNAs from a mixture. It 
was noted above that plasmid-containing bacteria can be selected 
from those without plasmids by treatment with antibiotics. Once 
this has been done, the plasmid-bearing cells can be grown at low 
density on petri dishes so that all of the progeny of each cell (a clone 
of cells) remain physically separate from the progeny of other cells. 
Because a large number of different recombinant plasmids were pre- 
sent initially in the medium, the different cells plated on the dish 
contain different foreign DNA fragments. Once the cells containing 
the various plasmids have grown into separate colonies, the investi- 
gator can search through the colonies for the few that contain the 
gene being sought—in this case, the insulin gene. Screening is typi- 
cally done by selecting individual colonies and expanding the popu- 
lation of each in test tubes containing media. The plasmid DNA 
from the bacteria in each test tube is then isolated (often using an 
easy-to-use prepackaged purification “kit”) and submitted for DNA 
sequencing. 


CLONING EUKARYOTIC DNAS IN PHAGE GENOMES 
A less common cloning vector is the bacteriophage lambda, which is 
depicted in FIGURE 18.43. The lambda genome is a linear, double- 
stranded DNA molecule approximately 50 kb in length. The modi- 
fied strain used in most cloning experiments contains two cleavage 
sites for the enzyme EcoR1, which fragments the genome into three 
large segments. Conveniently, all the information essential for infec- 
tion and cell lysis is contained in the two outer segments, so that the 
dispensable middle fragment can be replaced by a piece of eukary- 
otic DNA up to approximately 25 kb. Recombinant DNA molecules 
can be packaged into phage heads in vitro, and these genetically 
engineered phage particles can be used to infect host bacteria. (Phage 
DNA molecules lacking the insert are too short to be packaged.) 
Once in a bacterium, the eukaryotic DNA segment is amplified 
along with the viral DNA and then packaged into a new generation 
of virus particles, which are released when the cell is lysed. The 
released particles infect new cells, and soon a clear spot (or plaque) 
in the bacterial “lawn” is visible at the site of infection. Each plaque 
contains millions of phage particles, each carrying a single copy of 
the same eukaryotic DNA fragment. 

The cloning of larger-sized DNA in either bacteria (as BACs) or 
yeast (as YACs) is discussed on page 731. 
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FIGURE 18.43 Protocol for cloning eukaryotic DNA fragments in 
lambda phage. The steps are described in the text. 


18.21 Enzymatic Amplification 
of DNA by PCR 


In 1983, a new technique was conceived by Kary Mullis of Cetus 
Corporation that has become widely used to amplify specific DNA 
fragments without the need for bacterial cells. This technique is 
known as polymerase chain reaction (PCR). There are many 
different PCR protocols used for a multitude of different applica- 
tions in which anywhere from one to a large population of related 
DNAs can be amplified. PCR amplification is readily adapted to 
RNA templates by first converting them to complementary DNAs 
using reverse transcriptase. 


Process of PCR polymerase and all four deoxyribonucleotides, along with a large 727 
excess of two short, synthetic DNA fragments (oligonucleotides) 


The basic procedure used in PCR is depicted in FIGURE 18.44. The that are complementary to DNA sequences at the 3’ ends of the 


technique employs a heat-stable DNA polymerase, called Taq poly- 
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FIGURE 18.44 Polymerase chain reaction (PCR). As discussed in the text, the procedure takes advantage of a heat-resistant DNA polymerase whose 
activity is not destroyed when the temperature is raised to separate the two strands of the double helix. With each cycle of duplication, the strands are 
separated, flanking segments (primers) bind to the ends of the selected region, and the polymerase copies the intervening segment. 


728 separate into their two component strands. The mixture is then 
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cooled to about 60°C to allow the primers to hybridize to the strands 
of the target DNA, and the temperature is raised to about 72°C, to 
allow the thermophilic polymerase to add nucleotides to the 3’ end 
of the primers. As the polymerase extends the primer, it selectively 
copies the target DNA, forming new complementary DNA strands. 
The temperature is raised once again, causing the newly formed and 
the original DNA strands to separate from each other. The sample is 
then cooled to allow the synthetic primers in the mixture to bind once 
again to the target DNA, which is now present at twice the original 
amount. This cycle is repeated over and over again, each time dou- 
bling the amount of the specific region of DNA that is flanked by the 
bound primers. Billions of copies of this one specific region can be 
generated in just a few hours using a thermal cycler that automati- 
cally changes the temperature of the reaction mixture to allow each 
step in the cycle to take place. 


Applications of PCR 


AMPLIFYING DNA FOR CLONING OR ANALYSIS 
Since its invention, PCR has found many uses. It can generate many 
copies of a specific DNA fragment prior to cloning the fragment, an 
efficient approach if the target sequence is known in sufficient detail 
that the nucleotide sequence of two complementary primers can be 
specified. This is particularly helpful in cases where the source DNA 
is very scarce, because PCR can generate large amounts of DNA 
from minuscule samples, such as that in a single cell. PCR has been 
used in criminal investigations to generate quantities of DNA from a 
spot of dried blood left on a crime suspect's clothing or even from 
the DNA present in part of a single hair follicle left at the scene of a 
crime. For this purpose, one selects regions of the genome for ampli- 
fication that are highly polymorphic (i.e., vary at high frequency 
within the population), so that no two individuals will have the 
same-sized DNA fragments (as in Figure 10.19). This same proce- 
dure can be used to study DNA fragments from well-preserved fossil 
remains that may be millions of years old. The activity of DNA 
polymerase in PCR is also employed in DNA sequencing (see 
Section 18.22). 


TESTING FOR THE PRESENCE OF SPECIFIC DNA 
SEQUENCES Suppose you wanted to determine whether a tis- 
sue sample contains a particular virus. You could answer this ques- 
tion by a Southern hybridization (page 722) or you could use PCR. 
In the PCR approach, nucleic acid is isolated from the sample and 
PCR primers complementary to the viral DNA are added, along with 
the other PCR reagents. The reaction is then allowed to proceed. If 
the virus genome is present in the sample, the PCR primers will 
hybridize to it and the PCR reaction will generate a product. If the 
virus is not present, the PCR primers will not hybridize and no prod- 
uct will be generated. Thus, in this application, the PCR reaction 
itself serves as the detection system. 


COMPARING DNA MOLECULES If two DNA molecules 
have the same base sequence, they will yield the same PCR products 
in reactions with identical primers. This is the premise for quick 
assays that compare the similarity of two DNA samples such as 
genomic DNA from bacterial isolates. PCR is performed on the 
samples using several primers, which can be specifically designed or 
randomly generated. The products are separated by gel electrophoresis 


and compared. The more similar the sequences of the bacterial 
genomes, the more similar their PCR products will be. 


QUANTIFYING DNA OR RNA TEMPLATES PCR can 
also be used to determine how much of a specific nucleotide 
sequence (DNA or RNA) is present in a mixed sample. One approach 
to this quantitative PCR uses the binding of a dye specific for double- 
stranded DNA to quantify the amount of double-stranded product 
being generated. The rate of accumulation of product is proportional 
to the amount of template present in the sample. 

Another approach uses what have been called “molecular bea- 
cons.” These are short reporter oligonucleotides with a fluorophore 
bound to one end and a quencher molecule on the other end that 
hybridize in the middle of the target sequence to be amplified. As 
long as the short oligonucleotide is intact, the fluorophore and 
quencher are close enough in proximity that fluorescence is 
quenched. When the DNA polymerase synthesizes a new strand of 
DNA complementary to the template, its exonuclease activity 
(page 523) degrades the reporter oligonucleotide. The fluorophore is 
thus separated from the quencher and fluoresces. The amount of 
fluorophore liberated in a given PCR cycle is directly proportional to 
the number of template molecules being copied by the polymerase. 


18.22 DNA Sequencing 


By 1970, the amino acid sequence of a long list of proteins had been 
determined, yet virtually no progress had been made toward the 
sequencing of nucleotides in DNA. There were several reasons for 
this state of affairs. Unlike DNA molecules, polypeptides come in 
defined and manageable lengths; a given polypeptide species could 
be readily purified; a variety of techniques were available to cleave 
the polypeptide at various sites to produce overlapping fragments; 
and the presence of 20 different amino acids having widely varying 
properties made separation and sequencing of small peptides a 
straightforward task. Then in the mid-1970s, a revolution in DNA 
sequence technology took place. By 1977, the complete nucleotide 
sequence of an entire viral genome was reported, that of @X174, 
some 5375 nucleotides in length. This milestone in molecular biol- 
ogy was accomplished in the laboratory of Frederick Sanger, who 
had determined the first amino acid sequence of a polypeptide 
(insulin) some 25 years earlier. By 2001 the rough draft of the human 
genome sequence—equivalent to roughly 3 billion base pairs—was 
published; the product of years of work by hundreds of scientists. 

These advances in DNA sequencing were made possible by 
developments in several areas: molecular approaches to DNA 
sequencing, instrumentation that could be automated, more power- 
ful and widely available computers, and software for data analysis. 
The initial key was the development of a feasible approach for deter- 
mining the sequence of large DNA fragments. This advance itself 
became possible because of the discovery of restriction enzymes and 
the development of cloning technologies, which provided the means 
necessary to prepare a defined DNA fragment in sufficient quantity 
to carry out the necessary biochemical procedures. 

By 1980, a DNA sequencing methodology developed by Sanger 
and A. R. Coulson of the Medical Research Council in Cambridge, 
England, had gained widespread acceptance as the method of choice. 
Following the development of PCR, the Sanger-Coulson sequencing 
approach and PCR were merged into a sequencing procedure that 
combines the biochemistry of Sanger-Coulson with the repetitive 


cycles and automation of PCR. This so-called cycle sequencing 
became widely used in genome sequencing and its basic steps are 
outlined in FIGURE 18.45. Although the Sanger-Coulson approach is 
no longer used in major DNA sequencing projects, its historic 
importance as the technique employed in all of the earlier genome- 
sequencing studies, including that of the Human Genome Project, as 
well as the elegance of its biochemical methodology, makes a brief 
description a valuable learning tool in molecular biology. 

In this procedure, one begins with a population of identical tem- 
plate molecules, either a PCR product or a cloned DNA fragment. 
The template DNA is mixed with a primer that is complementary to 
the 3’ end of one strand of the region to be sequenced. The reaction 
mixture also contains the heat stable Taq DNA polymerase, all four 
deoxyribonucleoside triphosphate precursors (ANTPs), and a low 
concentration of modified precursors called dideoxyribonucleoside 
triphosphates, or ddNTPs. Each ddNTP (ddATP, ddGTP, ddCTP, and 
ddTTP) has been modified by the addition of a different-colored 
fluorescent dye to its 3’ end. 

The sequencing reaction begins, like PCR, by heating the 
mixture to a temperature that causes the two template strands to 
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denature (Figure 18.45a, step 1). Next, the reaction is cooled so that 
the primer can hybridize to the template DNA (step 2). Note that, in 
contrast to PCR, only one primer is present, so that only one of the 
two strands of template DNA can hybridize to a primer. In step 3, the 
Taq polymerase adds dNTPs to the end of the primer that are com- 
plementary to the template molecule, synthesizing a new comple- 
mentary strand of DNA. Every now and then, the polymerase inserts 
a ddNTP instead of a dNTP. Dideoxynucleotides lack a hydroxyl 
group at both their 2’ and 3’ positions. When one of these nucleo- 
tides has been incorporated onto the end of a growing chain, the lack 
of the 3’ OH makes it impossible for the polymerase to add another 
nucleotide, thus causing chain termination (Figure 18.45, step 4). 
Because the ddNTP is present at much lower concentration in the 
reaction mixture than the corresponding dNTP, the incorporation 
of the ddNTP is infrequent and random; it may be incorporated near 
the beginning of one chain, near the middle of another, or not until 
the end of a third chain. Regardless, when the ddNTP is incorporated, 
growth of the chain ceases at that point. 

After the chain extension phase of the reaction is complete, the 
temperature is raised again to denature the new double-stranded 
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TT GGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTAT 
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FIGURE 18.45 DNA sequencing. (a) The basic steps in sequencing a small hypothetical fragment by the Sanger-Coulson (dideoxy) technique, as 
described in the text. (b) Gel lanes in which fluorescently labeled daughter molecules have been separated. The color of the band is determined by the 
identity of the dideoxynucleotide at the 3’ end of the DNA strand. (c) The sequence of nucleotides in the template strand is interpreted by a computer that 
“reads” from the bottom to the top of the gel, using the intensity and wavelength of the fluorescent light as input. The computer generates an “electro- 


pherogram” showing the intensity and color of the detected fluorescence, along with the DNA sequence interpretation. 
Source: (b, c) From Leroy Hood and David Galas, Nature 421:445, 2003, Fig. 1c and 1d. Reprinted by permission from Macmillan Publishers Ltd. 
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ation is repeated many times, generating a large population of 
daughter DNA strands that by now includes molecules in which a 
ddNTP has been incorporated at every position. For every A on the 
template strand, for example, there will be daughter molecules that 
terminate in a ddTTP at that position. When all cycles are complete, 
the reaction products are separated by electrophoresis on very thin 
capillary gels (Figure 18.45, step 5). 

High-resolution gel electrophoresis can separate fragments that 
differ by only one nucleotide in length, so that each successive band 
in the gel contains molecules that are one nucleotide longer than 
those in the previous band. Because each ddNTP was labeled with a 
unique fluorescent dye, the color of the band (as read by an auto- 
mated laser detector) reveals the identity of the terminal nucleotide 
on each daughter molecule (Figure 18.45b). The order of the colors 
in the gel therefore corresponds to the base sequence of the template 
molecule (Figure 18.45c). 

Beginning about 2005, a revolution in DNA sequencing technol- 
ogy has taken place, driven by the goal of sequencing genomes—both 
human and others—rapidly and inexpensively. Costs for sequencing 
massive amounts of DNA have decreased dramatically over the past 
few years, and by 2014 it became possible to sequence the genome of 
a given person for a cost somewhere in the range of about $1,000, 
which is orders of magnitude cheaper than just a few years earlier. 

This progress has been made possible by the development of 
entirely new strategies for DNA sequencing. The first of these new 
strategies is referred to as massively parallel sequencing (or 2d genera- 
tion sequencing, see Experimental Walkthrough video for Chapter 10) 
and has dominated genomic sequencing efforts for the past several 
years. Unlike the sequencing steps of the original Human Genome 
Project, massively parallel sequencing technologies do not require 
that pieces of the genome be cloned in yeast or bacteria. Instead, 
fragments are prepared directly from the whole genome and each is 
subsequently amplified to form a huge population. Like the Sanger- 
Coulson approach, massively parallel sequencing techniques are 
based on polymerase-dependent DNA synthesis, but they do not 
utilize premature strand termination nor do they require separation 
of strands by electrophoresis, which limits the number of samples 
that can be simultaneously monitored. Instead, they accomplish the 
direct identification of individual nucleotides as they are being 
incorporated by the polymerases in real time. A number of different 
instruments are available, each capable of carrying out a variation of 
this type of rapid automated DNA sequencing. In all of these cases, 
huge numbers (up to ~ 10°) of DNA molecules are immobilized on a 
surface and then incubated with DNA polymerase in the presence of 
the four different NTPs. Various strategies are used to identify the 
nucleotides that are sequentially incorporated into each of the com- 
plementary strands as they are synthesized on the massive numbers 
of “parallel” DNA templates. 

The strands that are synthesized in all of these platforms are 
relatively short (less than 100 bases in length) compared to the ear- 
lier sequencers that utilized electrophoretic methods (approximately 
800 bases in length). These newer sequencers are also more error-prone 
than the instruments used in first generation efforts. However, there 
are so many copies of each DNA strand being read simultaneously 
that high accuracy can be achieved by determining the most abun- 
dant sequence generated. The short segments being synthesized 
(called “reads”) do become a problem when researchers are trying to 
determine the DNA sequence of the genome of a new species, 


because it can be very difficult to stitch together (assemble) a vast 
number of short reads into the very long DNA molecules present in 
a eukaryotic chromosome. Consequently, these DNA sequencing 
technologies are best suited for studying individual genomes from a 
species, such as the human species, where researchers already pos- 
sess a reference genome into which a given DNA fragment can be 
placed. Despite the challenges, genomes of new species, such as the 
panda, have been assembled using these instruments. Another prob- 
lem with the use of massively parallel sequencers is that the data 
cannot be sorted into a person’s separate maternal and paternal 
haploid genomes, or their haplotypes (page 402). This may limit 
their usefulness in applications for genomic medicine. 

Over the past few years, a new collection of DNA sequencing 
instruments, often referred to as third generation sequencers, have 
been (and will be) introduced into the market. As a group, these 
sequencers use very different mechanistic strategies for determining 
nucleotide sequences, although most of them act on single DNA 
molecules (which eliminates the need for DNA amplification), 
generate long reads (e.g., 1000 nucleotides), and operate at lower cost 
and faster rates than second generation instruments. We will briefly 
consider a couple of these sequencing strategies. In one case, sequenc- 
ing is accomplished by pulling each DNA molecule through a tiny 
hole, or “nanopore” and identifying each nucleotide, one at a time, as 
it passes through the opening. Nucleotide identification is based on 
differences in ionic properties among the four nucleotides that can be 
detected as each nucleotide passes one after another through the 
nanopore. With large numbers of nanopores operating simultane- 
ously, the potential for very rapid sequencing is possible. Other third 
generation sequencers work ona “sequencing-by-synthesis” approach. 
In one such instrument, a laser-based detection system identifies 
fluorescently labeled nucleotides as they are incorporated by a single 
DNA polymerase moving processively along a DNA strand. A hun- 
dred thousand or more of these reactions are monitored simultaneously 
on different DNA strands, thereby generating large amounts of data 
during short incubation periods (“runs”). 

Once the nucleotide sequence of a segment of DNA has been 
determined, various software tools can be employed to analyze it. 
For example, the amino acid sequence encoded by the DNA can be 
determined and compared to other known amino acid sequences to 
provide information about the polypeptide’s possible function. The 
amino acid sequence also provides clues as to the tertiary structure 
of the protein, particularly those parts of the polypeptide that may 
act as membrane-spanning segments of integral membrane proteins. 
The nucleotide sequence itself can also be compared to other known 
nucleotide sequences. Such comparisons can be used to assess the 
evolutionary relatedness or history of the DNA sequences, to iden- 
tify the DNA fragment just sequenced, or to compare the genomic 
features of various organisms or individuals. 


18.23 DNA Libraries 


DNA cloning is often used to produce DNA libraries, which are 
collections of cloned DNA fragments. Two basic types of DNA librar- 
ies can be created: genomic libraries and cDNA libraries. Genomic 
libraries are produced from total DNA extracted from nuclei and 
contain all of the DNA sequences of the species. Once a genomic 
library of a species is available, researchers can use the collection to 


isolate specific DNA sequences, such as those containing the human 
insulin gene. cDNA libraries, on the other hand, are derived from 
DNA copies of an RNA population. cDNA libraries are typically 
produced from messenger RNAs present in a particular cell type and 
thus correspond to the genes that are active in that type of cell. 


Genomic Libraries 


We will first examine the production of a genomic library. In one 
approach, genomic DNA is treated with one or two restriction 
enzymes that recognize very short nucleotide sequences under 
conditions of low enzyme concentration, such that only a small per- 
centage of susceptible sites are actually cleaved. Two commonly used. 
enzymes that recognize tetranucleotide sequences are HaellI (recog- 
nizes GGCC) and Sau3A (recognizes GATC). A given tetranucleo- 
tide would be expected to occur by chance with such a high frequency 
that any sizable segment of DNA is sensitive to fragmentation. Once 
the DNA is partially digested, the digest is fractionated by gel elec- 
trophoresis or density gradient centrifugation, and those fragments 
of suitable size (e.g., 20 kb in length) are incorporated into lambda 
phage particles. These phage are used to generate the million or so 
plaques needed to ensure that every single segment of a mammalian 
genome is represented. 

Because the DNA is treated with enzymes under conditions in 
which most susceptible sites are not being cleaved, for all practical 
purposes, the DNA is randomly fragmented. Once the phage recom- 
binants are produced, they can be stored for later use and, as such, 
constitute a permanent collection of all the DNA sequences present 
in the genome of the species. Whenever an investigator wants to iso- 
late a particular sequence from the library, the phage particles can be 
grown in bacteria and the various plaques (each originating from the 
infection of a single recombinant phage) screened for the presence of 
that sequence by in situ hybridization. 

The use of randomly cleaved DNA has an advantage in the con- 
struction of a library because it generates overlapping fragments that 
can be used in the analysis of regions of the chromosome extending 
out in both directions from a particular sequence, a technique 
known as chromosome walking. For example, if one isolates a frag- 
ment containing the coding region of a globin gene, that particular 
fragment can be labeled and used as a probe to screen the genomic 
library for fragments with which it overlaps. The process is then 
repeated using the new fragments as labeled probes in successive 
screening steps, as one gradually isolates a longer and longer part of 
the original DNA molecule. Using this approach, one can study the 
organization of linked sequences in an extended region of a 
chromosome. 

Neither plasmid nor lambda phage vectors are suitable for clon- 
ing DNAs larger than about 20 to 25 kb in length. Several vectors 
have been developed that allow investigators to clone much larger 
pieces of DNA. One of the most important of these vectors is the 
yeast artificial chromosome (YAC), which can accept segments 
of foreign DNA as large as 1000 kb (one million base pairs). As the 
name implies, YACs are artificial versions of a normal yeast chromo- 
some. They contain all of the elements of a yeast chromosome that 
are necessary for the structure to be replicated during S phase and 
segregated to daughter cells during mitosis, including one or more 
origins of replication, telomeres at the ends of the chromosomes, 
and a centromere to which the spindle fibers can attach during 


chromosome separation. In addition to these elements, YACs are 731 


constructed to contain (1) a gene whose encoded product allows 
those cells containing the YAC to be selected from those that lack the 
element and (2) the DNA fragment to be cloned. Like other cells, 
yeast cells can take up DNA from their medium, which provides the 
means by which YACs are introduced into the cells. 

Over the past few years, laboratories involved in sequencing 
genomes have relied heavily on an alternate cloning vector, called a 
bacterial artificial chromosome (BAC), which is also able to 
accept large foreign DNA fragments (up to about 500 kb). BACs are 
specialized bacterial plasmids (F factors) that contain a bacterial ori- 
gin of replication and the genes required to regulate their own repli- 
cation. BACs have the advantage over YACs in high-speed sequencing 
projects because they can be cloned in E. coli, which readily picks up 
exogenous DNA, has an extremely short generation time, can be 
grown at high density in simple media, and does not “corrupt” the 
cloned DNA through recombination. 

DNA fragments cloned in YACs and BACs are typically greater 
than 100kb in length. Fragments of such large size are usually 
produced by treatment of DNAs with restriction enzymes that 
recognize particularly long nucleotide sequences (seven to eight 
nucleotides) containing CG dinucleotides. As noted on page 500, 
CG dinucleotides have special functions in the mammalian genome, 
and presumably because of this they do not appear nearly as often as 
would be predicted by chance. The restriction enzyme Nof1, for 
example, which recognizes the 8-nucleotide sequence GCGGCCGC, 
typically cleaves mammalian DNA into fragments several hundred 
thousand base pairs long. These fragments can then be incorporated 
into YACs or BACs and cloned within host yeast or bacterial cells. 


cDNA Libraries 


Up to this point, the discussion has been restricted to cloning DNA 
fragments isolated from extracted DNA, that is, genomic fragments. 
When working with genomic DNA, one is generally seeking to iso- 
late a particular gene or family of genes from among hundreds of 
thousands of unrelated sequences. In addition, the isolation of 
genomic fragments allows one to study a variety of topics, including 
(1) regulatory sequences flanking the coding portion of a gene; 
(2) noncoding intervening sequences; (3) various members of a 
multigene family, which often lie close together in the genome; 
(4) evolution of DNA sequences, including their duplication and 
rearrangement as seen in comparisons of the DNA of different spe- 
cies; and (5) interspersion of transposable genetic elements. 

The cloning of cDNAs, as opposed to genomic DNAs, has been 
especially important in the analysis of gene expression and alterna- 
tive splicing. To produce a cDNA library, a population of mRNAs is 
isolated and used as a template by reverse transcriptase to form a 
population of DNA-RNA hybrids, as shown in FIGURE 18.46a. The 
DNA-RNA hybrids are converted to a double-stranded cDNA 
population by nicking the RNA with RNase H and replacing it with 
DNA by DNA polymerase I. The double-stranded cDNA is then 
combined with the desired vector (in this case a plasmid) and 
cloned as shown in Figure 18.46b. Messenger RNA populations 
typically contain thousands of different messages, but individual 
species may be present in markedly different numbers (abun- 
dance). As a result, a cDNA library has to contain a million or so 
different cDNA clones to be certain that the rarer mRNAs will be 
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represented. In addition, reverse transcriptase is not a very effi- 
cient enzyme and tends to fall off its template mRNA before the 
copying job is completed. As a result, it can be difficult to obtain a 
population of full-length cDNAs. As with experiments using 


FIGURE 18.46 Synthesizing cDNAs for cloning in a plasmid. (a) In this 
method of cDNA formation, a short poly(dT) primer is bound to the poly(A) 
of each mRNA, and the mRNA is transcribed by reverse transcriptase (which 
requires the primer to initiate DNA synthesis). Once the DNA-RNA hybrid 
is formed, the RNA is nicked by treatment with RNase H, and DNA 
polymerase | is added to digest the RNA and replace it with DNA, just as it 
does during DNA replication in a bacterial cell (page 523). (b) To prepare 
the blunt-ended cDNA for cloning, a short stretch of poly(G) is added to 
the 3’ ends of the cDNA and a complementary stretch of poly(C) is added 
to the 3’ ends of the plasmid DNA. The two DNAs are mixed and allowed 
to form recombinants, which are sealed and used to transform bacterial 
cells in which they are cloned. 


genomic DNA fragments, clones must be screened to isolate one 
particular sequence from a heterogeneous population of recombi- 
nant molecules. 

The analysis of cloned cDNAs serves several functions. It is gen- 
erally easier to study a diverse population of cDNAs than the corre- 
sponding population of mRNAs, so one can use the cDNAs to learn 
about the diversity and abundance of mRNAs present in a cell, the 
percentage of mRNAs shared by two different types of cells, or the 
variety of alternatively spliced mRNAs generated from a specific pri- 
mary transcript. A single cloned and amplified cDNA molecule is 
also very useful. In more practical endeavors, the purified cDNA can 
be readily sequenced to provide a shortcut in determining the amino 
acid sequence of a polypeptide; labeled cDNAs are used as probes to 
screen for complementary sequences among recombinant clones; 
and because they lack introns, cDNAs have an advantage over 
genomic fragments when one is attempting to synthesize eukaryotic 
proteins in bacterial cell cultures. 


18.24 DNA Transfer into 
Eukaryotic Cells and Mammalian 
Embryos 


We have discussed in previous sections how eukaryotic genes can be 
isolated, modified, and amplified. In this section we will consider 
some of the ways in which genes can be introduced into eukaryotic 
cells, where they are generally transcribed and translated. One of the 
most widely used strategies to accomplish this goal is to incorporate 
the DNA into the genome of a nonreplicating virus and allow the 
virus to infect the cell. Viral-mediated gene transfer is called 
transduction. Depending on the type of virus used, the gene of 
interest can be expressed transiently for a period of hours to days, or 
it can be stably integrated into the genome of the host cell. Stable 
integration is usually accomplished using modified retroviruses, 
which contain an RNA genome that is reverse transcribed into DNA 
inside the cell. The DNA copy is then inserted into the DNA of the 
host chromosomes. Retroviruses have been used in many of the 
recent attempts at gene therapy to transfer a gene into the cells of a 
patient who is lacking that gene. Generally, these clinical trials have 
not had much success due to the low efficiency of infection of cur- 
rent viral vectors. 

A number of procedures are available to introduce naked DNA 
into cultured cells, a process called transfection. Most often the 
cells are treated with either calcium phosphate or DEAE-dextran, 
both of which form a complex with the added DNA that promotes its 
adherence to the cell surface. It is estimated that only about 1 in 10° 


cells takes up the DNA and incorporates it stably into the chromo- 
somes. It is not known why this small percentage of cells in the pop- 
ulation are competent to be transfected, but those that are transfected 
generally pick up several fragments. One way to select for those cells 
that have taken up the foreign DNA is to include a gene that allows 
the transfected cells to grow in a particular medium in which non- 
transfected cells cannot survive. Because transfected cells typically 
take up more than one fragment, the gene used for selection need 
not be located on the same DNA fragment as the gene whose role is 
being investigated (called the transgene). 

Two other procedures to transfect cells are electroporation and 
lipofection. In electroporation, the cells are incubated with DNA in spe- 
cial vials that contain electrodes that deliver a brief electric pulse. 
Application of the current causes the plasma membrane to become 
transiently permeable to DNA molecules, some of which find their way 
into the nucleus and become integrated into the chromosomes. In lipo- 
fection, cells are treated with DNA that is bound to positively charged 
lipids (cationic liposomes) that are capable of fusing with the lipid 
bilayer of the cell membrane and delivering the DNA to the cytoplasm. 

One of the most direct ways to introduce foreign genes into a 
cell is to microinject DNA directly into the cell’s nucleus. The nuclei 
of oocytes and eggs are particularly well suited for this approach. 
Xenopus oocytes, for example, have long been used to study the 
expression of foreign genes. The nucleus of the oocyte contains all 
the machinery necessary for RNA synthesis; when foreign DNA is 
injected into the nucleus, it is readily transcribed. In addition, the 
RNAs synthesized from the injected templates are processed nor- 
mally and transported to the cytoplasm, where they are translated 
into proteins that can be detected immunologically or by virtue of 
their specific activity. 

Another favorite target for injected DNA is the nucleus of a 
mouse embryo (FIGURE 18.47). The goal in such experiments is not 
to monitor the expression of the gene in the injected cell, but to have 
the foreign DNA become integrated into the egg’s chromosomes to 
be passed on to all the cells of the embryo and subsequent adult. 
Animals that have been genetically engineered so that their chromo- 
somes contain foreign genes are called transgenic animals, and 
they provide a means to monitor when and where in the embryo 
particular genes are expressed (as in Figure 12.34), and to determine 


FIGURE 18.47 Microinjection of DNA into the nucleus of a recently 
fertilized mouse egg. The egg is held in place by a suction pipette shown at 
the right, while the injection pipette is shown penetrating the egg at the left. 
Source: From Thomas E. Wagner et al., Proc. Nat'l. Acad. Sci. U.S.A. 
78:6377, 1981. 
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and life of the animal. 


Transgenic Animals 


In 1981, Ralph Brinster of the University of Pennsylvania and 
Richard Palmiter of the University of Washington succeeded in 
introducing a gene for rat growth hormone (GH) into the fertilized 
eggs of mice. The injected DNA was constructed so as to contain 
the coding portion of the rat GH gene ina position just downstream 
from the promoter region of the mouse metallothionein gene. 
Under normal conditions, the synthesis of metallothionein is 
greatly enhanced following the administration of metals, such as 
cadmium or zinc, or glucocorticoid hormones. The metallothio- 
nein gene carries a strong promoter, and it was hoped that placing 
the GH gene downstream from it might allow the GH gene to be 
expressed following treatment of the transgenic mice with metals or 
glucocorticoids. As illustrated in FIGURE 18.48, this expectation 
was fully warranted. 

On the practical side, transgenic animals provide a mechanism 
for creating animal models, which are laboratory animals that 
exhibit a particular human disease they would normally not be sub- 
ject to. This approach is illustrated by the transgenic mice that carry a 
gene encoding a mutant version of the human amyloid precursor pro- 
tein (APP). As discussed on page 64, these mice develop neurological 
and behavioral symptoms reminiscent of Alzheimer’s disease and are 
an important resource in the development of therapies for this terri- 
ble disease. Transgenic animals are also being developed as part of 
agricultural biotechnology. For example, pigs born with foreign 
growth hormone genes incorporated into their chromosomes grow 
much leaner than control animals lacking the genes. The meat of the 


FIGURE 18.48 Transgenic mice. This photograph shows a pair of 
littermates at age 10 weeks. The larger mouse developed from an egg that 
had been injected with DNA containing the rat growth hormone gene 
placed downstream from a metallothionein promoter. The larger mouse 
weighs 44 g; the smaller, uninjected control weighs 29g. The rat GH gene 
was transmitted to offspring that also grew larger than the controls. 

Source: Courtesy of Ralph L. Brinster, from work reported in R. D. Palmiter 
et al., Nature 300:611, 1982. Reprinted by permission from Macmillan 
Publishers Ltd. 
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stimulates the conversion of nutrients into protein rather than fat. 


Transgenic Plants 


Plants are also prime candidates for genetic engineering. Researchers 
have demonstrated that whole plants can be grown from individual 
cultured plant cells. This provides the opportunity to alter the genetic 
composition of plants by introducing DNA into the chromosomes of 
cultured cells that can subsequently be grown into mature plants. One 
way to introduce foreign genes is to incorporate them into the Ti plas- 
mid of the bacterium Agrobacterium tumefaciens. Outside of the labo- 
ratory, this bacterium lives in association with dicotyledonous plants, 
where it causes the formation of tumorous lumps called crown galls. 
During an infection, a section of the Ti plasmid termed the T-DNA 
region is passed from the bacterium into the plant cell. This part of the 
plasmid becomes incorporated into the plant cell’s chromosomes and 
induces the cell to proliferate and provide nutrients for the bacterium. 

The Ti plasmid can be isolated from bacteria and linked with 
foreign genes to produce a recombinant plasmid that is taken up in 
culture by undifferentiated dicotyledonous plant cells, including 
those of carrots and tobacco. FIGURE 18.49 shows the use of recom- 
binant Ti plasmids to introduce foreign DNA into meristem cells at 
the tip of newly formed shoots. The shoots can then be rooted and 
grown into mature plants. This technique, which is called T-DNA 
transformation, has been used to transform plant cells with genes 
derived from bacteria that code for insect-killing toxins, thus pro- 
tecting the plants from insect predators. 

Other procedures have been developed to introduce foreign 
genes into cells of monocotyledonous plants. In one of the more exotic 
approaches, plant cells can serve as targets for microscopic, DNA- 
coated tungsten pellets fired by a “gene gun” This technique has been 
used to genetically alter a number of different types of plant cells. 

The two most important activities that plant geneticists might 
improve by genetic engineering are photosynthesis and nitrogen 
fixation, both crucial bioenergetic functions. Any significant 
improvement in photosynthetic efficiency would mean great 
increases in crop production. It is hoped that a modified version of 
the CO,-fixing enzyme might be engineered that is less susceptible 
to photorespiration (page 216). Nitrogen fixation is an activity car- 
ried out by certain genera of bacteria (e.g., Rhizobium) that live in a 
symbiotic relationship with certain plants (e.g., soybean, peanut, 
alfalfa, and pea). The bacteria reside in swellings, or leguminous nod- 
ules, located on the roots, where they remove N, from the atmo- 
sphere, reduce it to ammonia, and deliver the product to the cells of 
the plant. Geneticists are looking for a way to isolate genes involved 
in this activity from the bacterial genome and introduce them into 
the chromosomes of nonleguminous plants that presently depend 
heavily on added fertilizer for their reduced nitrogenous com- 
pounds. Alternatively, it might be possible to alter the genome of 
either plant or bacteria so that new types of symbiotic relationships 
can be developed. 


18.25 Gene Editing and Silencing 


Until fairly recently, investigators discovered new genes and 
learned of their function by screening for mutants that exhibited 
abnormal phenotypes (see Section 8.2 on the study of protein 
secretion). It was only through the process of random mutation 
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FIGURE 18.49 Formation of transgenic plants using the Ti plasmid. 
The transgene is spliced into the DNA of the Ti plasmid, which is 
reintroduced into host bacteria. Bacteria containing the recombinant 
plasmid are then used to transform plant cells, in this case cells of the 
meristem at the tip of a dissected shoot apex. The transformed shoots are 
transferred to a selection medium where they develop roots. The rooted 
plants can then be transferred to potting soil. 


that the existence of genes became apparent. This process of learn- 
ing about genotypes by studying mutant phenotypes is known as 
forward genetics. Since the development of techniques for gene 


cloning and DNA sequencing, researchers have been able to iden- 
tify and study genes directly without knowing anything about the 
function of their encoded protein. This condition has become 
especially familiar in recent years with the sequencing of entire 
genomes and the identification of thousands of genes whose func- 
tion remains unknown. Over the past two decades, researchers 
have developed means to carry out reverse genetics, which is a pro- 
cess of determining phenotype (i.e., function) based on the knowl- 
edge of genotype. The basic approach of reverse genetics is to 
eliminate the function of a specific gene, and then determine what 
effect the elimination of that function has on phenotype. We will 
first consider how researchers introduce specific mutations into 
genes in vitro, then discuss two widely used techniques for elimi- 
nating gene function in vivo. 


In Vitro Mutagenesis 


As is evident throughout this book, the isolation of naturally occur- 
ring mutants has played an enormously important role in determin- 
ing the function of genes and their products. But natural mutations 
are rare events, and it isn't feasible to use such mutations to study 
the role of particular amino acid residues in the function of a par- 
ticular protein. Rather than waiting for an organism to appear with 
an unusual phenotype and identifying the responsible mutation, 
researchers can mutate the gene (or its associated regulatory 
regions) in a desired way and observe the resulting phenotypic 
change. These techniques, collectively termed in vitro mutagenesis, 
require that the gene, or at least the gene segment, to be mutated has 
been cloned. 

A procedure developed by Michael Smith of the University of 
British Columbia called site-directed mutagenesis (SDM) 
allows researchers to make very small, specific changes in a DNA 
sequence, such as the substitution of one base for another, or the 
deletion or insertion of a very small number of bases. SDM is usu- 
ally accomplished by first synthesizing a DNA oligonucleotide con- 
taining the desired change, allowing this oligonucleotide to 
hybridize to a single-stranded preparation of the normal DNA, and 
then using the oligonucleotide as a primer for DNA polymerase. 
The polymerase elongates the primer by adding nucleotides that are 
complementary to the normal DNA. The modified DNA can then 
be cloned and the effect of the genetic alteration determined by 
introducing the DNA into an appropriate host cell. Scientists typi- 
cally use SDM to ask very targeted questions about the function of a 
gene or protein. For example, they might change one amino acid 
into another to gain insight into the role of that specific site in the 
overall function of a protein. Alternatively, they might introduce 
small changes in the regulatory region of a gene and determine the 
effect on gene expression. If the intent of site-directed mutagenesis 
is simply to eliminate the function of a gene, less specific methods 
can be used. For example, cutting a gene sequence at a restriction 
site, using DNA polymerase to make the single-stranded regions of 
the sticky ends into double-stranded DNA, and then ligating those 
ends back together can destroy the reading frame of a protein. In 
other instances, an entire restriction fragment might be removed 
from the gene. 

Constructing a mutation in vitro is only one aspect of reverse 
genetics. To study the effect of an engineered mutation on pheno- 
type, it is necessary to substitute the mutant allele for the normal 
gene in the organism in question. The development of a technique 
for introducing mutations into the genome of the mouse opened the 


door for reverse genetic studies in mammals, and literally revolu- 735 


tionized the study of mammalian gene function. 


Knockout Mice 


We have described in many places in this text the phenotypes of 
mice that lack a functional copy of a particular gene. For example, it 
was noted that mice lacking a functional copy of the p53 gene invari- 
ably develop malignant tumors (Section 16.7). These animals, which 
are called knockout mice, can provide a unique insight into the 
genetic basis of a human disease, as well as a basis for studying the 
various cellular activities in which the product of a particular gene 
might be engaged. Knockout mice are born as the result of a series of 
experimental procedures depicted in FIGURE 18.50. 

The various procedures used to generate knockout mice were 
developed in the 1980s by Mario Capecchi at the University of Utah, 
Oliver Smithies at the University of Wisconsin, and Martin Evans at 
Cambridge University. The first step is to isolate an unusual type of 
cell that has virtually unlimited powers of differentiation. These 
cells, called embryonic stem cells (page 18), are found in the 
mammalian blastocyst, which is an early stage of embryonic 
development comparable to the blastula stage of other animals. A 
mammalian blastocyst (Figure 18.50) is composed of two distinct 
parts. The outer layer makes up the trophectoderm, which gives rise 
to most of the extraembryonic membranes characteristic of a mam- 
malian embryo. The inner surface of the trophectoderm contacts a 
cluster of cells called the inner cell mass that projects into the spa- 
cious cavity (the blastocoel). The inner cell mass gives rise to the 
cells that make up the embryo. The inner cell mass contains the 
embryonic stem (ES) cells, which differentiate into all of the various 
tissues of which a mammal is composed. 

ES cells can be isolated from blastocysts and cultured in vitro 
under conditions where the cells grow and proliferate. The ES cells 
are then transfected with a DNA fragment containing a nonfunc- 
tional, mutant allele of the gene to be knocked out, as well as antibi- 
otic-resistance genes that can be used to select for cells that have 
incorporated the altered DNA into their genome. Of those cells that 
take up the DNA, approximately one in 10* undergo a process of 
homologous recombination in which the transfected DNA replaces the 
homologous DNA sequence. Through use of this procedure, ES cells 
that are heterozygous for the gene in question are produced and then 
selected for on the basis of their antibiotic resistance. In the next step, 
a number of these donor ES cells are injected into the blastocoel of a 
recipient mouse embryo. In the protocol depicted in Figure 18.50, the 
recipient embryo is obtained from a black strain. The injected embryo 
is then implanted into the oviduct of a female mouse that has been 
hormonally prepared to carry the embryo to term. As the embryo 
develops in its surrogate mother, the injected ES cells join the embryos 
own inner cell mass and contribute to formation of embryonic tis- 
sues, including the germ cells of the gonads. These chimeric mice can 
be recognized because their coat contains characteristics of both the 
donor and recipient strains. To determine whether the germ cells also 
contain the knockout mutation, the chimeric mice are mated to a 
member of an inbred black strain. If the germ cells do contain the 
knockout mutation, the offspring will be heterozygous for the gene in 
all of their cells. Heterozygotes can be distinguished by their brown 
coat coloration. These heterozygotes are then mated to one another 
to produce offspring that are homozygous for the mutant allele. These 
are the knockout mice that lack a functional copy of the gene. Any 
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FIGURE 18.50 Formation of knockout mice. The steps are described in 
the text. 


gene within the genome, or any DNA sequence for that matter, can be 
altered in any desired manner using this experimental approach. 

In some cases, the deletion of a particular gene can lead to the 
absence of a particular process, which provides convincing evidence 
that the gene is essential for that process. Often, however, the dele- 
tion of a gene that is thought to participate in an essential process 
causes little or no alteration in the animal's phenotype. Such results 
can be difficult to interpret. It is possible, for example, that the gene 
is not involved in the process being studied or, as is usually the case, 
the absence of the gene product is compensated by the product of an 
entirely different gene. Compensation by one gene for another can 
be verified by producing mice that lack both of the genes in question 
(i.e., a double knockout). 

In other cases, the absence of a gene leads to the death of the 
mouse during early development, which also makes it difficult to 
determine the role of the gene in cellular function. Researchers are 
often able to get around this problem by using a technique that 
allows a particular gene to be switched off in only one or several 
desired tissues, while the gene is expressed in the remainder of the 
animal. These conditional knockouts, as they are called, allow 
researchers to study the role of the gene on the development or func- 
tion of the affected tissue. A current goal of the International 
Knockout Mouse Consortium is to generate conditional knockouts 
of every gene in the mouse genome. 


RNA Interference 


Knockout mice have been an invaluable strategy to learn about gene 
function, but the generation of these animals is laborious and costly. 
In the past few years a new technique in the realm of reverse genetics 
has come into widespread use. As discussed in Section 11.10, RNA 
interference (RNAi) is a process in which a specific mRNA is 
degraded due to the presence of a small double-stranded siRNA 
whose sequence is contained within the mRNA sequence. The func- 
tions of plant, nematode, or fruit fly genes can be studied by simply 
introducing an siRNA into one of these organisms by various means 
and examining the phenotype of the organism that results from 
depletion of the corresponding mRNA. Determining the roles of 
genes at the cellular level, rather than the organismal level, can be 
accomplished by studying the effects of these same siRNAs on the 
activities of cultured cells. Using these approaches, researchers can 
gather information about the functions of large numbers of genes in 
a relatively short period of time. As discussed on page 432, RNAi can 
be used to study gene function in mammalian cells by incubating the 
cells with small dsRNAs encapsulated in lipids or by genetically 
engineering the cells to produce the siRNAs themselves. Once inside 
the cell, the siRNA guides the degradation of the target mRNA, leav- 
ing the cell unable to produce additional protein encoded by that 
gene. Any deficiencies in the phenotype of the cell can be attributed 
to a marked reduction in the level of the protein being investigated. 
As in the case of gene knockouts, the absence of a phenotype follow- 
ing treatment with an siRNA does not necessarily mean that the 
gene in question is not involved in a particular process, as another 
gene may be capable of compensating for the absence of expression 
of the gene being targeted. 

Libraries containing thousands of siRNAs, or vectors contain- 
ing DNA encoding these RNAs, are also available for the study of 
gene function in a number of model organisms and in human cells. 


Determining gene function by RNA interference. 
The figure shows a gallery of immunofluorescence images of cultured 
Drosophila cells that had been incubated with various double-stranded 
siRNAs. The cells shown here had accumulated in metaphase of mitosis as 
the result of a separate treatment that blocked progression into anaphase. 
During the course of this study, over 4 million cells were examined. It was 
possible to screen such a large number of cells by having a computer select 
the images of cells that were in metaphase and then crop and assemble the 
images into panels as shown in this photograph. The images could then be 
screened for the presence of abnormal mitotic spindles either by human 
observers or by computer analysis using programs designed to detect 
specific features of an abnormal spindle. (An even larger study of the same 
type can be found in Nature 464:721, 2010.) 


Source: Courtesy of Gohta Goshima and Ronald D. Vale. 


Researchers using these libraries can study the effects of the elimina- 
tion of a gene's expression on virtually any cellular process. It should 
be possible, for example, to determine which genes are involved in 
mitotic spindle assembly by treating cells with a library of siRNAs 
from across the entire genome and then screening all of the treated 
cells at the time of cell division for the presence of an abnormal 
mitotic spindle. shows a gallery of images of cultured 
Drosophila cells in the metaphase stage of mitosis. Each of the cells 
depicted in this gallery had been treated with an siRNA directed 
against a different gene in the Drosophila genome. Over 14,000 
different siRNAs were tested in this study, of which approximately 


200 caused the treated cells to have an abnormal metaphase spindle. 737 


More than half of the siRNAs that affected mitotic spindle assembly 
targeted genes that were not previously known to be involved in the 
formation of this structure, thus providing new insights into the 
functions of these genes. 


Genome Editing Using 
Engineered Nucleases 


As discussed in Chapter 13, cells possess a number of systems to rap- 
idly recognize and repair DNA damage. When double-stranded 
breaks (DSB) occur in DNA, the free ends are bound by enzymes 
and ligated back together. Repair can either happen by simply gluing 
the two free ends together by a process known as nonhomologous 
end joining (page 534), or by homology-directed repair in which a 
homologous piece of DNA is used as a template to guide the repair 
process. 

Over the past decade, researchers have developed several strate- 
gies that utilize DSBs and repair enzymes in order to make specific 
changes to the genome of a cell or organism. Broadly, all of the tech- 
niques involve making a DSB at a specific target sequence using an 
engineered nuclease. Depending on the experimental conditions, a 
number of outcomes are then possible, including sequence insertion, 
deletion, or alteration. 

The key challenge of this technique has been to create nucleases 
that are able to recognize specific target sequences and efficiently 
cleave the DNA only in that spot. The first engineered nucleases, 
called zinc-finger nucleases (ZFNs) were built using the DNA-rec- 
ognition domain of zinc finger proteins fused to a nonspecific nucle- 
ase. Different DNA sequences could be targeted by varying the 
order, number, and type of zinc finger modules used in the ZFN. 
TALENs were developed later using a different DNA-recognizing 
protein module, but followed a similar design. 

A new genome editing technique, named CRISPR, has taken the 
genome editing field by storm since it was first characterized in 2012 
by the laboratories of Jennifer Doudna (University of California, 
Berkeley) and Emmanuelle Charpentier (Umea University) and 
demonstrated to work in mammalian cells by the laboratory of Feng 
Zhang (Broad Insitute). CRISPR was first discovered for its role in 
bacterial innate immunity, as discussed in Section 11.13. When bac- 
teria and archaea are exposed to phage, some species take up seg- 
ments of the phage DNA and incorporate them into their own 
genome. The DNA can later be transcribed into RNA and incorpo- 
rated into a protein known as Cas9. Cas9 is a nuclease and uses the 
short phage sequences in its RNA as a template to identify and destroy 
phage DNA that it encounters in the cell through the introduction of 
DSBs. Doudna and Charpentier’s groups discovered that by intro- 
ducing lab-designed RNAs to Cas9, they could engineer the nuclease 
to identify and cut DNA of arbitrary sequence with high specificity 
and efficiency in vitro ( 18.52). In the years following this dis- 
covery, the CRISPR system has been demonstrated to work in numer- 
ous model systems, including cultured human cells and whole adult 
organisms (See the Experimental Walkthrough Video: Editing the genome 
using CRISPR for this chapter). While current research is moving rapidly 
towards developing CRISPR as a method to treat genetic diseases in 
humans, there are still numerous questions (both ethical, as discussed 
in the chapter opener, and technical) that remain to be addressed. 
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FIGURE 18.52 Overview of the CRISPR technique of genome editing. Cas9 (blue) utilizes a guide RNA (purple) that can recognize and base-pair with a 
target genomic sequence (thick red strands), Cas9 also requires a short sequence, known as a PAM sequence (orange), downstream for the target sequence. 
Upon finding a matching sequence, Cas9 cuts both strands of target DNA, resulting in a double-stranded break which is rapidly repaired by DNA repair 
enzymes. CRISPR can be used to edit, delete, or add new DNA to a specific location in the genome. 


Source: From Jennifer A. Doudna, Nature 495:50-51, 2013. Reprinted by permission from Macmillan Publishers Ltd. 


18.26 The Use of Antibodies 


As discussed in Chapter 17, antibodies (or immunoglobulins) are 
proteins produced by lymphoid tissues in response to the presence 
of foreign materials, or antigens. One of the most striking properties 
of antibodies, and one that makes them so useful to biological 
researchers, is their remarkable specificity. A cell may contain thou- 
sands of different proteins, yet a preparation of antibodies binds only 
to those select molecules within the cell having a small part that fits 
into the antigen-binding sites of the antibody molecules. Antibodies 
can often be obtained that distinguish between two polypeptides 
that differ by only a single amino acid or by a single posttranslational 
modification (as on page 496). 

Biologists have long taken advantage of antibodies and have 
developed a wide variety of techniques involving their use. There are 
basically two different approaches to the preparation of antibody mol- 
ecules that interact with a given antigen. In the traditional approach, an 
animal (typically a rabbit or goat) is repeatedly injected with the anti- 
gen, and after a period of several weeks, blood is drawn that contains 
the desired antibodies. Whole blood is treated to remove cells and clot- 
ting factors so as to produce an antiserum, which can be tested for its 
antibody titer and from which the immunoglobulins can be purified. 
Although this method of antibody production remains in use, it suffers 
from certain inherent disadvantages. Because of the mechanism of 
antibody synthesis, an animal invariably produces a variety of different 
species of immunoglobulin, that is, immunoglobulins with different V 
regions in their polypeptide chains, even when challenged with a 
highly purified preparation of antigen. An antiserum that contains a 
variety of immunoglobulins that bind to the same antigen is said to be 


polyclonal. Because immunoglobulins are too similar in structure to be 
fractionated, it becomes impossible to obtain a preparation of a single 
purified species of antibody by this technique. 

In 1975, Cesar Milstein and Georges Kohler of the Medical 
Research Council in Cambridge, England, carried out a far-reaching 
set of experiments that led to the preparation of antibodies directed 
against specific antigens. To understand their work, it is necessary to 
digress briefly. A given clone of antibody-producing cells (which is 
derived from a single B lymphocyte) synthesizes antibodies having 
identical antigen-combining sites. The heterogeneity of the antibodies 
produced when a single purified antigen is injected into an animal 
results from the fact that many different B lymphocytes are activated, 
each having membrane-bound antibodies with an affinity for a differ- 
ent part of the antigen. An important question arose: Was it possible to 
circumvent this problem and obtain a single species of antibody mole- 
cule? Consider for a moment the results of a procedure in which one 
injects an animal with a purified antigen, waits a period of weeks for 
antibodies to be produced, removes the spleen or other lymphoid 
organs, prepares a suspension of single cells, isolates those cells produc- 
ing the desired antibody, and grows these particular cells as separate 
colonies so as to obtain large quantities of this particular immunoglob- 
ulin. If this procedure were followed, one should obtain a preparation 
of antibody molecules produced by a single colony (or clone) of cells, 
which is referred to as a monoclonal antibody. But antibody-pro- 
ducing cells do not grow and divide in culture, so an additional manip- 
ulation had to be introduced to obtain monoclonal antibodies. 

Malignant myeloma cells are a type of cancer cell that grows 
rapidly in culture and produces large quantities of antibody. But 
myeloma cells are of little use as analytic tools because they are not 


formed in response to a particular antigen. Instead, myeloma cells 
develop from the random conversion of a normal lymphocyte to the 
malignant state, and therefore, they produce the antibody that was 
being synthesized by the particular lymphocyte before it became 
malignant. 

Milstein and Kéhler combined the properties of these two types of 
cells, the normal antibody-producing lymphocyte and the immortal 
myeloma cell. They accomplished this feat by fusing the two types of 
cells to produce hybrid cells, called hybridomas, that grow and prolif- 
erate indefinitely, producing large amounts of a single (monoclonal) 
antibody. The antibody produced is one that was being synthesized by 
the normal lymphocyte prior to fusion with the myeloma cell. 

The procedure for the production of monoclonal antibodies is 
illustrated in FIGURE 18.53. The antigen (whether present in a solu- 
ble form or as part of a cell) is injected into the mouse to cause the 
proliferation of specific antibody-forming cells (step 1, Figure 18.53). 
After a period of several weeks, the spleen is removed and dissoci- 
ated into single cells (step 2), and the antibody-producing lympho- 
cytes are fused with a population of malignant myeloma cells (step 
3), which makes the hybrid cells immortal, that is, capable of unlim- 
ited cell division. Hybrid cells (hybridomas) are selected from non- 
fused cells by placing the cell mixture in a medium in which only the 
hybrids can survive (step 4). Hybridomas are then grown clonally in 
separate wells (step 5) and individually screened for production of 
antibody against the antigen being studied. Hybrid cells containing 
the appropriate antibody (step 6) can then be cloned in vitro or in 
vivo (by growing as tumor cells in a recipient animal), and unlimited 
amounts of the monoclonal antibody can be prepared. Once the 
hybridomas have been produced, they can be stored indefinitely in a 
frozen state, and aliquots can be made available to researchers 
around the world. One of the most important features of this meth- 
odology is that one does not have to begin with a purified antigen to 
obtain an antibody. In fact, the antigen to which the monoclonal 
antibody is sought may even be a minor component of the entire 
mixture. 

Once a preparation of antibody molecules is in hand, whether 
obtained by conventional immunologic techniques or by formation 
of hybridomas, these molecules can be used as highly specific probes 
in a variety of analytic techniques. For example, antibodies can be 
used in protein purification. When a purified antibody is added to a 
crude mixture of proteins, the specific protein being sought selec- 
tively combines with the antibody and precipitates from solution. 
Antibodies can also be used in conjunction with various types of 
fractionation procedures to identify a particular protein (antigen) 
among a mixture of proteins. In a Western blot, for example, a 
mixture of proteins is first fractionated by two-dimensional gel elec- 
trophoresis (as in Figure 18.31). The fractionated proteins are then 
transferred to a sheet of nitrocellulose filter, which is then incubated 
with a preparation of antibodies that have been labeled either radio- 
actively or fluorescently. The location on the filter of the specific 
protein bound by the antibody can be determined from the location 
of the bound radioactivity or fluorescence. 

In addition to their use in research, monoclonal antibodies are 
playing a valuable role in diagnostic medicine in tests to determine 
the concentration of specific proteins in the blood or urine. In one 
example, monoclonal antibodies form the basis of certain home- 
pregnancy tests that monitor the presence of a protein (chorionic 
gonadotrophin) that appears in the urine a few days after concep- 
tion. Monoclonal antibodies have also become very useful as thera- 
peutic agents in treating human disease (page 655). To date, efforts 
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FIGURE 18.53 Formation of monoclonal antibodies. The steps are 
described in the text. The HAT medium is so named because it contains 
hypoxanthine, aminopterin, and thymidine. This medium allows cells with a 
functional hypoxanthine—guanine phosphoribosyl transferase (HGPRT) to 
grow, but it does not support the growth of cells lacking this enzyme, such 
as the unfused myeloma cells used in this procedure. 


to develop human hybridomas that produce human monoclonal 
antibodies have not been successful. To get around this stumbling 
block, mice have been genetically engineered so that the antibodies 
they produce are increasingly human in amino acid sequence. A 
number of these humanized monoclonal antibodies have been 
approved for the treatment of several diseases. More recently, mice 
have been engineered so that their immune system is essentially 
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human in structure. 

The first fully human antibody (Humira), which is approved for 
the treatment of rheumatoid arthritis (page 669), was produced by a 
very different technique that uses bacteriophage rather than hybri- 
domas as the basis for monoclonal antibody production. The tech- 
nique is known as phage display. In this technique, billions of 
different phage particles are generated in which each phage carries a 
gene encoding a human antibody molecule that possesses a unique 
variable region (page 679). Different phage within this vast library 
encode different antibodies, that is, antibodies with different varia- 
ble regions. In each case, the antibody gene is fused to a gene encod- 
ing one of the viral coat proteins so that when the phage is assembled 
within a host cell, the antibody molecule is displayed on the surface 
of the viral particle. Suppose that you have a protein (antigen) that 
you believe would be a good target for a particular therapeutic anti- 
body. The antigen is purified and allowed to interact with a sample 
of each of the billions of phage particles that make up the phage 
library. Those phage that bind to the antigen with high affinity can 
be identified and allowed to multiply with an appropriate host cell. 
Once it has been amplified this way, the DNA encoding the antibody 
gene can be isolated and used to transfect an appropriate mamma- 
lian cell. The genetically engineered cells can then be grown in large 
cultures to produce therapeutic quantities of the antibody. 
Production of antibodies in cultured mammalian cells is an expen- 
sive venture and alternative “living factories” are being pursued. 
Among the alternatives being considered for this purpose are goats, 
rabbits, and tobacco cells. 

A quick scan through this text will reveal numerous micro- 
graphs depicting the immunolocalization of a particular protein 


within a cell as visualized in the light or electron microscope. 
Immunolocalization of proteins within a cell depends on the use of 
antibodies that have been specifically prepared against that particu- 
lar protein. Once prepared, the antibody molecules are linked (con- 
jugated) to a substance that makes them visible under the microscope, 
but does not interfere with the specificity of their interactions. For 
use with the light microscope, antibodies are generally complexed 
with small fluorescent molecules, such as fluorescein or rhodamine, 
to generate derivatives that are then incubated with the cells or 
sections of cells. The binding sites are then visualized with the 
fluorescence microscope. This technique is called direct immuno- 
fluorescence. It is often preferable to carry out antigen localization 
by a variation of this technique called indirect immunofluores- 
cence. In indirect immunofluorescence, the cells are incubated with 
unlabeled antibody, which is allowed to form a complex with the 
corresponding antigen. The location of the antigen-antibody couple 
is then revealed in a second step using a preparation of fluorescently 
labeled antibodies whose combining sites are directed against the 
antibody molecules used in the first step. Indirect immunofluores- 
cence results in a brighter image because numerous secondary 
antibody molecules can bind to a single primary antibody. Indirect 
immunofluorescence also has a practical advantage; the conjugated 
(fluorescent) antibody is readily purchased from vendors. 
Immunofluorescence provides remarkable clarity because only the 
proteins bound by the antibody are revealed to the eye; all of the 
unlabeled materials remain invisible. Localization of antigens in the 
electron microscope is accomplished using antibodies that have been 
tagged with electron-dense materials, such as the iron-containing 
protein ferritin or gold particles. An example of this technique is 
shown in Figure 8.23c,d. 


Glossary © $A 


Many key terms and concepts are defined here. The parenthetical numbers 
following most definitions identify the chapter and section in which the 
term is first defined. For example, a term followed by (3.2) is first defined 
in Chapter 3, Section 2: “Enzymes.” Terms defined in The Human 
Perspective or Experimental Pathways essays are identified parenthetically 
with HP or EP. For example, a term followed by (EP1) is first defined in the 
Experimental Pathways essay of Chapter 1. 


A 


A (aminoacyl) site Site into which aminoacyl-tRNAs enter the 
ribosome-mRNA complex. (11.16) 


Absorption spectrum A plot of the intensity of light absorbed relative 
to its wavelength. (6.4) 


Acid A molecule that is capable of releasing a hydrogen ion. (2.4) 


Acid hydrolases Hydrolytic enzymes with optimal activity at an acid 
pH. (8.15) 


Actin A globular, cytoskeletal protein that polymerizes to form a 
flexible, helical filament capable of interacting with myosin. Actin 
filaments provide mechanical support for eukaryotic cells, determine 
the cell’s shape, and enable cell movements. (9.10) 


Actin-binding proteins Any of nearly 100 different proteins belonging 
to numerous families that affect the assembly of actin filaments, their 
physical properties, and their interactions with one another and with 
other cellular organelles. (9.13) 


Action potential The collective changes in membrane potential, 
beginning with depolarization to threshold and ending with return to 
resting potential, that occur with stimulation of an excitable cell and act 
as the basis for neural communication. (4.16) 


Action spectrum A plot of the relative rate (or efficiency) of a process 
produced by light of various wavelengths. (6.4) 


Activation energy The minimal kinetic energy needed for a reactant to 
undergo a chemical reaction. (3.5) 


Active site The part of an enzyme molecule that is directly involved in 
binding the substrate. (3.5) 


Active transport The energy-requiring process in which a substance 
binds to a specific transmembrane protein, changing its conformation 
to allow passage of the substance through the membrane against 
the electrochemical gradient for that substance. (4.14) 


Adaptive immune response A specific response to a pathogen that 
requires previous exposure to that agent. Includes responses mediated 
by antibodies and T lymphocytes. (17.1) 


Adenosine triphosphate (ATP) Nucleotide consisting of adenosine 
bonded to three phosphate groups; it is the principal immediate-energy 
source for prokaryotic and eukaryotic cells. (2.18) 


Adherens junctions (zonulae adherens) Adherens junctions are a type 
of specialized adhesive junction particularly common in epithelia. The 
plasma membranes in this region are separated by 20 to 35 nm and are 
sites where cadherin molecules are concentrated. The cells are held 
together by linkages between the extracellular domains of cadherin 
molecules that bridge the gap between neighboring cells. (7.9) 


Aerobes Organisms dependent on the presence of oxygen to metabolize 
energy-rich compounds. (5.1) 


Affinity chromatography Protein purification technique that 
utilizes the unique structural properties of a protein that allow the 
molecule to be specifically withdrawn from solution while all other 
molecules remain in solution. The solution is passed through a 
column in which a specific interacting molecule (such as a substrate, 
ligand, or antigen) is immobilized by linkage to an inert material (the 
matrix). (18.11) 


Alleles Alternate forms of the same gene. (10.1) 


Allosteric modulation Modification of the activity of an enzyme 
through interaction with a compound that binds to a site (i.e., allosteric 
site) other than the active site. (3.12) 


Allosteric site A region ofa protein that causes the protein to change its 
conformation when it binds another molecule. (3.12) 


Alpha (a) helix One possible secondary structure of polypeptides, in 
which the backbone of the chain formsa spiral (i-e., helical) conformation. 
(2.9) 


Amide bond The chemical bond that forms between carboxylic acids 
and amines (or acidic and amino functional groups) while producing a 
molecule of water. (2.5) 


Aminoacyl-tRNA synthetase (aaRS) An enzyme that covalently links 
amino acids to the 3’ ends of their cognate tRNA(s). Each amino acid is 
recognized by a specific aminoacyl-tRNA synthetase. (11.15) 


Amphipathic The biologically important property of a molecule 
having both hydrophobic and hydrophilic regions. (2.7, 4.2) 


Anabolic pathway A metabolic pathway resulting in the synthesis of 
relatively complex products. (3.9) 


Anaerobes Organisms that utilize energy-rich compounds through 
oxygen-independent metabolic pathways such as glycolysis and 
fermentation. (5.1) 

Anaerobic pathway Enzymatic pathway for generating ATP from 
glucose without the need for oxygen. (3.10) 

Anaphase The stage of mitosis during which sister chromatids separate 
from one another. (14.10) 

Anaphase A The movement of the chromosomes toward the poles 
during mitosis. (14.10) 

Anaphase B Elongation of the mitotic spindle, which causes the two 
spindle poles to move farther apart. (14.10) 

Angstrom (A) The unit, equivalent to 0.1 nm, used to describe atomic 
and molecular dimensions. (1.7) 

Animal models Laboratory animals that exhibit characteristics of a 
particular human disease. (18.24) 

Anion An ionized atom or molecule with a net negative charge. (2.1) 
Antenna The light harvesting molecules of a photosynthetic unit that 
absorbs photons of varying wavelengths and transfers the energy to the 
pigment molecules at the reaction center. (6.5) 

Antibody An immunoglobulin protein produced by plasma cells 
derived from B lymphocytes that interacts with the surface of a pathogen 
or foreign substance to facilitate its destruction. (17.1) 

Anticodon A three-nucleotide sequence in each tRNA that functions 
in the recognition of the complementary mRNA codon. (11.15) 
Antigen Any substance recognized by an immune system as foreign to 
the organism. (17.2) 
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Antigen-presenting cells (APCs) Cells that display portions of protein 
antigens at their surface. The portions are derived by proteolysis of larger 
antigens. The antigenic peptides are displayed in conjunction with MHC 
molecules. Virtually any cell in the body can function as an APC by 
displaying peptides in combination with MHC class I molecules, which 
provides a mechanism for the destruction of infected cells. In contrast, 
macrophages, dendritic cells, and B cells are referred to as “professional” 
APCs because they function to ingest materials, process them, and display 
them to T,, cells in combination with MHC class II molecules. (17.6) 


Antiport A mode of membrane transport in which one transported 
species is moved across the membrane in one direction, coupled with 
the motion of a different species across the membrane in the opposite 
direction (4.14) 


Antiserum Fluid containing desired antibodies that remains after the 
removal of the cells and clotting factors from whole blood that has been 
exposed to given antigen. (18.26) 


Apoptosis A type of orderly, or programmed, cell death in which the cell 
responds to certain signals by initiating a normal response that leads to 
the death of the cell. Death by apoptosis is characterized by the overall 
compaction of the cell and its nucleus, the orderly dissection of the 
chromatin into pieces at the hands ofa special DNA-splitting endonuclease, 
and the rapid engulfment of the dying cell by phagocytosis. (15.17) 


Artifact A structure seen in a microscopic image that results from the 
coagulation or precipitation of materials that had no existence in the 
living cell. (18.5) 


Assay Some identifiable feature of a specific protein, such as the 
catalytic activity of an enzyme, used to determine the relative amount of 
that protein in a sample. (18.11) 


Aster “Sunburst” arrangement of microtubules around each centrosome 
during mitosis. (14.7) 


Asymmetric cell division Cell division that produces two non-identical 
daughter cells. (14.1) 


Atomic force microscope (AFM) A_ high-resolution scanning 
instrument that is becoming increasingly important in nanotechnology 
and molecular biology. The AFM operates by scanning a delicate probe 
over the surface of the specimen. (18.7) 


ATP synthase The ATP-synthesizing enzyme of the inner mitochondrial 
membrane, which F, headpiece and the F, basal piece, the latter of which 
is embedded in the membrane itself. (5.7) 


Autoantibodies Antibodies capable of reacting with the body’s own 
tissues. (17.2) 


Autoimmune diseases Diseases characterized by an attack of the 
immune system against the body’s own tissues. Includes multiple sclerosis, 
insulin-dependent diabetes mellitus, and rheumatoid arthritis. (17.3) 


Autophagy The destruction of organelles and their replacement during 
which an organelle is surrounded by a double membrane. The membrane 
surrounding the organelle then fuses with a lysosome. (8.15) 


Autoradiography A technique for visualizing biochemical processes 
by allowing an investigator to determine the location of radioactively 
labeled materials within a cell. Tissue sections containing radioactive 
isotopes are covered with a thin layer of photographic emulsion, which 
is exposed by radiation emanating from the tissue. Sites in the cells 
containing radioactivity are revealed under the microscope by silver 
grains after development of the overlying emulsion. (8.2, 18.8) 
Autotroph Organism capable of surviving on CO, as its principal 
carbon source. (6.1) 

Axon A single, prominent extension that emerges from the cell body 


and conducts outgoing impulses away from the cell body and toward the 
target cell(s). (4.16) 


Axoneme The central, microtubule-containing core of a cilium or 
flagellum. Most axonemes consist of nine peripheral doublets, two 
central microtubules, and numerous accessory structures. (9.7) 


B lymphocytes (or B cells) Lymphocytes that respond to antigen by 
proliferating and differentiating into plasma cells that secrete blood- 
borne antibodies. These cells attain their differentiated state in the bone 
marrow. (17.1) 


Bacterial artificial chromosome (BAC) Cloning vector capable of 
accepting large foreign DNA fragments that can be cloned in bacteria. 
Consists of an F plasmid with an origin of replication and the genes 
required to regulate replication. BACs have played a key role in genome 
sequencing. (18.23) 


Basal body A structure that resides at the base of the cilium or 
flagellum and which generates their outer microtubules. Basal bodies 
are identical in structure to centrioles. Both can give rise to one 
another. (9.5) 


Base Any molecule that is capable of accepting a hydrogen ion. (2.4) 


Base composition analysis The relative amounts of each base in 
various samples of DNA. (10.5) 


Base excision repair (BER) A cut-and-patch mechanism for the 
removal from the DNA of altered nucleotides, e.g., uracil (formed from 
cytosine) and 8-oxoguanine. (13.8) 


Basement membrane (basallamina) Thickened layer of approximately 
50 to 200 nm of extracellular matrix that surrounds muscle and fat cells 
and underlies the basal surface of epithelial tissues such as the skin, the 
inner lining of the digestive and respiratory tracts, and the inner lining 
of blood vessels. (7.2) 


Beta (6) clamp One of the noncatalytic components of the replisome 
that encircles the DNA and keeps the polymerase associated with the 
DNA template. (13.4) 


Beta (6) sheet One possible secondary structure of a polypeptide, in 
which several B-strands lie parallel to each other, creating the 
conformation of a sheet. (2.9) 


Beta (f) strand One possible secondary structure of a polypeptide, in 
which the backbone of the chain assumes a folded (or pleated) 
conformation. (2.9) 


Biochemicals Compounds synthesized by living organisms. (2.5) 


Bioenergetics The study of the various types of energy transformations 
that occur in living organisms. (3.1) 


Biosynthetic pathway (secretory pathway) Route through the 
cytoplasm by which materials are synthesized in the endoplasmic 
reticulum or Golgi complex, modified during passage through the Golgi 
complex, and transported within the cytoplasm to various destinations 
such as the plasma membrane, a lysosome, or a large vacuole of a plant 
cell. The alternative term secretory pathway has been used because 
many of the materials synthesized in the pathway are destined to be 
discharged (secreted) outside the cell. (8.1) 


Bivalent (tetrad) The complex formed during meiosis by a pair of 
synapsed homologous chromosomes. (14.13) 


Bright-field microscope A microscope in which light from the 
illuminating source is caused to converge on the specimen by the 
substage condenser, thereby forming a cone of bright light that can enter 
the objective lens. (18.2) 


Buffers Compounds that can interact with either free hydrogen or 
hydroxyl ions, minimizing a change in pH. (2.4) 


Cc 


C, pathway Alternate pathway for carbon fixation utilizing phosphoe- 
nolpyruvate as the CO, acceptor to produce four-carbon compounds 
(predominantly malate and oxaloacetate). (6.10) 


C,plants Plants that depend solely on the C, pathway to fix atmospheric 
CO,,. (6.9) 

C, plants Plants, primarily tropical grasses, that use the C, carbon 
fixation pathway. (6.10) 


Cadherins A family of related glycoproteins that mediate Ca’*- 
dependent cell-cell adhesion. (7.7) 


Calcium-binding proteins Proteins, such as calmodulin, that bind calcium 
and allow the calcium to elicit a variety of cellular responses. (15.14) 


Calmodulin A small, calcium-binding protein that is widely 
distributed. Each molecule of calmodulin contains four binding sites for 
calcium. (15.14) 


Calvin cycle (or Calvin-Benson cycle) The pathway for conversion of 
CO, into carbohydrate; the cycle occurs in cyanobacteria and all 
eukaryotic photosynthetic cells. (6.9) 


CAM plants Plants that utilize PEP carboxylase in fixing CO, just as C, 
plants do, but conduct the light-dependent reactions and carbon fixation 
at different times of the day, so that stomata can be closed during the 
peak water loss hours of the day. (6.10) 


Cancer stem cells Cells that are thought to exist in some tumors that 
are responsible for maintaining the tumor and enabling it to spread. 
(16.17) 


Carbohydrates (Glycans) Organic molecules including simple sugars 
(monosaccharides) and multisaccharide polymers, which largely serve 
as energy-storage and structural compounds in cells. (2.6) 


Caspases A family of cysteine proteases that are activated at an early 
stage of apoptosis and are responsible for the degradative events 
observed during cell death. (15.17) 


Catabolic pathway A metabolic pathway in which relatively complex 
molecules are broken down into simpler products. (3.9) 


Cation An ionized atom or molecule with an extra positive charge. (2.1) 
Cell culture The technique used to grow cells outside the organism. (18.9) 


Cell cycle The stages through which a cell passes from one cell division 
to the next. (14.1) 


Cell division The process by which new cells originate from other 
living cells. (14.1) 


Cell-free system An experimental system to study cellular activities 
that does not require whole cells. Such systems typically contain a 
preparation of purified proteins and/or subcellular fractions and are 
amenable to experimental manipulation. (8.2) 


Cell fusion Technique whereby two different types of cells (from one 
organism or from different species) are joined to produce one cell with 
one, continuous plasma membrane. (4.7) 


Cell line Cells that are commonly used in tissue culture studies that 
have undergone genetic modifications that allow them to be grown 
indefinitely. (18.9) 

Cell-mediated immunity Carried out by T lymphocytes, that when 
activated, can specifically recognize and kill an infected (or foreign) 
cell. (17.1) 

Cell plate Structure between the cytoplasm of two newly formed 
daughter cells that gives rise to a new cell wall in plant cells. (7.14, 14.11) 
Cell signaling Communication in which information is relayed across 
the plasma membrane to the cell interior and often to the cell nucleus by 
means of a series of molecular interactions. (15.1) 


Cell theory Theory of biological organization, which has three 
tenets: all organisms are made up of one or more cells; the cell is the 
structural unit of life; cells only arise from the division of preexisting 
cells. (1.1) 


Cellulose Unbranched glucose polymer with $(1>4) linkages that 
assembles into cables and serves as a principal structural element of 
plant cell walls. (2.6) 


Cell wall A rigid, nonliving structure that provides support and 
protection for the cell it surrounds. (7.14) 


Central Dogma The concept that DNA contains the information to 
make RNA, and RNA contains the information to make protein. (11.1) 


Centrioles Cylindrical structures, about 0.2 sm across and typically 
about twice as long, that contain nine evenly spaced fibrils, each of 
which appears in cross section as a band of three microtubules. 
Centrioles are nearly always found in pairs, with each of the members 
situated at right angles to one another. (9.5) 


Centromere Marked indentation on a mitotic chromosome that serves 
as the site of kinetochore formation. (12.5) 


Centrosome A complex structure that contains two barrel-shaped 
centrioles surrounded by amorphous, electron-dense pericentriolar 
material (or PCM) where microtubules are nucleated. (9.5, 14.7) 


Checkpoint Mechanisms that halt the progress of the cell cycle if (1) 
any of the chromosomal DNA is damaged, or (2) certain critical 
processes, such as DNA replication or chromosome alignment during 
mitosis, have not been properly completed. (14.5) 


Chemiosmotic mechanism The mechanism for ATP synthesis 
whereby the movement of electrons through the electron-transport 
chain results in establishment of a proton gradient across the bacterial, 
thylakoid, or inner mitochondrial membrane, with the gradient acting 
as a high-energy intermediate, linking oxidation of substrates to the 
phosphorylation of ADP. (5.2) 


Chemoautotroph An autotroph that utilizes the energy stored in 
inorganic molecules (such as ammonia, hydrogen sulfide, or nitrites) to 
convert CO, into organic compounds. (6.1) 


Chiasmata (singular chiasma) Specific points of attachment between 
homologous chromosomes of bivalents, observed as the homologous 
chromosomes move apart at the beginning of the diplotene stage of meiotic 
prophase 1. The chiasmata are located at the sites on the chromosomes at 
which genetic exchange during crossing over had previously occurred. (14.13) 


Chlorophyll The most important light-absorbing photosynthetic 
pigment. (6.4) 


Chloroplast A specialized membrane-bound cytoplasmic organelle 
that is the principal site of photosynthesis in eukaryotic cells. (6.1) 


Cholesterol Sterol found in animal cells that can constitute up to half of 
the lipid in a plasma membrane, with the relative proportion in any 
membrane affecting its fluid behavior. (4.2) 


Chromatid Paired, rod-shaped members of mitotic chromosomes that 
together represent the duplicated chromosomes formed during 
replication in the previous interphase. (14.7) 


Chromatin A complex nucleoprotein material that makes up the 
chromosomes of eukaryotes. (1.3, 12.2) 


Chromatin remodeling complexes Multisubunit protein complexes 
that use the energy released by ATP hydrolysis to alter chromatin 
structure and allow binding of transcription factors to regulatory sites in 
the DNA. (12.15) 


Chromatography A term used for a wide variety of techniques in 
which a mixture of dissolved components is fractionated as it moves 
through some type of immobile matrix. (18.11) 
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Chromosome compaction (chromosome condensation) A process in 
which a cell converts its chromosomes into shorter, thicker structure 
capable of being segregated during mitosis or meiosis. (14.7) 


Chromosomes Threadlike strands that are composed of the nuclear 
DNA of eukaryotic cells and are the carriers of genetic information. (10.2) 


Cilia Hairlike motile organelles that project from the surface of a 
variety of eukaryotic cells. Cilia tend to occur in large numbers on 
a cell’s surface. (9.7) 


Ciliary dynein A huge protein (up to 2 million daltons) responsible for 
conversion of the chemical energy of ATP into the mechanical energy of 
ciliary locomotion. (HP9.8) 


Cis cisternae The cisternae of the Golgi complex closest to the 
endoplasmic reticulum. (8.9) 


Cis Golgi network (CGN) The cis-most face of the organelle that is 
composed of an interconnected network of tubules located at the entry 
face closest to the ER. The CGN is thought to function primarily as a 
sorting station that distinguishes between proteins to be shipped back 
to the ER and those that are allowed to proceed to the next Golgi 
station. (8.9) 


Cisternae One of the numerous flattened membrane sacs that comprise 
the Golgi apparatus. (8.3) 


Citric acid cycle An alternative name for the TCA cycle. (5.2) 


Clonal selection theory Well supported theory that B and T 
lymphocytes develop their ability to produce specific antibodies or T 
cell receptors prior to exposure to antigen. Should an antigen then enter 
the body, it can interact specifically with those B and T lymphocytes 
bearing complementary receptors. Interaction between the antigen and 
the B or T lymphocytes leads to proliferation of the lymphocyte to form 
a clone of cells capable of responding to that specific antigen. (17.2) 


Coactivators The intermediaries that help bound transcription 
factors stimulate the initiation of transcription at the core promoter. 
(12.15) 


Coated pits Specialized domains of the plasma membrane; coated pits 
serve as collection points for receptors that bind substances that enter a 
cell by means of endocytosis. (8.17) 


Coated vesicles Vesicles that bud from a membrane compartment 
typically possess a multi-subunit protein coat that promotes 
the budding process and binds specific membrane proteins. COPI-, 
COPII-, and clathrin-coated vesicles are the best characterized 
coated vesicles. (8.10) 


Codons Sequences of three nucleotides (nucleotide triplets) in mRNAs 
that specify amino acids. (11.14) 


Coenzyme An organic, nonprotein component of an enzyme. (3.5) 


Cofactor The nonprotein component of an enzyme, it can be either 
inorganic or organic. (3.5) 


Cohesin A multiprotein complex that keeps replicated chromatids 
associated with one another until they are separated during cell division. 
(14.7) 


Collagens A family of fibrous glycoproteins known for their high 
tensile strength that function exclusively as part of the extracellular 
matrix. (7.3) 


Competitive inhibitor An enzyme inhibitor that competes with 
substrate molecules for access to the active site. (3.7) 


Complementary The relationship between the sequence of bases in the 
two strands of the double helix of DNA. Structural restrictions on the 
configurations of the bases limits bonding to the two pairs: adenine- 
thymine and guanine-cytosine. (10.5) 


Complement A system of blood-plasma proteins that act as part of the 
innate immune system to destroy invading microorganisms either 


directly (by making their plasma membrane porous) or indirectly (by 
making them susceptible to phagocytosis). (17.1) 


Conductance The movement of small ions across membranes. (4.11) 


Confocal scanning microscope A microscope in which the specimen 
is illuminated by a finely focused laser beam that rapidly scans across 
the specimen at a single depth, thus illuminating only a thin plane (or 
“optical section”) within the object. The microscope is typically used 
with fluorescently stained specimens, and the light emitted from the 
illuminated optical section is used to form an image of the section on a 
video screen. (18.3) 


Conformation The three-dimensional arrangement of the atoms 
within a molecule, often important in understanding the biological 
activity of proteins and other molecules in a living cell. (2.9) 


Conformational change A predictable movement within a molecule 
that is associated with biological activity. (2.10) 


Congression The movement of duplicated chromosomes to the 
metaphase plate during prometaphase of mitosis. (14.8) 


Connective tissue Tissue that consists largely of a variety of distinct 
fibers that interact with each other in specific ways. The deeper layer of 
the skin (the dermis) is a type of connective tissue. (7.1) 


Connexon Multisubunit complex of a gap junction formed from the 
clustering within the plasma membrane of an integral membrane 
protein called connexin. Each connexon is composed of six connexin 
subunits arranged around a central opening (or annulus) about in 
diameter. (7.12) 


Consensus sequence The most common version of a conserved 
sequence. The TTGACA sequence of a bacterial promoter (known as 
the —35 element) is an example of a consensus sequence. (11.3) 


Constitutive heterochromatin Chromatin that remains in the compacted 
state in all cells at all times and, thus, represents DNA that is permanently 
silenced. It consists primarily of highly repeated sequences. (12.4) 


Constitutive secretion Discharge of materials synthesized in the cell 
into the extracellular space in a continual manner. (8.1) 


Contrast The difference in appearance between adjacent parts of an 
object or an object and its background. (18.1) 


Conventional (or type II) myosins A family of myosins, first 
identified in muscle tissue, that are the primary motors for muscle 
contraction but are also found in a variety of nonmuscle cells. Type II 
myosins are needed for splitting a cell in two during cell division, 
generating tension at focal adhesions, and the turning behavior of 
growth cones. (9.11) 


Copper atoms of the electron transport chain A type of electron 
carrier; these atoms are located within a single protein complex of the 
inner mitochondrial membrane that accept and donate a single electron 
as they alternate between Cu2* and Cul* states. (5.4) 


Cotransport A process that couples the movement of two solutes across 
a membrane, termed symport if the two solutes are moved in the same 
direction and antiport if the two move in opposite directions. (4.14) 


Co-translational translocation The movement of a nascent peptide 
through a transmembrane channel and into the cisternal space of the ER 
as it is being synthesized by the ribosome (8.4) 

Covalent bond The type of chemical bond in which electron pairs are 
shared between two atoms. (2.1) 

CRISPR An RNA based defense pathway employed by bacteria to 
defend against viruses that has been turned into a tool for engineering 
eukaryotic genomes (11.13) 

Cristae The many deep folds that are characteristic of the inner 


mitochondrial membrane, which contain the molecular machinery of 
oxidative phosphorylation. (5.1) 


Crossing over (genetic recombination) Reshuffling of the genes on 
chromosomes (thereby disrupting linkage groups) that occurs during 
meiosis as a result of breakage and reunion of segments of homologous 
chromosomes. (10.4) 


Cyclic photophosphorylation The formation of ATP in chloroplasts 
that is carried out by PSI independent of PSII. (6.8) 


Cyclin-dependent kinases (Cdks) Enzymes that control progression 
of cells through the cell cycle. (14.4) 


Cytochrome oxidase Enzyme complex that catalyzes the final step of 
electron transport by transferring electrons from cytochrome c to 
oxygen, thus consuming oxygen and producing water. (5.5) 


Cytochromes A type of electron carrier consisting of a protein bound 
to a heme group. (5.4) 


Cytokines Proteins secreted by cells of the immune system that alter 
the behavior of other immune cells. (17.6) 


Cytokinesis The part of the cell cycle during which physical division of 
the cell into two daughter cells occurs. (14.1) 


Cytoplasmic dynein A huge protein (molecular mass over 1 million 
daltons) composed of numerous polypeptide chains. The molecule 
contains two large globular heads that act as force-generating engines. 
Evidence suggests that cytoplasmic dynein functions in the movement 
of chromosomes during mitosis and also as a minus end-directed 
microtubular motor for the movement of vesicles and membranous 
organelles through the cytoplasm. (9.3) 


Cytoskeleton An elaborate interactive network composed of three well- 
defined filamentous structures: microtubules, microfilaments, and 
intermediate filaments. These elements function to provide structural 
support; an internal framework responsible for positioning the various 
organelles within the interior of the cell; as part of the machinery required 
for movement of materials and organelles within cells; as force-generating 
elements responsible for the movement of cells from one place to another; 
as sites for anchoring messenger RNAs and facilitating their translation 
into polypeptides; and as a signal transducer, transmitting information 
from the cell membrane to the cell interior. (9.1) 


Cytosol The region of fluid content of the cytoplasm outside of the 
membranous organelles of a eukaryotic cell. (1.3) 


Dehydrogenase An enzyme that catalyzes a redox reaction by removing 
a hydrogen atom from one reactant. (3.10) 


Deletion A chromosomal aberration that occurs when a portion of a 
chromosome is missing. (12.6) 


Deletion mapping A method for identifying regions important for 
transcriptional regulation by testing the expression of artificial DNA 
constructs in which parts of the DNA sequence are missing. (12.13) 


ApH Different in pH between two sides of a membrane. In the context 
of this chapter, delta pH refers to the pH difference across the 
mitochondrial inner membrane. (5.6) 

Denaturation Separation of the DNA double helix into its two 
component strands. (2.12, 10.8) 

Dendrites Fine extensions from the cell bodies of most neurons; 
dendrites receive incoming information from external sources, typically 
other neurons. (4.16) 

Dendritic cells (DCs) Antigen presenting cells that interact with T and 
B cells to activate an immune response. (17.6) 

Deoxyribonucleic acid (DNA) A double-stranded nucleic acid composed 
of two polymeric chains of deoxyribose-containing nucleotides. (2.18) 
Depolarization A decrease in the electrical potential difference across 
a membrane. (4.16) 


Desmosomes (maculae adherens) Disc-shaped adhesive junction 
containing cadherins found in a variety of tissues, but most notably 
epithelia, where they are located basal to the adherens junction. Dense 
cytoplasmic plaques on the inner surface of the plasma membranes in 
this region serve as sites of anchorage for looping intermediate filaments 
that extend into the cytoplasm. (7.9) 


Differential centrifugation A technique used to isolate a particular 
organelle in bulk quantity, which depends on the principle that, as long 
as they are more dense than the surrounding medium, particles of 
different size and shape travel toward the bottom of a centrifuge tube at 
different rates when placed in a centrifugal field. (18.10) 


Differentiation The process through which unspecialized cells become 
more complex and specialized in structure and function. (1.5) 


Diffusion The spontaneous process in which a substance moves from 
an area of higher concentration to one of lower concentration, eventually 
reaching the same concentration in all areas. (4.9) 


Diploid Containing both members of each pair of homologous 
chromosomes, as exemplified by most somatic cells. Diploid cells are 
produced from diploid parental cells during mitosis. Contrast with 
haploid. (10.10) 


Directimmunofluorescence Technique for the intracellular localization 
of an antigen, in which antibodies are conjugated with small fluorescent 
molecules to form derivatives that are then incubated with the cells or 
sections of cells. The binding sites are then visualized with the fluorescence 
microscope. (18.26) 


Dissociation constant A measure of how tightly two molecules can 
bind to each other. (3.2) 


Disulfide bridge Forms between two cysteines that are distant from 
one another in the polypeptide backbone or in two separate polypeptides. 
They help stabilize the intricate shapes of proteins. (2.8) 


DNA cloning A technique to produce large quantities of a specific DNA 
segment, usually involving splicing the segment into a vector DNA and 
allowing the vector to be replicated within a bacterial host cell. (18.20) 


DNA-dependent RNA polymerases (RNA polymerases) The enzymes 
responsible for transcription in both prokaryotic and eukaryotic cells. 
(11.2) 


DNA footprinting A method to map transcription factor binding sites 
by testing whether the bound factor protects different regions of a gene 
from digestion by DNA-digesting enzymes. (12.13) 


DNA gyrase A type II topoisomerase that is able to change the state of 
supercoiling in a DNA molecule by relieving the tension that builds up 
during replication. It does this by traveling along the DNA and acting 
like a “swivel,” changing the positively supercoiled DNA into negatively 
supercoiled DNA. (13.2) 


DNA libraries Collections of cloned DNA fragments. Libraries are 
typically divided into genomic and cDNA libraries. (18.23) 


DNA ligase The enzyme responsible for joining DNA fragments into a 
continuous strand. (13.2) 


DNA microarrays “DNA chips” prepared by spotting DNAs from 
different genes in a known, ordered location on a glass slide. The slide is 
then incubated with fluorescently labeled cDNAs whose hybridization 
level provides a measure of the level of expression of each gene in the 
array. (12.10) 

DNA polymerases The enzymes responsible for constructing new 
DNA strands during replication or DNA repair. (13.2) 

DNA replication The process by which the DNA of a cell is copied 
prior to cell division. (13.1) 

DNA tumor viruses Viruses capable of infecting vertebrate cells, 
transforming them into cancer cells. DNA viruses have DNA in the 
mature virus particle. (16.2) 
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Dolichol phosphate Hydrophobic molecule built from more than 
20 isoprene units that assembles the basal, or core, segment of 
carbohydrate chains within glycoproteins. (8.6) 


Domain A region within a protein (or RNA) that folds and functions in 
a semi-independent manner. (2.10) 


Double-strand breaks (DSBs) DNA damage often resulting from 
ionizing radiation involving a fracture of both strands of the double 
helix. DSBs can be debilitating to a cell and at least two distinct repair 
systems are devoted to their repair. (13.8) 


Duplication A chromosome abnormality in which an extra copy of a 
portion of a chromosome is present. (12.6) 


Dynamic instability A term that relates to the assembly/disassembly 
properties of the plus end of a microtubule. The term describes the fact 
that growing and shrinking microtubules can coexist in the same region 
of a cell, and that a given microtubule can switch back and forth 
unpredictably between growing and shortening phases. (9.6) 


Dynein An exceptionally large, cargo-carrying, multisubunit motor 
protein that moves along microtubules toward their minus end. This 
family of proteins occurs as cytoplasmic dyneins and ciliary or 
axonemal dyneins. (9.3) 


E (exit) site tRNA binding site on the ribosome that is occupied by a 
tRNA after it has transferred its amino acid to the growing peptide 
chain, but before being released from the ribosome. (11.16) 


Effector A substance that brings about a cellular response to a signal. 
(15.1) 


Electrochemical gradient The overall difference in electrical charge 
and in solute concentration that determines the ability of an electrolyte 
to diffuse between two compartments. (4.9) 


Electron-transfer potential The relative affinity for electrons, such 
that a compound with a low affinity has a high potential to transfer one 
or more electrons in a redox reaction (and thus act as a reducing agent). 
(5.4) 


Electron-transport or respiratory chain Membrane-embedded electron 
carriers that accept high-energy electrons and sequentially lower the 
energy state of the electrons as they pass through the chain, with the net 
results of capturing energy for use in synthesizing ATP or other energy- 
storage molecules. (5.4) 


Electronegative atom The atom with the greater attractive force; 
the atom that can capture the major share of electrons of a covalent 
bond. (2.1) 


Electrophoresis Fractionation techniques that depend on the ability 
of charged molecules to migrate when placed in an electric 
field. (18.13) 


Embryonic stem (ES) cells A type of cell that has virtually unlimited 
powers of differentiation, found in the mammalian blastocyst, which is 
an early stage of embryonic development comparable to the blastula of 
other animals. (HP 1.6, 18.25) 


Endergonic reactions Reactions that are thermo- dynamically unfavorable 
and cannot occur spontaneously, possessing a +AG value. (3.2) 


Endocytic pathway Route for moving materials from outside the cell 
(and from the membrane surface of the cell) to compartments, such as 
endosomes and lysosomes, located within the cell interior. (8.1) 


Endocytosis Mechanism for the uptake of fluid and solutes into a 
cell. Can be divided into two types: bulk-phase endocytosis, which is 
non-specific, and receptor-mediated endocytosis (RME), which 
requires the binding of solute molecules such as LDL or transferrin to a 
specific cell-surface receptor. (8.17) 


Endomembrane system Functionally and structurally interrelated 
group of membranous cytoplasmic organelles including the endoplasmic 
reticulum, Golgi complex, endosomes, lysosomes, and vacuoles. (8.1) 


Endoplasmic reticulum (ER) A system of tubules, cisternae, and vesicles 
that divides the fluid content of the cytoplasm into a luminal space within 
the ER membrane and a cytosolic space outside the membranes. (8.3) 


Endosomes Organelles of the endocytic pathway. Materials taken up 
by endocytosis are transported to early endosomes where they are 
sorted, and then on to late endosomes and ultimately to lysosomes. Late 
endosomes also function as destination sites of lysosomal enzymes 
transported from the Golgi complex. (8.19) 


Endosymbiont theory Proposal, which is based on considerable 
evidence, that mitochondria and chloroplasts arose from symbiotic 
prokaryotes that took up residence within a primitive host cell. (EP 1.9) 


Endothermic reactions Those gaining heat under conditions of 
constant pressure and volume. (3.1) 


Energy The capacity to do work, it exists in two forms: potential and 
kinetic. (3.1) 

Enhancer A regulatory site in the DNA that may be located at considerable 
distance either upstream or downstream from the promoter that it 
regulates. Binding of one or more transcriptional factors to the enhancer 
can dramatically increase the rate of transcription of the gene. (12.15) 


Entropy (S) A measure of the relative disorder of the system or universe 
associated with random movements of matter; because all movements 
cease at absolute zero (0 K), entropy is zero only at that temperature. 
(3.1) 


Enzyme inhibitor Any molecule that can bind to an enzyme and 
decrease its activity, classified as noncompetitive or competitive based 
on the nature of the interaction with the enzyme. (3.7) 


Enzyme-substrate (ES) complex The physical association between an 
enzyme and its substrate(s), during which catalysis of the reaction takes 
place. (3.5) 


Enzymes The vitally important protein catalysts of cellular reactions. 
(3.5) 


Epigenetic inheritance Heritable changes, ie., changes that can be 
transmitted from one cell to its progeny, that do not involve changes in 
DNA sequence. Epigenetic changes can result from DNA methylation, 
covalent modification of histones, and likely other types of chromatin 
modifications. (12.7) 


Epithelial-mesenchymal transition (ETM) A process in which epithelial 
cells detach from neighboring cells and migrate, forming mesenchymal 
stem cells. These cells may go on to differentiate into a variety of cell types. 
(7.7) 


Epithelial tissue Tissue composed of closely packed cells that line the 
spaces within the body. The outer layer of skin (the epidermis) is a type 
of epithelial tissue. (7.1) 


Epitope (or antigenic determinant) Portion of an antigen that binds to 
the antigen-combining site of a specific antibody. (17.7) 


Equilibrium constant of a reaction (K,,) The ratio of product 
concentrations to reactant concentrations when a reaction is at 
equilibrium. (3.2) 

Ester bond The chemical bond that forms between carboxylic acids 
and alcohols (or acidic and alcoholic functional groups) while producing 
a molecule of water. (2.5) 

Euchromatin Chromatin that returns to its dispersed state during 
interphase. (12.4) 


Eukaryotic cells Cells (e.g., plant, animal, protist, fungal) characterized 
by an internal structure based on organelles such as the nucleus, derived 
from eu-karyon, or true nucleus. (1.3) 


Excitation-contraction coupling The steps that link the arrival of a 
nerve impulse at the muscle plasma membrane to the shortening of the 
sarcomeres deep within the muscle fiber. (9.12) 


Excited state Electron configuration of a molecule after absorption of a 
photon has energized an electron to shift from an inner to an outer 
orbital. (6.4) 


Excytosed Having undergone exocytosis, or discharged outside the 
cell. (8.1) 


Exergonic reactions Reactions that are thermodynamically favorable, 
possessing a —AG value. (3.2) 


Exocytosis The process of membrane fusion and content discharge 
during which the membrane of a secretory granule or vesicle comes into 
contact with the overlying plasma membrane with which it fuses, 
thereby forming an opening through which the contents of the granule 
or vesicle can be released. (8.14) 


Exon-junction complex (EJC) A complex of proteins deposited on the 
transcript 20-24 nucleotides upstream from the newly formed exon- 
exon junction. This conglomeration of proteins stays with the mRNA 
until it is translated. (11.18) 


Exon shuffling The movement of genetic “modules” among unrelated 
genes facilitated by the presence of introns; the introns act like inert 
spacer elements between exons. (11.8) 


Exons Those parts of a split gene that contribute to a mature RNA 
product. (11.6) 


Exothermic reactions Those releasing heat under conditions of 
constant pressure and volume. (3.1) 


Extracellular matrix (ECM) An organized network of extracellular 
materials that is present beyond the immediate vicinity of the plasma 
membrane. It may play an integral role in determining the shape and 
activities of the cell. (7.2) 


Extracellular messenger molecules The means by which cells 
usually communicate with each other. Extracellular messengers 
can travel a short distance and stimulate cells that are in close 
proximity to the origin of the message, or they can travel throughout 
the body, potentially stimulating cells that are far away from the 
source. (15.1) 


F 


Facilitated diffusion Process by which the diffusion rate of a substance 
is increased through interaction with a substance-specific membrane 
protein. (4.13) 


Facilitative transporter A transmembrane protein that binds a specific 
substance and, in so doing, changes conformation so as to facilitate 
diffusion of the substance down its concentration gradient. (4.13) 


Facultative heterochromatin Chromatin that has been specifically 
inactivated during certain phases of an organism's life. (12.4) 


Families Groupings of proteins that have arisen from a single ancestral 
gene that underwent a series of duplications and subsequent 
modifications during the course of evolution. (2.17) 


Fats Molecules consisting of a glycerol backbone linked by ester bonds 
to three fatty acids, also termed triacylglycerols. (2.7) 


Fatty acid Long, unbranched hydrocarbon chain with a single 
carboxylic acid group at one end. (2.7) 


Feedback inhibition A mechanism to control metabolic pathways 
where the end product interacts with an enzyme in the pathway, 
resulting in inactivation of the enzyme. (3.12) 


Fermentation An anaerobic metabolic pathway in which pyruvate is 
converted to another molecule (often lactate or ethanol, depending on 
the organism) and NAD* is regenerated for use in glycolysis. (3.10) 


Fibrous protein One with a tertiary structure that is greatly elongated, 
resembling a fiber. (2.10) 


First law of thermodynamics Energy can be neither created nor 
destroyed. Also known as conservation of energy. (3.1) 


Fixative A chemical solution that kills cells by rapidly penetrating the 
cell membrane and immobilizing all of its macromolecular material in 
such a way that the structure of the cell is maintained as close as possible 
to that of the living state. (18.2) 


Flagella Hairlike motile organelles that project from the surface of a 
variety of eukaryotic cells. Essentially the same structure as cilia but 
present in much fewer numbers. (9.7) 


Flavoproteins A type of electron carrier in which a polypeptide is 
bound to one of two related prosthetic groups, either FAD or FMN. 
(5.4) 


Fluorescence in situ hybridization (FISH) Technique in which probe 
DNAs (or RNAs) are labeled with fluorescent dyes, hybridized to single- 
stranded DNA that remains within its chromosome, and localized with 
a fluorescence microscope. (10.8) 


Fluorescence recovery after photobleaching (FRAP) Technique to 
study movement of membrane components that consists of three steps: 
(1) linking cellular components to a fluorescent dye, (2) irreversibly 
bleaching (removing the visible fluorescence of) a portion of the cell, (3) 
monitoring the reappearance of fluorescence (due to random movement 
of fluorescent-dyed components from outside of the bleached area) in 
the bleached portion of the cell. (4.7) 


Focal adhesions Adhesive structures characteristic of cultured cells 
adhering to the surface of a culture dish. The plasma membrane in the 
region of the focal adhesion contains clusters of integrins that connect 
the extracellular material that coats the culture dish to the actin- 
containing microfilament system of the cytoskeleton. (7.6) 


Frameshift mutations Mutations in which a single base pair is either 
added to or deleted from the DNA, resulting in an incorrect reading 
frame from the point of mutation through the remainder of the coding 
sequence. (11.17) 


Free energy change (AG) The change during a process in the amount 
of energy available to do work. (3.2) 


Free radical Highly reactive atom or molecule that contains a single 
unpaired electron. (HP 2.2) 


Freeze etching Technique in which tissue is freeze fractured, then 
exposed briefy to a vacuum so a thin layer of ice can evaporate from 
above and below the fractured surfaces, exposing features for 
identification by electron microscopy. (18.5) 


Freeze-fracture replication Technique in which a tissue sample is first 
frozen and then struck with a blade that fractures the tissue block along 
the lines of least resistance, often resulting in a fracture line between the 
two leaflets of the lipid bilayer; metals are then deposited on the exposed 
surfaces to create a shadowed replica that is analyzed by electron 
microscopy. (14.4, 18.5) 


Functional groups Particular groupings of atoms that tend to act as a 
unit, often affecting the chemical and physical behavior of the larger 
organic molecules to which they belong. (2.5) 


G 


y-tubulin A type of tubulin that plays a critical component in 
microtubule nucleation. (9.5) 


G protein-coupled receptors (GPCRs) A group of related receptors 
that span the plasma membrane seven times. The binding of the ligand 
to its specific receptor causes a change in the conformation of the 
receptor that increases its affinity for a heterotrimeric G protein 
initiating a response within the cell. (15.3) 
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with spores generated during the sporophyte stage. During the 
gametophyte stage, gametes form through the process of mitosis. 
(14.12) 


Gap junctions Sites between animal cells that are specialized for 
intercellular communication. Plasma membranes of adjacent cells come 
within about 3nm of each other, and the gap is spanned by very fine 
“pipelines,” or connexons that allow the passage of small molecules. 
(7.12) 


Gated channel An ion channel that can change conformation between 
a form open to its solute ion and one closed to the ion; such channels 
can be voltage gated, chemical gated, or mechanically gated depending 
on the nature of the process that triggers the change in conformation. 
(4.11) 


Gel filtration Purification technique in which separation of proteins 
(or nucleic acids) is based primarily on molecular mass. The separation 
material consists of tiny porous beads that are packed into a column 
through which the solution of protein slowly passes. (18.11) 


Gene duplication The duplication of a small portion of a single 
chromosome usually by a process of unequal crossing over. (10.10) 


Gene expression The production of mRNA and eventually protein by a 
given gene. (11.1) 


Gene regulatory protein A protein that is capable of recognizing a 
specific sequence of base pairs within the DNA and binding to that 
sequence with high affinity thereby altering gene expression. (12.1) 


General transcription factors (GTFs) Auxiliary proteins necessary for 
RNA polymerase to initiate transcription. These factors are termed 
“general” because the same ones are required for transcription of a 
diverse array of genes by the polymerase. (11.5) 


Genes In nonmolecular terms, a unit of inheritance that governs the 
character of a particular trait. In molecular terms, a segment of DNA 
containing the information for a single polypeptide or RNA molecule, 
including transcribed but non-coding regions. (10.1) 


Genetic code Manner in which the nucleotide sequences of DNA 
encode the information for making protein products. (11.14) 


Genetic polymorphisms Sites in the genome that vary with relatively 
high frequency among different individuals in a species population. 
(10.15) 


Genome The complement of genetic information unique to each 
species of organism. Equivalent to the DNA of a haploid set of 
chromosomes from that species. (10.1) 


Genome-wide association studies (GWASs) A scientific study seeking 
to uncover whether a certain genetic variants are associated with a 
specific trait. (HP10.16) 


Genome-wide location analysis A method for mapping the binding 
sites of a given transcription factor by physically isolating all the DNA 
sequences in the genome that are bound to a particular factor and then 
probing which DNA sequences are present. (12.13) 


GGAs An adaptor protein consisting of several distinct domains. 
(8.11) 


Globular protein One with a tertiary structure that is compact, 
resembling a globe. (2.10) 


Gluconeogenesis Process by which glucose is synthesized from 
pyruvate. (3.13) 


Glycocalyx A layer closely applied to the outer surface of the plasma 
membrane. It contains membrane carbohydrates along with 
extracellular materials that have been secreted by the cell into the 
external space, where they remain closely associated with the cell 
surface. (7.2) 


Glycogen Highly branched glucose polymer that serves as readily 
available chemical energy in most animal cells. (2.6) 


Glycolipids Sphingosine-based lipid molecules linked to carbohydrates, 
often active components of plasma membranes. (4.2) 


Glycolysis The first pathway in the catabolism of glucose, it does not 
require oxygen and results in the formation of pyruvate. (3.9) 


Glycosaminoglycans (GAGs) A group of highly acidic polysaccharides 
with the structure of —A—B—A—B—, where A and B represent two 
different sugars. (2.6) 


Glycosidic bond The chemical bond that forms between sugar 
molecules. (2.6) 


Glycosylation The reactions by which sugar groups are added to 
proteins and lipids. (4.3) 


Glycosyltransferases A large family of enzymes that transfer specific 
sugars from a specific donor (a nucleotide sugar) to a specific receptor 
(typically the growing end of an oligosaccharide chain). (8.6) 


Glyoxysome Organelle found in plant cells that serve as sites for 
enzymatic reactions including the conversion of stored fatty acids to 
carbohydrate. (5.10) 


Golgi complex Network of smooth membranes organized into a 
characteristic morphology, consisting of flattened, disc-like cisternae 
with dilated rims and associated vesicles and tubules. The Golgi 
complex functions primarily as a processing plant where proteins 
newly synthesized in the endoplasmic reticulum are modified in 
specific ways. (8.9) 

GPI-anchored proteins Peripheral membrane proteins that are 
anchored to the membrane via linkage to a glycosylphosphatidylinositol 
molecule of the bilayer. (4.4) 


Grana Orderly, stacked arrangement of thylakoids. (6.2) 


Graves’ disease Autoimmune disease of the thyroid, caused by 
production of antibodies that bind to the thyroid-stimulating hormone 
receptor, leading to stimulation and hyperthyroidism. (17.3) 


Ground state The unexcited state of an atom or molecule. (6.4) 


Growth cone The distal tip of a growing neuron that contains the 
locomotor activity required for axonal extension. (9.16) 


GTP-binding proteins (or G proteins) With key regulatory roles 
in many different cellular processes, G proteins can be present in at 
least two alternate conformations, an active form containing a 
bound GTP molecule, and an inactive form containing a bound 
GDP molecule. (8.4, 15.2) 


Guanosine triphosphate (GTP) A nucleotide of great importance in 
cellular activities. It binds to a variety of proteins (called G proteins) and 
acts as a switch to turn on their activities. (2.18) 


Half-life A measure of the instability of a radioisotope, or, equivalently, 
the amount of time required for one-half of the radioactive material to 
disintegrate. (18.8) 


Haploid Containing only one member of each pair of homologous 
chromosomes. Haploid cells are produced during meiosis, as exemplified 
by sperm. Contrast with diploid. (10.8) 


Haplotype A block of the genome that tends to be inherited intact from 
generation to generation. Haplotypes are generally defined by the 
presence of aconsistent combination of single nucleotide polymorphisms 
(SNPs). (HP10.16) 

Head group The polar, water-soluble region of a phospholipid that 
consists of a phosphate group linked to one of several small, hydrophilic 
molecules. (4.2) 


Heat shock response Activation of the expression of a diverse array of 
genes in response to temperature elevation. The products of these genes, 
including molecular chaperones, help the organism recover from the 
damaging effects of elevated temperature. (EP2.14) 


Heavy chain One of the two types of polypeptide chains in an antibody, 
usually with a molecular mass of 50 to 70 kD. (17.7) 


Helicase A protein that unwinds the DNA (or RNA) duplex in a 
reaction in which energy released by ATP hydrolysis is used to break the 
hydrogen bonds that hold the two strands together. (13.3) 


Hematopoietic stem cells (HSCs) Cells that are situated primarily in 
the bone marrow that are capable of both self-renewal and of giving rise 
to all types of blood cells. (HP 1.6) 


Hemicelluloses Branched polysaccharides of the plant cell wall whose 
backbone consists of one sugar, such as glucose, and sidechains of other 
sugars, such as xylose. (7.14) 


Hemidesmosome Specialized adhesive structure at the basal surface of 
epithelial cells that functions to attach the cells to the underlying 
basement membrane. The hemidesmosome contains a dense plaque on 
the inner surface of the plasma membrane, with keratin-containing 
filaments coursing out into the cytoplasm. (7.6) 


Heterochromatin Chromatin that remains 


interphase. (12.4) 


compacted during 


Heterogeneous nuclear ribonucleoprotein (hnRNP) The result of the 
transcription of each hnRNA which becomes associated with a variety 
of proteins; hnRNP represents the substrate for the processing reactions 
that follow. (11.7) 


Heterogeneous nuclear RNAs (hnRNAs) A large group of RNA 
molecules that share the following properties: (1) they heave large 
molecular weights (up to about 80S, or 50,000 nucleotides); (2) they 
represent many different nucleotide sequences; and (3) they are found 
only in the nucleus. Includes pre-mRNAs. (11.5) 


Heterotroph Organism that depends on an external source of organic 
compounds. (6.1) 


Highly repeated fraction Typically short (a few hundred nucleotides at 
their longest) DNA sequences that are present in at least 10° copies per 
genome. The highly repeated sequences typically account for about 10 
percent of the DNA of vertebrates. (10.8) 


Histone acetyltransferases (HATs) Enzymes that transfer acetyl 
groups to lysine and arginine residues of core histones. Histone 
acetylation is associated with activation of transcription. (12.15) 


Histone code A concept that the state and activity of a particular region 
of chromatin depends upon the specific covalent modifications, or 
combination of modifications, to the histone tails of the nucleosomes in 
that region. The modifications are created by enzymes that acetylate, 
methylate, and phosphorylate various amino acid residues within the 
core histones. (12.4) 

Histone deacetylases (HDACs) Enzymes that catalyze the removal of 
acetyl groups from core histones. Histone deacetylation is associated 
with the repression of transcription. (12.17) 

Histones A collection of small, well-defined, basic proteins of 
chromatin. (12.3) 

Homogenize To mechanically rupture cells. (8.2) 

Homologous chromosomes Paired chromosomes of diploid cells, each 
carrying one of the two copies of the genetic material carried by that 
chromosome. (10.3) 

Humoral immunity Immunity mediated by blood-borne antibodies. 
(17.1) 

Hybridomas Hybrid cells produced by fusion of a normal 
antibody-producing lymphocyte and a malignant myeloma cell. 


Hybridomas proliferate, producing large amounts of a single 
(monoclonal) antibody that was being synthesized by the normal cell 
prior to fusion with the myeloma cell. (18.26) 


Hydrogen bond The weak, attractive interaction between a hydrogen 
atom covalently bonded to an electronegative atom (thus, with a partial 
positive charge) and a second electronegative atom. (2.3) 


Hydrophilic The tendency of polar molecules to interact with 
surrounding water molecules, which are also polar; derived from “water 
loving” (2.3) 

Hydrophobic interaction The tendency of nonpolar molecules to 
aggregate so as to minimize their collective interaction with surrounding 
polar water molecules; derived from “water fearing.” (2.3) 


Hypertonic (or Hyperosmotic) Property of one compartment having 
a higher solute concentration compared with that in a given 
compartment. (4.10) 


Hypotonic (or Hypoosmotic) Property of one compartment having 
alower solute concentration compared with that in a given compartment. 
(4.10) 


Immune responses Responses elicited by cells of the immune system 
upon contact with foreign materials, including invading pathogens. 
Includes innate and adaptive responses. Adaptive immune responses 
can be divided into primary responses that follow initial exposure to an 
antigen and secondary responses that follow reexposure to that antigen. 
(17.1) 


Immune system Physiological system consisting of organs, scattered 
tissues, and independent cells that protect the body from invading 
pathogens and foreign materials. (17.1) 


Immunity State in which the body is not susceptible to infection by a 
particular pathogen. (17.1) 


Immunoglobulin superfamily (IgSF) A wide variety of proteins that 
contain domains composed of 70 to 110 amino acids that are homologous 
to domains that make up the polypeptide chains of blood-borne 
antibodies. (7.7, 17.1) 


Imprinting Differential expression of genes depending solely on whether 
they were contributed to the zygote by the sperm or the egg. (12.17) 


In vitro Outside the body. Cells grown in culture are said to be grown 
in vitro; and studies on cultured cells are an essential tool of cell and 
molecular biologists. (1.2) 


Indirect immunofluorescence A variation of direct immunoflu- 
orescence in which the cells are treated with unlabeled antibody, which 
is allowed to form a complex with the corresponding antigen. The 
location of the antigen-antibody couple is then revealed in a second step 
using a preparation of fluorescently labeled antibodies whose combining 
sites are directed against the antibody molecules used in the first 
step. (18.26) 

Induced fit The conformational change in an enzyme after the substrate 
has been bound that allows the chemical reaction to proceed. (3.6) 
Inducer A compound that binds to a protein repressor and activates 
transcription of a bacterial operon. (12.1) 

Inducible operon An operon in which the presence of the key 
metabolic substance induces transcription of the structural genes. (12.1) 
Inflammation The localized accumulation of fluid and leukocytes in 
response to injury or infection, leading to redness, swelling, and fever. (17.1) 
Inflammatory bowel diseases (IBDs) One of a variety of conditions, 
including Crohn's disease and ulcerative colitis, characterized by painful 
inflammation of the intestine (17.3). 
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attaches to the mRNA to assure that the ribosome is in the proper 
reading frame to correctly read the entire message. (11.16) 


Initiation factors Soluble proteins (IFs in bacteria and elFs in 
eukaryotes) that make initiation of translation possible. (11.16) 


Initiation of replication The first step of DNA replication in which 
the DNA polymerase is loaded onto the DNA and the DNA strands 
are separated to allow the polymerase to start copying the DNA. 
(13.2) 


Innate immune response A nonspecific response to a pathogen that 
does not require previous exposure to that agent, includes responses 
mediated by NK cells, complement, phagocytes, and interferon. (17.1) 


Insulators Specialized boundary sequences that “cordon off” a 
promoter and its enhancer from other promoter/enhancer elements. 
According to one model, insulator sequences bind to proteins of the 
nuclear matrix. (12.15) 


Insulin receptor substrates (IRSs) Protein substrates that, when 
phosphorylated in response to insulin, bind and activate a variety of 
“downstream” effectors. (15.12) 


Integral protein A membrane-associated protein that penetrates or 
spans the lipid bilayer. (4.4) 

Integrins A superfamily of integral membrane proteins that bind 
specifically to extracellular molecules. (7.5) 


Intermediate filaments (IFs) Strong, ropelike cytoskeletal fibers 
approximately 10 nm in diameter that, depending on the cell type, may 
be composed ofa variety of protein subunits capable of assembling into 
similar types of filaments. IFs are thought to provide mechanical stability 
to cells and provide specialized, tissue-specific functions. (9.1) 


Intermembrane space The space between the inner and outer 
mitochondrial membranes. (5.1) 


Interphase The portion of the cell cycle between periods of cell 
division. (14.1) 

Intervening sequences Regions of DNA that are between coding 
sequences of a gene and that are therefore missing from corresponding 
mRNA. (11.6) 


Intraflagellar transport (IFT) A process in which particles are moved 
in both directions between the base of a flagellum or cilium and its tip. 
The force that drives IFT is generated by motor proteins that track along 
the peripheral doublets of the axoneme. (9.7) 


Introns Those parts of a split gene that correspond to the intervening 
sequences. (11.6) 


Inversion A chromosomal aberration that results after a chromosome 
is broken in two places and the resulting center segment is reincorporated 
into the chromosome in reverse order. (12.6) 

In vitro Outside the body. Cells grown in culture are said to be grown 
in vitro; and studies on cultured cells are an essential tool of cell and 
molecular biologists. (1.2) 

Ion Anatom or molecule with a net positive or negative charge because it 
has lost or gained one or more electrons during a chemical reaction. (2.1) 
Ion channel A transmembranous structure (e.g., an integral protein 
with an aqueous pore) permeable to a specific ion or ions. (4.11) 
Ion-exchange chromatography A technique for protein purification in 
which ionic charge is used to separate different proteins. (18.11) 

Ionic bond A noncovalent bond occurring between oppositely charged 
ions, also called a salt bridge. (2.3) 

Iron-sulfur proteins A group of protein electron carriers with an 
inorganic, iron-sulfur center. (5.4) 

Irreversible inhibitor An enzyme inhibitor that binds tightly, often 
covalently, thus inactivating the enzyme molecule permanently. (3.7) 


Isoelectric point The pH at which the negative charges of the 
component amino acids of a protein equal the positive charges of the 
component amino acids, so the protein is neutral. (18.11) 


Isoforms Different versions of a protein. Isoforms may be encoded by 
separate, closely related genes, or formed as splice variants by alternative 
splicing from a single gene. (2.17) 

Isotonic or Isosmotic Property of one compartment having the same 
solute concentration compared with that in a given compartment. (4.10) 


K 


Karyotype Image in which paired homologous chromosomes are 
ordered in decreasing size. (12.5) 


Kinesin A plus end-directed motor protein that moves membranous 
vesicles and other organelles along microtubules through the cytoplasm. 
Kinesin is a member of a family of kinesin-related proteins (KRPs). 
(9.3) 


Kinetics The study of the rates of chemical reactions. (3.7) 


Kinetochore A buttonlike structure situated at the outer surface of the 
centromere to which the microtubules of the spindle attach. (14.7) 


Knockout mice Mice, born as the result of a series of experimental 
procedures, that are lacking a functional gene that they would normally 
contain. (18.25) 


Krebs cycle An alternative name for the TCA cycle. (5.2) 


L 


Lagging strand The newly synthesized daughter DNA strand that is 
synthesized discontinuously, so called because the initiation of each 
fragment must wait for the parental strands to separate and expose 
additional template. (13.2) 


Lamellipodium The leading edge of a moving fibroblast, which is 
extended out from the cell as a broad, flattened, veil-like projection that 
glides over the substratum. (9.14) 


Last Eukaryotic Common Ancester (LECA) The species that was the 
common ancestor to all existing eukaryotic species, thought to be a 
single celled organism that already had all of the cell biological features 
characteristic of eukaryotic cells. (EP 1.9) 


Leading strand The newly synthesized daughter DNA strand that is 
synthesized continuously, so called because its synthesis continues as 
the replication fork advances. (13.2) 


Ligand Any molecule that can bind to a receptor because it has a 
complementary structure. (4.1, 15.1) 


Ligand-gated channel A membrane channel that opens or closes in 
response to binding of a molecule to its surface. (4.11) 


Light chain Smaller of the two types of polypeptide chains in an 
antibody, with a molecular mass of 23 kDa. (17.7) 


Light-dependent reactions First of two series of reactions that 
compose photosynthesis. In these reactions, energy from sunlight is 
absorbed and converted to chemical energy that is stored in ATP and 
NADPH. (6.3) 


Light-harvesting complex II (LHCII) Pigment-protein complex, 
located outside the photosystem itself, that contains most of the antenna 
pigments that collect light for PSII. It can also be associated with PSI. 
(6.6) 


Light-independent reactions (dark reactions) Second of two series 
of reactions that compose photosynthesis. In these reactions, 
carbohydrates are synthesized from carbon dioxide using the energy 
stored in the ATP and NADPH molecules formed in the light- 
dependent reactions. (6.3) 


Limit of resolution The resolution attainable by a microscope is 
limited by the wavelength of the illumination according to the equation 
D =0.61 A/n sin a where D is the minimum distance that two points in 
the specimen must be separated to be resolved, A is the wavelength of 
light, and n is the refractive index of the medium. Alpha is a measure of 
the light-gathering ability of the lens and is directly related to its 
aperture. For a light microscope, the limit of resolution is slightly less 
than 200 nm. (18.1) 


Linkage groups Groups of genes that reside on the same chromosome 
causing nonindependent segregation of traits controlled by these genes. 
(10.3) 


Lipid-anchored protein A membrane associated protein that is located 
outside the bilayer but is covalently linked to a lipid molecule within the 
bilayer. (4.4) 


Lipid bilayer Phospholipids self-assembled into a bimolecular structure 
based on hydrophobic and hydrophilic interactions; biologically 
important as the core organization of cellular membranes. (4.1) 


Lipid rafts Microdomains within a cellular membrane that possess 
decreased fluidity due to the presence of cholesterol, glycolipids, and 
phospholipids containing longer, saturated fatty acids. A proposed 
residence of GPI-anchored proteins and signaling proteins (4.6) 


Lipid-transfer proteins Facilitate the movement of specific lipids from 
the ER to other organelles without the involvement of transport vesicles. 
(8.5) 


Liposome An artificial lipid bilayer that self-assembles into a spherical 
vesicle or vesicles when in an aqueous environment. (4.2) 


Locus (pl. loci) The position of a gene on a chromosome. (10.4) 


Luminal (cisternal) space The region of fluid content of the cytoplasm 
enclosed by the membranes of the endoplasmic reticulum or Golgi 
complex. (8.3) 


Lymphocytes Nucleated leukocytes (white blood cells) that circulate 
between the blood and lymph organs that mediate acquired immunity. 
Includes both B cells and T cells. (17.1) 


Lysosomal storage disorder Disease characterized by the deficiency of 
alysosomal enzyme and the corresponding accumulation of undegraded 
substrate. (HP8.12) 


M phase The part of the cell cycle that includes the processes of mitosis, 
during which duplicated chromosomes are separated into two nuclei, 
and cytokinesis, during which the entire cell is physically divided into 
two daughter cells. (14.1) 


Macromolecules Large, highly organized molecules crucial to the 
structure and function of cells; divided into polysaccharides, certain 
lipids, proteins, and nucleic acids. (2.5) 


Major histocompatibility complex (MHC) A region of the genome 
that encodes MHC proteins. The genes that encode these proteins tend 
to be highly polymorphic, being represented by a large number of 
different alleles. These genetic differences between humans account for 
the tendency of a person to reject a transplant from another person 
other than an identical twin. (17.10) 


Matrix One of two aqueous compartments of a mitochondrion; the 
matrix is located within the interior of the organelle; the second 
compartment is called the intermembrane space and is located between 
the outer and inner mitochondrial membrane. (5.1) 


Matrix metalloproteinases (MMPs) A family of zinc-containing 
enzymes that act in the extracellular space to digest various extracellular 
proteins and proteoglycans. (7.4) 


Maximal velocity (V,.) The highest rate achieved for a given G-11 


enzymatically catalyzed reaction, it occurs when the enzyme is saturated 
with substrate. (3.7) 


Mechano-gated channel A membrane channel that opens or closes 
in response to mechanical forces such as tension in the membrane. 
(4.11) 


Medial cisternae Compartments of the Golgi apparatus that lie in the 
center of the organelle, between the cis and trans cisternae. (8.9) 


Meiosis The process during which the chromosome number is reduced 
so that cells are formed that contain only one member of each pair of 
homologous chromosomes. (14.12) 


Membrane potential The electrical potential difference across a 
membrane. (4.16) 


Messenger RNA (mRNA) The intermediate molecule between a gene 
and the polypeptide for which it codes. Messenger RNA is assembled as 
a complementary copy of one of the two DNA strands that encodes the 
gene. (11.1) 


Metabolic intermediate A compound produced during one step of a 
metabolic pathway. (2.5, 3.9) 


Metabolic pathway A series of chemical reactions that results in the 
synthesis of an end product important to cellular function. (2.5, 3.9) 


Metabolism The total of the chemical reactions occurring within a cell. 
(1.2, 3.9) 


Metaphase The stage of mitosis during which all of the chromosomes 
have become aligned at the spindle equator, with one chromatid of each 
chromosome connected to one pole and its sister chromatid connected 
to the opposite pole. (14.9) 


Metastasis Spread of cancer cells from a primary tumor to distant sites 
in the body where the formation of secondary tumors may arise. (7.8) 


Methylguanosine cap Modification of the 5’ end of an mRNA precursor 
molecule, so that the terminal “inverted” guanosine is methylated at the 
7’ position on its guanine base, while the nucleotide on the internal side 
of the triphosphate bridge is methylated at the 2’ position of the ribose. 
This cap prevents the 5’ end of the mRNA from being digested by 
nucleases, aids in transport of the mRNA out of the nucleus, and plays a 
role in the initiation of mRNA translation. (11.7) 


MHC proteins Proteins encoded by the MHC region of the genome 
that bind processed antigens (antigenic peptides) and display them on 
the surface of the cell. Divided into two major classes, MHC class I 
molecules produced by virtually all cells of the body, and MHC class II 
molecules produced by “professional” APCs, such as macrophages and 
dendritic cells. (17.10) 


Michaelis constant (K,,) In enzyme kinetics, the value equal to the 
substrate concentration present when reaction rate is one-half of the 
maximal velocity. (3.7) 


Microfibrils Bundles of cellulose molecules that confer rigidity to the 
cell wall and provide resistance to pulling forces. (7.14) 


Microfilaments Solid, 8-nm thick, cytoskeletal structures composed 
of a double-helical polymer of the protein actin. They play a key role 
in virtually all types of contractility and motility within cells. (9.9) 
Micrometer Measure of length equaling 10~° meters. (1.7) 
MicroRNAs (miRNAs) Small RNAs (20-23 nucleotides long) that are 
synthesized from many sites in the genome and involved in inhibiting 


translation or increasing degradation of complementary mRNAs. 
(11.12) 


Microsatellite DNAs Short (1-9 basepair) sequences present in small 
clusters of about 10 to 40 base pairs in length, often found scattered 
throughout the genome. (10.8) 
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object. (1.1) 


Microsomes A heterogeneous collection of vesicles formed from the 
endomembrane system (primarily the endoplasmic reticulum and Golgi 
complex) after homogenization. (8.2) 


Microtubule-associated proteins (MAPs) Proteins other than tubulin 
contained in microtubules obtained from cells. MAPs may interconnect 
microtubules to form bundles and can be seen as cross-bridges 
connecting microtubules to each other. Other MAPs may increase the 
stability of microtubules, alter their rigidity, or influence the rate of their 
assembly. (9.2) 


Microtubule-organizing centers (MTOCs) A variety of specialized 
structures that exert a role in initiating microtubule formation. (9.5) 


Microtubules Hollow, cylindrical cytoskeletal structures, 25 nm in 
diameter, whose wall is composed of the protein tubulin. Microtubules 
are polymers assembled from af-tubulin heterodimers that are arranged 
in rows, or protofilaments. Because of their rigidity, microtubules often 
act in a supportive capacity. (9.1) 


Minisatellite DNAs Sequences of 10-100 base pairs found in clusters 
containing up to 3000 repeats. (10.8) 


Mismatch repair (MMR) DNA repair system that removes mismatched 
bases that are incorporated by the DNA polymerase and escape the 
enzyme’s proofreading exonuclease. (13.8) 


Mitochondrion The cellular organelle in which aerobic energy 
transduction takes place, oxidizing metabolic intermediates such as 
pyruvate to produce ATP. (5.1) 


Mitosis Process of nuclear division in which duplicated chromosomes 
are faithfully separated from one another, producing two nuclei, each 
with a complete copy of all the chromosomes present in the original cell. 
(14.1) 


Mitotic spindle Microtubule-containing “machine” that functions in 
the organization and sorting of duplicated chromosomes during mitotic 
cell division. (14.7) 


Model organisms Organisms that have been widely used for research 
so that a great deal is known about their biology. These organisms have 
properties that have made them excellent research subjects. Such 
organisms include the bacterium, E. coli; the budding yeast, S. cerevisiae; 
the nematode, C. elegans; the fruit fly, D. melanogaster; the mustard 
plant, A. thaliana; and the mouse, M. musculus. (1.5) 


Moderately repeated fraction DNA sequences that are repeated from 
a few to several hundred thousand times within a eukaryotic genome. 
The moderately repeated fraction of the DNA can vary from about 20 to 
about 80 percent of the total DNA. These sequences may be identical to 
each other or nonidentical but related. (10.8) 


Molecular chaperones Various families of proteins whose role is to 
assist the folding and assembly of proteins by preventing undesirable 
interactions. (2.12, EP2.14) 


Monoclonal antibody A preparation of antibody molecules produced 
from a single colony (or clone) of cells. (18.26) 


Motor proteins Proteins that utilize the energy of ATP hydrolysis to 
generate mechanical forces that propel the protein, as well as attached 
cargo, along one of the components of the cytoskeleton. Three families 
of motor proteins are known: kinesins and dyneins move along 
microtubules and myosins move along microfilaments. (9.2) 


Multiple sclerosis An autoimmune disease caused by the destruction 
of the myelin sheath surrounding the axons of neurons. (17.3) 


Multiprotein complex The interaction of more than one complete 
protein to form a larger, functional complex. (2.11) 


Muscle fiber A skeletal muscle cell, referred to as a muscle fiber because 
of its highly ordered, multinucleated, cablelike structure composed of 
hundreds of thinner, cylindrical myofibrils. (9.12) 


Mutant An individual having an inheritable characteristic that 
distinguishes it from the wild type. (10.4) 


Mutation A spontaneous change in a gene that alters it in a permanent 
fashion so that it causes heritable change. (10.4) 


Myelin sheath The lipid-rich material wrapped around most neurons 
in the vertebrate body. (4.16) 


Myofibrils The thin, cylindrical strands found within muscle fibers. 
Each myofibril is composed of repeating linear arrays of contractile 
units, called sarcomeres, that give skeletal muscle cells their striated 
appearance. (9.12) 


N 
Nanometer Measure of length equaling 10~° meters. (1.7) 


Native The natural three-dimensional structure of a protein or other 
macromolecule when it is inside a living cell. (2.12) 


Natural killer (NK) cell A type of lymphocyte that engages in a 
nonspecific attack on an infected host cell, leading to apoptosis. 
(17.1) 


Negative staining Procedures in which heavy-metal deposits are 
collected everywhere on the specimen grid except in the locations of very 
small particulate materials, including high-molecular-weight aggregates 
such as viruses, ribosomes, multisubunit enzymes, cytoskeletal elements, 
and protein complexes. (18.5) 


Nerve gas A chemical weapon that poisons the nervous system. (4.0) 


Nerve impulse The process through which an action potential is 
propagated along the membrane of a neuron by sequentially triggering 
action potentials in adjacent stretches of membrane. (4.17) 


Neurofilaments Loosely packed bundles of intermediate filaments 
located within the cytoplasm of neurons. Neurofilaments have long axes 
that are oriented parallel to that of the nerve cell axon and are composed 
of three distinct proteins: NF-L, NF-H, and NF-M. (9.9) 


Neuromuscular junction The point of contact of a terminus of an axon 
with a muscle fiber, the neuromuscular junction is a site of transmission 
of nerve impulses from the axon across the synaptic cleft to the muscle 
fiber. (4.18, 9.12) 


Neurotransmitter A chemical that is released from a presynaptic 
terminal and binds to the postsynaptic target cell, altering the membrane 
potential of the target cell. (4.18) 


Nitrogen fixation The process through which nitrogen gas is 
chemically reduced and converted into a component of organic 
compounds. (1.4) 


Noncompetitive inhibitor An enzyme inhibitor that does not bind at 
the same site as the substrate, and so the level of inhibition depends only 
on the concentration of inhibitor. (3.7) 


Noncovalent bond A relatively weak chemical bond based on attractive 
forces between oppositely charged regions within a molecule or between 
two nearby molecules. (2.3) 

Noncyclic photophosphorylation The formation of ATP during the 
process of oxygen-releasing photosynthesis in which electrons move in 
a linear path from H,O to NADP*. (6.8) 


Nonpolar molecules Molecules whose covalent bonds have a nearly 
symmetric distribution of charge because the component atoms have 
approximately the same electronegativities. (2.1) 


Non-receptor protein-tyrosine kinases Cytoplasmic kinases that 
phosphorylate specific tyrosine residues of select proteins. (15.10) 


Nonrepeated fraction Those DNA sequences in the genome that are 
present in only one copy per haploid set of chromosomes. These 
sequences contain the greatest amount of genetic information including 
the codes for virtually all proteins other than histones. (10.8) 


Nonsense-mediated decay(NMD) AnmRNA surveillance mechanism 
that detects mRNAs containing premature termination (nonsense) 
codons, leading to their destruction. (11.18) 


Nonsense mutations Mutations that produce stop codons within genes, 
thereby causing premature termination of the encoded polypeptide 
chain. (11.18) 


Nuclear envelope The complex, double- membrane structure that 
divides the eukaryotic nucleus from its cytoplasm. (12.2) 


Nuclear lamina A thin meshwork composed of intermediate filaments 
that lines the inner surface of the nuclear envelope. (12.2) 


Nuclear localization signal (NLS) Sequence of amino acids in a 
protein that is recognized by a transport receptor leading to translocation 
of the protein from the cytoplasm to the nucleus. (12.2) 


Nuclear pore complex (NPC) Complex, basketlike apparatus that fills 
the nuclear pore like a stopper, projecting outward into both the 
cytoplasm and the nucleoplasm. (12.2) 


Nucleic acid hybridization A variety of related techniques that 
are based on the fact that two single-stranded nucleic acid 
molecules of complementary base sequence will form a double- 
stranded hybrid. (18.18) 


Nucleoid The poorly defined region of a prokaryotic cell that contains 
its genetic material. (1.3) 


Nucleoli (sing. nucleolus) Irregular-shaped nuclear structures that 
function as ribosome-producing organelles. (11.4) 


Nucleosomes Repeating subunits of chromatin. Each nucleosome 
contains a nucleosome core particle, consisting of 146 base pairs of 
supercoiled DNA wrapped almost twice around a disc-shaped complex 
of eight histone molecules. The nucleosome core particles are connected 
to one another by a stretch of linker DNA. (12.3) 


Nucleotide A monomer, or building block, of DNA or RNA consisting 
of a nitrogenous base, a sugar and a phosphate group. ( 2.18, 10.5) 


Nucleotide excision repair (NER) A cut-and-patch mechanism for the 
removal from the DNA of a variety of bulky lesions, e.g., pyrimidine 
dimers, caused by ultraviolet radiation. (13.8) 


Nucleus The organelle that contains a eukaryotic cell’s genetic 
material. (1.3) 


O 


Objective lens The lens of a light microscope that focuses light rays 
from the specimen to form a real, enlarged image of the object within 
the column of the microscope. (18.1) 


Oils Fats that are liquid at room temperature. (2.7) 


Okazaki fragments Small segments of DNA that are rapidly linked to 
longer pieces that have been synthesized previously, to form the lagging 
strand. (13.2) 


Oligosaccharides Small chains composed of sugars covalently attached G-13 


to lipids and proteins; they distinguish one type of cell from another and 
help mediate interactions of a cell with its surroundings. (2.6, 4.3) 


Oncogenes Genes that encode proteins that promote the loss of growth 
control and the conversion of the cell to a malignant state. These genes 
have the ability to transform cells. (16.5) 


Operator Binding site for bacterial repressors that is situated between 
the polymerase binding site and the first structural gene. (12.1) 


Operon A functional complex on a bacterial chromosome comprising 
a cluster of genes including structural genes, a promoter region, an 
operator region, and a regulatory gene. (12.1) 


Organelles The organizationally and functionally diverse, membranous 
or membrane-bounded, intracellular structures that are the defining 
feature of eukaryotic cells. (1.3) 


Origin of replication The specific site on the bacterial chromosome 
where replication begins. (13.2) 


Osmosis The property of water passing through a semipermeable 
membrane from a region of lower solute concentration to one of higher 
solute concentration, with the tendency of eventually equalizing 
solute concentration in the two compartments. (4.10) 


Oxidation-reduction (redox) reaction One in which a change in the 
electronic state of the reactants occurs. (3.9) 


Oxidation—reduction (redox) potential The separation of charge, 
measured in voltage, for any given pair of oxidizing-reducing agents, such 
as NAD+ and NADH, relative to a standard couple (e.g., H* and H,). (5.4) 


Oxidative phosphorylation ATP formation driven by energy derived 
from high-energy electrons removed during substrate oxidation in 
pathways such as the TCA cycle, with the energy released for ATP 
formation by passage of the electrons through the electron-transport 
chain in the mitochondrion. (5.4) 


Oxidizing agent The substance in a redox reaction that becomes 
reduced, causing the other substance to become oxidized. (3.9) 


P 


P-bodies Cytoplasmic granules in which RNA is stored or processed 
for degradation. (12.19) 


P (peptidyl) site Site in the ribosome from which the tRNA donates 
amino acids to the growing polypeptide chain. (11.16) 


P680 The reaction center of photosystem II. “P” stands for pigment 
and “680” is the wavelength of light that this molecule absorbs most 
strongly. (6.5) 

P700 The reaction center of photosystem I. “P” stands for pigment, 
and “700” is the wavelength of light that this molecule absorbs most 
strongly. (6.5) 

Partition coefficient The ratio of a solutes solubility in oil to that in 
water, it is a measure of the relative polarity of a biological substance. 
(4.10) 


Pectins A heterogeneous class of negatively charged polysaccharides 
that make up the matrix of the plant cell wall. Pectins hold water and 
form a gel that fills in the spaces between the fibrous elements. (7.14) 


Peptide bond The chemical bond linking amino acids in a protein, 
which forms when the carboxyl group of one amino acid reacts with the 
amino group of a second amino acid. (2.8) 

Peptidyl transferase That portion of the large ribosomal subunit that is 
responsible for catalyzing peptide bond formation; peptidyl transferase 
activity resides in the large ribosomal RNA molecule. (11.17) 
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that surrounds the centrioles in an animal cell. (9.5) 


Peripheral proteins A membrane-associated protein that is located 
entirely outside of the lipid bilayer and interacts with it through 
noncovalent bonds. (4.4) 


Peroxisomes Simple, membrane-bound, multi-functional organelles of 
the cytoplasm that carry out a diverse array of metabolic reactions, 
including substrate oxidation leading to formation of hydrogen 
peroxide. For example, peroxisomes are the site of oxidation of very- 
long-chain fatty acids, the oxidation of uric acid, and the synthesis of 
plasmalogens. Plant glyoxysomes, which carry out the glyoxylate cycle, 
are a type of peroxisome. (5.10) 


Phagocytosis Process by which particulate materials are taken into 
cells. Materials are enclosed within a fold of the plasma membrane, 
which buds into the cytoplasm to form a vesicle called a phagosome. 
(8.17) 


Phase-contrast microscope A microscope that converts differences in 
refractive index into differences in intensity (relative brightness and 
darkness), which are then visible to the eye, making highly transparent 
objects more visible. (18.2) 


pH The standard measure of relative acidity, it mathematically equals 
—log[H*]. (2.4) 

PH domain A protein domain that binds to the phosphorylated inositol 
rings of membrane-bound phosphoinositides. (15.6) 


Phosphoglycerides The name given to membrane phospholipids that 
are built on a glycerol backbone. (4.2) 


Phosphoinositides Includes a number of phosphorylated phosphati- 
dylinositol derivatives (e.g., PIP,, PIP,, and PIP,) that serve as second 
messengers in signaling pathways. (15.6) 


Phospholipids Phosphate-containing lipids that represent the primary 
constituents of the lipid bilayer of cellular membranes. Phospholipids 
include both phosphoglycerides and sphingomyelin. (4.2) 


Photoautotroph An autotroph that utilizes the radiant energy of the 
sun to convert CO, into organic compounds. (6.1) 


Photolysis The splitting of water during photosynthesis. (6.6) 


Photon Packet of light energy. The shorter the wavelength, the greater 
the energy of the photons. (6.4) 


Photorespiration A series of reactions in which O, is attached to RuBP, 
and eventually resulting in the release of recently fixed CO, from the 
plant. (6.9) 


Photosynthesis The pathway converting the energy of sunlight into 
chemical energy that is usable by living organisms. (6.1) 


Photosynthetic unit A group of several hundred chlorophyll molecules 
acting together to trap photons and transfer energy to the pigment 
molecule at the reaction center. (6.5) 


PhotosystemI(PSI) One of two spatially separated pigment complexes, 
which are necessary to boost the energy of a pair of electrons sufficiently 
to remove them from a molecule of water and transfer them to NADP*. 
Photosystem I raises electrons from an energy level at about the midway 
point to an energy level above NADP*. (6.5) 


Photosystem II (PSII) One of two spatially separated pigment complexes, 
which are necessary to boost the energy of a pair of electrons sufficiently 
to remove them from a molecule of water and transfer them to NADP*. 
Photosystem II boosts electrons from an energy level below that of water 
at the bottom of the energy trough to about the midway point. (6.5) 


Phragmoplast Dense material roughly aligned in the equatorial plane of 
the previous metaphase plate in plant cells, consisting of clusters of 
interdigitating microtubules oriented perpendicular to the future cell plate, 
together with vesicles and associated electron-dense material. (14.11) 


Pigments Molecules that contain a chromophore, a chemical group 
capable of absorbing light of particular wavelength(s) within the visible 
spectrum. (6.4) 


Piwi-interacting (piRNAs) Small RNAs (24-32 bases) that are encoded 
by a small number of large genomic loci and act to suppress the movement 
of transposable elements in germ cells. piRNAs are derived from single- 
stranded precursors and do not require Dicer for processing. (11.12) 


Plasma cells Terminally differentiated cells that develop from 
B lymphocytes that synthesize and secrete large amounts of blood-borne 
antibodies. (17.2) 


Plasma membrane The membrane serving as a boundary between the 
interior of a cell and its extracellular environment. (4.1) 


Plasmodesmata (singular, plasmodesma) Cytoplasmic channels, 30 to 
in diameter, that connect most plant cells and extend between adjacent 
cells directly through the cell wall. Plasmodesmata are lined with plasma 
membrane and usually contain a dense central structure, the desmotubule, 
derived from the endoplasmic reticulum of the two cells. (7.12) 


Plasmolysis The shrinkage that occurs when a plant cell is placed into 
a hypertonic medium; its volume shrinks as the plasma membrane pulls 
away from the surrounding cell wall. (4.10) 


Polar molecules Molecules with an uneven distribution of charge 
because the component atoms of various bonds have greatly different 
electronegativities. (2.1) 


Poly(A) tail A string of adenosine residues at the 3° end of an mRNA 
added posttranscriptionally. (11.7) 


Polyacrylamide gel electrophoresis (PAGE) Protein fractionation 
technique in which the proteins are driven by an applied current through 
a gel composed of a small organic molecule (acrylamide) that is cross- 
linked to form a molecular sieve. (18.13) 


Polymerase chain reaction (PCR) A technique in which a single 
region of DNA which may be present in vanishingly small amounts, can 
be amplified cheaply and rapidly. (18.21) 


Polypeptide chain A long, continuous unbranched polymer formed by 
amino acids joined to one another by covalent peptide bonds. (2.8) 


Polyploidization (whole-genome duplication) Phenomenon in which 
offspring have twice the number of chromosomes in each cell as their 
diploid parents. Can be an important step in the evolution of a new 
species. (10.10) 


Polyribosome (polysome) The complex formed by an mRNA and a 
number of ribosomes in the process of translating that mRNA. (11.19) 


Polysaccharide A polymer of sugar units joined by glycosidic bonds. 
(2.6) 


Polytene chromosomes Giant chromosomes of insects that contain 
perfectly aligned, duplicated DNA strands, with as many as 1,024 times 
the number of DNA strands of normal chromosomes. (10.4) 


Pore domain The part ofa voltage-gated ion channel that forms an ion- 
conducting channel through the membrane. (4.11) 

Postsynaptic cell A cell that receives neurotransmitters from another 
cell across a synapse. (4.18) 

Posttranslational level control Regulation of gene expression after 
translation by controlling the activity or stability of proteins. (12.9) 


Posttranslational modifications (PTMs) Alterations to the side chains 
of the 20 basic amino acids after their incorporation into a polypeptide 
chain. (2.8) 


Presynaptic cell A cell that sends neurotransmitters to another cell 
across a synapse. (4.18) 


Pre-RNA An RNA molecule that has not yet been processed into its 
final mature form (e.g.,a pre-mRNA, pre-rRNA, or pre-tRNA). (11.3) 


Primary culture Culturing of cells obtained directly from the 
organism. (18.9) 


Primary electron acceptor Molecule that receives the photoexcited 
electron from reaction-center pigments in both photosystems. (6.5) 


Primary transcript (or pre-RNA) The initial RNA molecule 
synthesized from DNA, which is equivalent in length to the DNA from 
which it was transcribed. Primary transcripts typically have a fleeting 
existence, being processed into smaller, functional RNAs by a series of 
“cut-and-paste” reactions. (11.3) 


Primary walls The walls of a growing plant cell. They allow for 
extensibility. (7.14) 


Primase Type of RNA polymerase that assembles the short RNA 
primers that begin the synthesis of each Okazaki fragment of the lagging 
strand. (13.2) 


Primer The DNA or RNA strand that provides DNA polymerase with 
the necessary 3' OH terminus. (13.2) 


Processing-level control Regulation of the path by which a primary 
RNA transcript is processed into a messenger RNA that can be translated 
into a polypeptide. (12.9) 

Processive A term applied to proteins (e.g., kinesin or RNA polymerase) 
that are capable of moving considerable distances along their track or 
template (e.g., a microtubule or a DNA molecule) without dissociating 
from it. (9.3) 


Prokaryotic cells Structurally simple cells, including archaea and 
bacteria that do not have membrane-bounded organelles; derived from 
pro-karyon, or “before the nucleus.” (1.3) 


Prometaphase The phase of mitosis during which the definitive mitotic 
spindle is formed and the chromosomes are moved into position at the 
center of the cell. (14.8) 


Promoter The site on the DNA to which an RNA polymerase molecule 
binds prior to initiating transcription. The promoter contains 
information that determines which of the two DNA strands is 
transcribed and the site at which transcription begins. (11.2, 12.1) 


Proofreading The process of correcting mistakes made by a polymerase 
after they have incorporated a monomer. (11.2) 


Prophase The first stage of mitosis during which the duplicated 
chromosomes are prepared for segregation and the mitotic machinery is 
assembled. (14.7) 


Proplastids Nonpigmented precursors of chloroplasts. (6.2) 


Proteasome Barrel-shaped, multiprotein complex in which cytoplasmic 
proteins are degraded. Proteins selected for destruction are linked to 
ubiquitin molecules and threaded into the central chamber of the 
proteasome. (12.21) 


Protein kinase An enzyme that transfers phosphate groups to other 
proteins, often having the effect of regulating the activity of the other 
proteins. (3.12) 


Proteins Structurally and functionally diverse group of polymers built 
of amino acid monomers. (7.14) 


Protein tyrosine kinases Enzymes that phosphorylate specific tyrosine 
residues of other proteins. (15.10) 


Proteoglycan A protein-polysaccharide complex consisting of a core 
protein molecule to which chains of glycosaminoglycans are attached. 
Due to the acidic nature of the glycosaminoglycans, proteoglycans are 
capable of binding huge numbers of cations, which in turn draw huge 
numbers of water molecules. As a result, proteoglycans form a porous, 
hydrated gel that acts like a “packing” material to resist compression. (7.3) 


Proteome The entire inventory of proteins in a particular organism, 
cell type, or organelle. (2.15) 


Proteomics Expanding field of protein biochemistry that performs G-15 


large-scale studies on diverse mixtures of proteins. (2.15) 


Protofilaments Longitudinally arranged rows of globular subunits of 
a microtubule that are aligned parallel to the long axis of the tubule. 
(9.2) 


Proto-oncogenes A variety of genes that have the potential to subvert the 
cell’s own activities and push the cell toward the malignant state. Proto- 
oncogenes encode proteins that have various functions in a cell’s normal 
activities. Proto-oncogenes can be converted to oncogenes. (16.5) 


Proton-motive force (Ap) An electrochemical gradient that is built up 
across energy-transducing membranes (inner mitochondrial membrane, 
thylakoid membrane, bacterial plasma membrane) following the 
translocation of protons during electron transport. The energy of the 
gradient, which is comprised of both a pH gradient and a voltage and is 
measured in volts, is utilized in the formation of ATP. (5.6) 


Protoplast A naked plant cell whose cell wall has been digested away by 
the enzyme cellulase. (18.9) 


Provirus The term for viral DNA when it has been integrated into the 
DNA of its host cell’s chromosome(s). (1.8) 


Pseudogenes Sequences that are clearly homologous to functional 
genes, but have accumulated mutations that render them nonfunctional. 
(10.10) 


Purine A class of nitrogenous base found in nucleotides that has a 
double-ring structure, including adenine and guanine, which are found 
in both DNA and RNA. (2.18, 10.5) 


Pyrimidine A class of nitrogenous base found in nucleotides that has 
a single-ring structure, including cytosine and thymine, which are 
found in DNA, and cytosine and uracil, which are found in RNA. (2.18, 
10.5) 


Q 


Quality control Cells contain various mechanisms that ensure that the 
proteins and nucleic acids they synthesize have the appropriate structure. 
For example, misfolded proteins are translocated out of the ER and 
destroyed by proteasomes in the cytosol; mRNAs that contain premature 
termination codons are recognized and destroyed; and DNA containing 
abnormalities (lesions) are recognized and repaired. (8.6) 


Quaternary structure The three-dimensional organization of a protein 
that consists of more than one polypeptide chain, or subunit. (2.11) 


Rabs A family of monomeric G proteins involved in vesicle trafficking. 
(8.13) 


Ran A GTP-binding protein that exists in an active GTP-bound form 
or an inactive GDP-bound form. Ran regulates nucleocytoplasmic 
transport. (12.2) 


Ras-MAP kinase cascade A cascade that is turned on in response to a 
wide variety of extracellular signals and plays a key role in regulating 
vital activities such as cell proliferation and differentiation. (15.11) 


rDNA The DNA sequences encoding rRNA that are normally repeated 
hundreds of times and are typically clustered in one or a few regions of 
the genome. (11.4) 

Reaction-center chlorophyll The single chlorophyll molecule of the 
several hundred or so in the photosynthetic unit that actually transfers 
electrons to an electron acceptor. (6.5) 

Reading frame A division of an RNA sequence into groups of three 
nucleotides which constitute codons. (11.16) 
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often leading to uptake or signal transduction. (4.1, 15.1) 


Receptor protein-tyrosine kinases (or RTKs) Cell-surface receptors 
that, following ligand binding, can phosphorylate tyrosine residues on 
themselves and/or on cytoplasmic substrates. They are involved 
primarily in the control of cell growth and differentiation. (15.10) 


Receptor-mediated endocytosis (RME) Endocytic mechanism which 
requires the binding of solute molecules such as LDL or transferrin to a 
specific cell-surface receptor. (8.17) 


Recombinant DNA Molecules containing DNA sequences derived 
from more than one source. (18.20) 


Reducing agent The substance in a redox reaction that becomes 
oxidized, causing the other substance to become reduced. (3.9) 


Reducing power The potential in a cell to reduce metabolic 
intermediates into products, usually measured through the size of the 
NADPH pool. (3.11) 


Regulated secretion Discharge of materials synthesized in the cell that 
have been stored in membrane-bound secretory granules in the 
peripheral regions of the cytoplasm, occurring in response to an 
appropriate stimulus. (8.1) 


Regulatory gene Gene that codes for a bacterial repressor protein. 
(12.1) 


Renaturation (reannealing) Reassociation of complementary single 
strands of a DNA double helix that had been previously denatured. (10.8) 


Replica Metal-carbon cast of a tissue surface used in electron 
microscopy. Variations in the thickness of the metal in different parts of 
the replica cause variations in the number of penetrating electrons to 
reach the viewing screen. (18.5) 


Replication foci Localization of active replication forks in the cell 
nucleus. There are about 50 to 250 foci, each of which contains 
approximately 40 replication forks incorporating nucleotides into DNA 
strands simultaneously. (13.6) 


Replication forks The points at which the pair of replicated segments 
of DNA come together and join the nonreplicated segments. Each 
replication fork corresponds to a site where (1) the parental double helix 
is undergoing strand separation, and (2) nucleotides are being 
incorporated into the newly synthesized complementary strands. (13.2) 


Repressor A gene regulatory protein that binds to DNA and inhibits 
transcription. (12.1) 


Resolution The ability to see two neighboring points in the visual field 
as distinct entities. (18.1) 


Response elements The sites at which specific transcription factors 
bind to the regulatory regions of a gene. (12.14) 


Resting potential The electrical potential difference measured for an 
excitable cell when it is not subject to external stimulation. (4.16) 


Restriction endonucleases (or restriction enzymes) Nucleases 
contained in bacteria that recognize short nucleotide sequences within 
duplex DNA and cleave the backbone at highly specific sites on both 
strands of the duplex. (18.20) 


Retrotransposons Transposable elements that require 
transcriptase for their movements within the genome. (10.11) 


reverse 


Reverse transcriptase An RNA-dependent DNA polymerase. An 
enzyme that uses RNA as a template to synthesize a complementary 
strand of DNA. [An enzyme that is found in RNA-containing viruses 
and used in the laboratory to synthesize cDNAs.] (10.11) 

Rheumatoid arthritis A condition caused by the progressive 
destruction of joint tissue by a cascade of inflammatory autoimmune 
responses. (17.3) 

Ribonucleic acid (RNA) A single-stranded nucleic acid composed of a 
polymeric chain of ribose-containing nucleotides. (2.18) 


Ribosomal RNAs (or rRNAs) The RNAs of a ribosome. rRNAs 
recognize and bind other molecules, provide structural support, and 
catalyze the chemical reaction in which amino acids are covalently 
linked to one another. (11.1) 


Riboswitches Sequences within mRNAs that, once bound to a 
metabolite, undergo a change in their folded conformation that allows 
them to alter the expression of a gene involved in production of that 
metabolite. Most riboswitches suppress gene expression by blocking 
either termination of transcription or initiation of translation. (12.1) 


Ribozyme An RNA molecule that functions as a catalyst in cellular 
reactions. (2.18, 11.7) 


RNA editing Changes that are made to the sequence of an RNA 
molecule after it has been transcribed, that turn one nucleotide into 
another. (12.18) 


RNA interference (RNAi) A naturally occurring phenomenon in which 
double-stranded RNAs (dsRNAs) lead to the degradation of mRNAs 
having identical sequences. RNAi is believed to function primarily in 
blocking the replication of viruses and restricting the movement of 
mobile elements, both of which involve the formation of dsRNA 
intermediates. Mammalian cells can be made to engage in RNAi by 
treatment of the cells with small (21 nt) RNAs. These small RNAs 
(siRNAs) induce the degradation of mRNAs that contain the same 
sequence. (11.10) 


RNA sequencing Analysis of gene expression patterns by determining 
the sequence of RNA isolated from a cell, allowing the number of 
transcripts to be determined for every gene in the genome. (12.10) 


RNA silencing A process in which small, noncoding RNAs, typically 
derived from longer double-stranded precursors, trigger sequence- 
specific inhibition of gene expression. (11.10) 


RNA splicing The process of removing the intervening DNA sequences 
(introns) from a primary transcript. (11.7) 


RNA tumor viruses Retroviruses capable of infecting vertebrate cells, 
transforming them into cancer cells. RNA viruses have RNA in the 
mature virus particle. (16.2) 


RNA world A proposed stage in the early evolution of life before the 
appearance of DNA and proteins in which RNA molecules served both 
as genetic material and catalysts. (11.0) 


Rough endoplasmic reticulum (RER) That part of the endoplasmic 
reticulum that has ribosomes attached. The RER appears as an extensive 
membranous organelle composed primarily of flattened sacs (cisternae) 
separated by a cytosolic space. RER functions include synthesis of 
secretory proteins, lysosomal proteins, integral membrane proteins, and 
membrane lipids. (8.3) 


=) 
S phase The phase of the cell cycle in which replication occurs. (14.1) 


Saltatory conduction Propagation of a nerve impulse when one action 
potential triggers another at the adjacent stretch of unwrapped 
membrane (i.e., propagating by causing the action potentials to jump 
form one node of Ranvier to the next). (4.17) 


Sarcomeres Contractile units of myofibrils that are endowed with a 
characteristic pattern of bands and stripes that give skeletal muscle cells 
their striated appearance. (9.12) 

Sarcoplasmic reticulum (SR) A system of cytoplasmic, Ca**-storing 
SER membranes in muscle cells that forms a membranous sleeve around 
the myofibril. (9.12) 

Satellite DNAs Short sequences (5-100 base pairs in length) that form 
large linear arrays in the genome. (10.8) 


Saturated fatty acids Those lacking double bonds between carbons. 
(2.7) 


Saturation A condition in which every molecule of an enzyme or 
transporter is bound with its substrate molecule, so that adding more 
substrate has no additional effect on the rate of reaction or 
transport. (4.13) 


Secondary culture Transfer of previously cultured cells to a culture 
medium. (18.9) 


Secondary walls Thicker cell walls found in most mature plant cells. 
(7.14) 


Second law of thermodynamics A measure of how tightly two 
molecules can bind to each other. (3.1) 


Second messenger A substance that is formed in the cell as the result of 
the binding of a first messenger—a hormone or other ligand—to a 
receptor at the outer surface of the cell. (15.1) 


Secreted Discharged outside the cell. (8.1) 


Secretory granule Large, densely packed, membrane-bound structure 
containing highly concentrated secretory materials that are discharged 
into the extracellular space (secreted) following a stimulatory signal. 
(8.1) 


Secretory pathway (biosynthetic pathway) Route through the 
cytoplasm by which materials are synthesized in the endoplasmic 
reticulum or Golgi complex, modified during passage through the Golgi 
complex, and transported within the cytoplasm to various destinations 
such as the plasma membrane, a lysosome, or a large vacuole of a plant 
cell. Many of the materials synthesized in the endoplasmic reticulum or 
Golgi complex are destined to be discharged outside the cell; hence the 
term secretory pathway has been used. (8.1) 


Section A very thin slice of tissue. (18.2) 


Selectins A family of integral membrane glycoproteins that recognize 
and bind to specific arrangements of carbohydrate groups projecting 
from the surface of other cells. (7.7) 


Self-assembly The property of proteins (or other structures) to assume 
the correct (native) conformation based on the chemical behavior 
dictated by the amino acid sequence. (2.12) 


Semiconservative Replication in which each daughter cell receives one 
strand of the parent DNA helix. (13.1) 


Semipermeable The membrane property of being freely permeable to 
water while allowing much slower passage to small ions and polar 
solutes. (4.10) 


Side chain or R group The defining functional group of an amino 
acid, which can range from a single hydrogen to complex polar or 
non-polar units in the 20 amino acids most commonly found in cells. 
(2.8) 


Signal peptidase Proteolytic enzyme that removes the N-terminal 
portion including the signal peptide ofa nascent polypeptide synthesized 
in the RER. (8.4) 


Signal recognition particle (SRP) A particle consisting of six distinct 
polypeptides and a small RNA molecule, called the 7S RNA, that 
recognizes the signal sequence as it emerges from the ribosome. SRP 
binds to the signal sequence and then to an ER membrane. (8.4) 


Signal sequence Special series of amino acids located at the N-terminal 
portion of newly forming proteins that triggers the attachment of the 
protein-forming ribosome to an ER membrane and the movement of 
the nascent polypeptide into the cisternal space of the ER. (8.4) 


Signaling pathways The information superhighways of the cell. Each 
consists of a series of distinct proteins that operate in sequence. Each 
protein in the pathway acts by altering the conformation of the 
downstream protein in the series. (15.1) 


Signal transduction The overall process in which information carried 
by extracellular messenger molecules is translated into changes that 
occur inside a cell. (4.1, 15.1) 


Single nucleotide polymorphisms (SNPs) Sites in the genome where G-17 


alternate bases are found with high frequency in the population. SNPs 
are excellent genetic markers for genome mapping studies. (10.15) 


Single-particle reconstruction A technique that employs electron 
cryomicroscopy to visualize and analyze numerous _ purified 
macromolecules (seen as particles) in order to solve the structure of a 
macromolecule or macromolecular complex. (18.16) 


Single-particle tracking (SPT) A technique for studying movement of 
membrane proteins that consists of two steps: (1) linking the protein 
molecules to visible substances such as colloidal gold particles and 
(2) monitoring the movements of the individual tagged particles under 
the microscope. (4.7) 


Single-stranded DNA-binding (or SSB) proteins Proteins that 
facilitate the separation of the DNA strands by their attachment to bare, 
single DNA strands, keeping them in an extended state and preventing 
them from becoming rewound. (13.3) 


Site-directed mutagenesis A research technique to modify a gene ina 
predetermined way so as to produce a protein with a specifically altered 
amino acid sequence. (2.16, 18.25) 


Small interfering RNAs (siRNAs) Small (21-23 nucleotide), double- 
stranded fragments formed when double-stranded RNA initiates the 
response during RNA silencing. (11.10) 


Small, nucleolar RNAs (snoRNAs) RNAs required for the methylation 
and pseudouridylation of pre-rRNAs during ribosome formation in the 
nucleolus. (11.4) 


Small nuclear RNAs (snRNAs) RNAs required for mRNA processing 
that are small (90 to 300 nucleotides long) and that function in the 
nucleus. (11.7) 


Small, nucleolar ribonucleoproteins (sno or RNPs) Particles that are 
formed when snoRNAs are packaged with particluar proteins; snoRNPs 
play a role in the maturation and assembly of ribosomal RNAs. (11.4) 


Smooth endoplasmic reticulum (SER) That part of the endoplasmic 
reticulum that is without attached ribosomes. The membranous 
elements of the SER are typically tubular and form an interconnecting 
system of pipelines curving through the cytoplasm in which they occur. 
The SER functions vary from cell to cell and include the synthesis 
of steroid hormones, detoxification of a wide variety of organic 
compounds, mobilization of glucose from glucose 6-phosphate, and 
sequestration of calcium ions. (8.3) 


SNAREs Key proteins that mediate the process of membrane fusion. 
t-SNAREs are located in the membranes of target compartments. 
v-SNAREs incorporate into the membranes of transport vesicles during 
budding. (8.13) 


snRNPs Distinctribonucleoprotein particles contained in spliceosomes, 
so called because they are composed of snRNAs bound to specific 
proteins. (11.7) 


Specific activity The ratio of the amount of a protein of interest to the 
total amount of protein present in a sample, which is used as a measure 
of purification. (18.11) 


Specificity The property of selective interaction between components 
of a cell that is basic to life. (2.8) 


Spectrophotometer Instrument used to measure the amount of light 
of a specific wavelength that is absorbed by a solution. If one knows 
the absorbance characteristics of a particular type of molecule, then 
the amount of light of the appropriate wavelength absorbed by a 
solution of that molecule provides a sensitive measure of its 
concentration. (18.14) 


Sphingolipids A class of membrane  lipids—derivations of 
sphingosine—that consist of sphingosine linked to a fatty acid by its 
amino group. (4.2) 
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transition between metaphase and anaphase; the spindle checkpoint is 
best revealed when a chromosome fails to become aligned properly at 
the metaphase plate. (14.10) 


Splice sites The 5‘ and 3’ ends of each intron. (11.7) 


Spliceosome A macromolecular complex containing a variety of 
proteins and a number of distinct ribonucleoprotein particles that 
functions in removal of introns from a primary transcript. (11.7) 


Split genes Genes with intervening sequences. (11.6) 


Spontaneous reactions Reactions that are thermodynamically favorable, 
capable of occurring without any input of external energy. (3.1) 


Sporophyte A diploid stage of the life cycle of plants that begins with 
the union of two gametes to form a zygote. During the sporophyte stage, 
meiosis occurs, producing spores that germinate directly into a haploid 
gametophyte. (14.12) 


SRP receptor Situated within the ER membrane, the SRP receptor 
binds specificially with the SRP-ribosome complex. (8.4) 


Standard free-energy change (AG) The change in free energy when 
one mole of each reactant is converted to one mole of each product 
under defined standard conditions: temperature of 298 K and pressure 
of 1 atm. (3.2) 


Starch Mixture of two glucose polymers, amylose and amylopectin, 
that serves as readily available chemical energy in most plant cells. 
(2.6) 


Steady state Metabolic condition in which concentrations of reactants 
and products are essentially constant, although individual reactions may 
not be at equilibrium. (3.4) 


Stem cells Cells situated in various tissues of the body that constitute a 
reserve population capable of giving rise to the various cells of that 
tissue. Stem cells can be defined as undifferentiated cells that are capable 
of both (1) self-renewal, that is, production of cells like themselves, and 
(2) differentiation into two or more mature cell types. (HP 1.6) 


Stroma Space outside the thylakoid but within the relatively 
impermeable inner membrane of the chloroplast envelope. (6.2) 
Structural genes Genes that code for protein molecules. (12.1) 
Subcellular fractionation An approach that allows different organelles 
(e.g. nucleus, mitochondrion, plasma membrane, endoplasmic reticulum) 
having different properties, to be separated from one another. (8.2) 
Substrate The reactant bound by an enzyme. (3.5) 

Substrate-level phosphorylation Direct synthesis of ATP through the 
transfer of a phosphate group from a substrate to ADP. (3.10) 

Subunit A polypeptide chain that associates with other chains 
(subunits) to form a complete protein or protein complex. (2.11) 
Supercoiled A molecule of DNA that has greater or fewer than 10 base 
pairs per turn of the helix. (10.7) 

Synapse The specialized junction of a neuron with its target cell. (4.18) 
Synapsis The process by which homologous chromosomes become 
joined to one another during meiosis. (14.13) 

Synaptic cleft The narrow gap between two excitable cells. A 
presynaptic cell conducts impulses toward a synapse; a postsynaptic 
cell always lies on the receiving side of a synapase. (4.18) 

Synaptic vesicles The storage sites for neurotransmitter within the 
terminal knobs of a neuronal axon. (4.18) 

Synaptonemal complex (SC) A ladderlike structure composed of three 
parallel bars with many cross fibers. The SC holds each pair of 


homologous chromosomes in the proper position to allow the 
continuation of genetic recombination between strands of DNA. (14.13) 


Systemic lupus erythematosus (SLE) Autoimmune disorder where 
antibodies are produced against various nuclear antigens, resulting in 
damage of diverse tissues in the body. (17.3) 


T 


T-cell receptor (TCR) Proteins present on the surface of T lymphocytes 
that mediate interaction with specific cell-bound antigens. Like the 
immunoglobulin of B cells, these proteins are formed by a process of 
DNA rearrangement that generates a specific antigen-combining site. 
TCRs consist of two subunits, each containing both a variable and a 
constant domain. (17.6) 


T lymphocytes (or T cells) Lymphocytes that respond to antigen by 
proliferating and differentiating into either CTLs (cytotoxic 
lymphocytes) that attack and kill infected cells or Th cells that are 
required for antibody production by B cells. These cells attain their 
differentiated state in the thymus. (17.1) 


Tandem repeats Class of repeated DNA sequences that includes 
satellite, minisatellite, and microsatellite DNAs. (10.8) 


Telomerase A novel enzyme that can add new repeat units of DNA to 
the 3’ end of the overhanging strand of a telomere.Telomerase is a 
reverse transcriptase that synthesizes DNA using an RNA template. 
(12.5) 


Telomere An unusual stretch of repeated DNA sequences, which forms 
a “cap” at each end of a chromosome. (12.5) 


Telophase The final stage of mitosis in which daughter cells return to 
the interphase condition: the mitotic spindle disassembles, the nuclear 
envelope reforms, and the chromosomes become more and more 
dispersed until they disappear from view under the microscope. 
(14.11) 


Temperature-sensitive (ts) mutations Mutations that are only expressed 
phenotypically when the cells (or organism) are grown at a higher 
(restrictive) temperature. At the lower (permissive) temperature, the 
encoded protein is able to hold together sufficiently well to carry out its 
activity, leading to a relatively normal phenotype. ts mutations are 
particularly useful for studying required activities such as secretion and 
replication, because “ordinary” mutations affecting these processes are 
typically lethal. (13.2) 


Template A single strand of DNA (or RNA) that contains the 
information (encoded as a nucleotide sequence) for construction of a 
complementary strand. (13.2) 


Thermodynamics Study of the changes in energy accompanying 
events in the physical universe. (3.1) 


Thick filaments One of two distinct types of filaments that give 
sarcomeres their characteristic appearance. Thick filaments consist 
primarily of myosin and are surrounded by a hexagonal array of thin 
filaments. (9.12) 


Thin filaments One of two distinct types of filaments that give 
sarcomeres their characteristic appearance. Thin filaments consist 
primarily of actin and are arranged in a hexagonal array around each 
thick filament, with each thin filament situated between two thick 
filaments. (9.12) 


Threading A computer-based method for guessing the three 
dimensional structure of a protein by aligning an amino acid sequence 
onto the three dimensional structure of a related protein whose structure 
is already known. (2.12) 


Threshold The point during depolarization of an excitable cell where 
voltage-gated sodium channels open, with the resulting Na* influx 
causing a brief reversal in membrane potential. (4.16) 


Thylakoids Flattened, membranous sacs formed by the chloroplast’s 
internal membrane, which contain the energy-transducing machinery 
for photosynthesis. (6.2) 


Thyroiditis A disease that develops from an immune attack against one 
or more thyroid proteins, leading to destruction of the thyroid gland 
and hypothyroidism. (17.3) 


Tight junctions Specialized contacts that occur at the very apical end 
of the junctional complex between adjacent epithelial cells. The 
adjoining membranes make contact at intermittent points, where 
integral proteins of the two adjacent membranes meet. (7.11) 


Toll-like receptors (TLRs) A type of pathogen receptor of the innate 
immune system. Humans express at least 10 functional TLRs, all of 
which are transmembrane proteins present on the surfaces of many 
different types of cells. (17.1) 


Tonoplast The membrane that bounds the vacuole of a plant cell. 
(8.16) 


Topoisomerases Enzymes found in both prokaryotic and eukaryotic 
cells that are able to change the supercoiled state of the DNA duplex. 
They are essential in processes, such as DNA replication and transcription, 
that require the DNA duplex to unwind. (10.7) 


Trans cisternae The Golgi apparatus compartments that house the 
catalysis of late protein modifications, and are typically distal from the 
nucleus and ER. (8.9) 


Trans Golgi network (TGN) A network of interconnected tubular 
elements at the trans end of the Golgi complex that sorts and targets 
proteins for delivery to their ultimate cellular or extracellular 
destination. (8.9) 


Transcription factors Auxiliary proteins (beyond the polypeptides 
that make up the RNA polymerases) that bind to specific sties in the 
DNA and alter the transcription of nearby genes. (11.2, 12.4) 


Transcription The formation of a complementary RNA from a DNA 
template. (11.1) 


Transcriptional-level control Determination whether a particular 
gene can be transcribed and, if so, how often. (12.9) 


Transcription unit The corresponding segment of DNA on which a 
primary transcript is transcribed. (11.3) 


Transcriptome The entire inventory of RNAs transcribed by a 
particular cell, tissue, or organism. (11.13) 


Transduction The incorporation of a gene into a cellular genome by 
means of a virus. (3.1, 18.24) 


Transfection A process by which naked DNA is introduced into 
cultured cells typically leading to the incorporation of the DNA into the 
cellular genome and its subsequent expression. (18.24) 


Transfer potential A measure of the ability of a molecule to transfer 
any group to another molecule, with molecules having a higher affinity 
for the group being the better acceptors and molecules having a lower 
affinity better donors. (3.10) 


Transfer RNAs (tRNAs) A family of small RNAs that translate the 
information encoded in the nucleotide “alphabet” of an mRNA into the 
amino acid “alphabet” of a polypeptide. (11.1) 

Transgene A gene that has been stably incorporated into a cellular 
genome by the process of transfection. (18.24) 


Transgenic animals Animals that have been genetically engineered so 
that their chromosomes contain foreign genes. (18.24) 


Transition state The point during a chemical reaction at which bonds 
are being broken and reformed to yield products. (3.5) 


Transition temperature The temperature at which a membrane is 
converted from a fluid state to a crystalline gel in which lipid-molecule 
movement is greatly reduced. (4.6) 


Translation Synthesis of proteins in the cytoplasm using the informa- G-19 


tion encoded by an mRNA (11.1, 11.8) 


Translational-level control Determination whether a particular mRNA 
is actually translated and, if so, how often and for how long a period. 
(12.9) 


Translocation A chromosomal aberration that results when all or part 
of one chromosome becomes attached to another chromosome. (12.6) 


Translocation The step in translation during which the ribosome 
moves from one codon to the next. (11.17) 


Translocon A protein-lined channel embedded in the ER membrane; 
the nascent polypeptide is able to move through the translocon in 
its passage from the cytosol to the ER lumen. (8.4) 


Transmembrane domain The portion of a membrane protein that 
passes through the lipid bilayer, often composed of non-polar amino 
acids in an a-helical conformation. (4.5) 


Transmembrane protein A protein that is inserted into a membrane 
and spans across the whole width of the membrane. (4.4) 


Transmembrane signaling Transfer of information across the plasma 
membrane. (7.10) 


Transposable elements DNA segments that move from one place on a 
chromosome to a completely different site, often affecting gene 
expression. (10.11) 


Transposition Movement of DNA segments from one place on a 
chromosome to an entirely different site, often affecting gene 
expression. (10.11) 


Transposons DNA segments capable of moving from one place in the 
genome to another. (10.11) 


Transverse (T) tubules Membranous folds along which the impulse 
generated in a skeletal muscle cell is propagated into the interior of the 
cell. (9.12) 


Triacylglycerols Polymers consisting of a glycerol backbone linked by 
ester bonds to three fatty acids, commonly called fats. (2.7) 


Tricarboxylic acid cycle (TCA cycle) The circular metabolic pathway 
that oxidizes acetyl CoA, conserving its energy; the cycle is also known 
as the Krebs cycle or the citric acid cycle. (3.9, 5.2) 


Tumor-suppressor genes Genes that encode proteins that restrain cell 
growth and prevent cells from becoming malignant. (16.5) 


Turnover The regulated destruction of cellular materials and their 
replacement. (8.15) 


Turnover number The maximum number of substrate molecules that 
can be converted to product by one enzyme molecule per unit of time. 
(3.7) 


Type I diabetes Autoimmune disorder resulting in the destruction of 
insulin-secreting b cells in the pancreas. (17.3) 


U 


Ubiquinone A component of the electron transport chain, ubiquinone 
is a lipid-soluble molecule containing a long hydrophobic chain 
composed of five-carbon isoprenoid units. (5.4) 


Ubiquitin A small, highly conserved protein that is linked to proteins 
targeted for internalization by endocytosis or degradation in 
proteasomes. (12.21) 


Unconventional myosins (See Myosin.) (9.11) 


Unfolded protein response (UPR) A comprehensive response 
that occurs in cells whose ER cisternae contain an excessively high 
concentration of unfolded or misfolded proteins. Sensors that 
detect this situation trigger a pathway that leads to the synthesis of 
proteins (e.g., molecular chaperones) that can alleviate the stress in 
the ER. (8.7) 


AYVSSO1D 


G-20 Unsaturated fatty acids Those having one or more double bonds 
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between carbon atoms. (2.7) 


Untranslated regions (UTRs) Noncoding segments contained at both 
5’ and 3’ ends of mRNAs. (12.19) 


Usher syndrome a genetic disorder characterized by hearing and vision 
impairment. (7.7) 


Vv 


Vacuole A single membrane-bound, fluid filled structure that 
comprises as much as 90% of the volume of many plant cells. (8.16) 


Van der Waals force A weak attractive force due to transient 
asymmetries of charge within adjacent atoms or molecules. (2.3) 


Virion The form a virus assumes outside of a cell, which consists of a 
core of genetic material surrounded by a protein or lipoprotein capsule. 
(1.8) 


Viroids Small, obligatory intracellular pathogens, that, unlike 
viruses, consist only of an uncoated circle of genetic material, RNA. 
(1.8) 

Viruses Small, obligatory intracellular pathogens that are not 
considered to be alive because they cannot divide directly, which is 
required by the cell theory of life. (1.8) 

Voltage-gated channel An ion channel that opens or closes in response 
to changes in the voltage across a membrane. (4.11) 


Voltage-sensing domain The portion of a voltage-gated ion channel 
that allows it to respond to membrane voltage. (4.11) 


Ww 


Whole mount A specimen to be observed with a microscope that is an 
intact object, either living or dead, and can be an entire intact organism 
or a small part of a large organism. (18.2) 


Wild type The original strain of a living organism from which other 
organisms are bred for research. (10.4) 


Xx 


X-ray crystallography (or X-ray diffraction) A technique that 
bombards protein crystals with a thin beam of X-rays of a single 
(monochromatic) wavelength. The radiation that is diffracted by the 
electrons of the protein atoms strikes a photographic plate or sensor. 
The diffraction pattern produced by the crystal is determined by the 
structure within the protein. (2.10, 18.16) 


Y 


Yeast artificial chromosomes (YACs) Cloning elements that are 
artificial versions of a normal yeast chromosome. They contain all of the 
elements of a yeast chromosome that are necessary for the structure to 
be replicated during S phase and segregated to daughter cells during 
mitosis, plus a gene whose encoded product allows those cells containing 
the YAC to be selected from those that lack the element and the DNA 
fragment to be cloned. (18.23) 


Yeast two-hybrid system A technique used to search for protein- 
protein interactions. It depends on the expression of a reporter gene 
such as 8-galactosidase, whose activity is readily monitored by a test that 
detects a color change when the enzyme is present in a population of 
yeast cells. (18.12) 
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tumor-suppressor genes, 638-646, 638f, 
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Cannabinoid (CB1) receptors, 165 
Capecchi, Mario, 735 
Capping proteins, 352 
Capsids, 23, 24f 
Captopril, 99 
Carbohydrates, 42-46 
aldotetroses, 42, 43f 
formation of « and B pyranose, 43, 43f 
glycosidic bonds, 43-44, 43f, 44f 
in plasma membranes, 122-123, 122f, 123f 
polysaccharides, 44-46 
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Carbohydrate synthesis, 213-219 
in CAM plants, 219 
in C, plants, 213-217, 214f 
in C,, 218-219, 218f 
peroxisomes, 217 
photorespiration, 216-217, 216f, 217f 
redox control, 215-216, 215f 
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547-548, 548f 
Cdk2, 549, 549f, 550, 551f, 552, 557 
CDNA libraries, 731-732, 732f 
CD4 molecules, 23, 677, 685, 685f, 688, 689 
CD8 molecules, 677, 685, 685f, 688 
CD20 protein, 655 
CD28 protein, 689 


CD40 ligand (CD40L), 689 
Cech, Thomas, 425, 450-451EP 
Cell adhesion, 237-238 
cell-cell, 237-238 
in inflammation, 241-243HP, 242HPf 
in metastasis, 241-243HP, 243HPf 
molecules, 238f, 239 
receptors, in transmembrane signaling, 245 
Cell body, 159, 159f 
Cell-cell adhesion, 237-238 
Cell-cell recognition, 237-238, 237f 
Cell culture, 710-711, 711f 
cell line, 710 
primary, 710 
protoplasts, 710, 711 
secondary, 710 
three-dimensional, 710, 711f 
two-dimensional, 710, 711f 
Cell cycle, 540-552 
phases of, 540-541, 540f 
regulation of, 542-543, 542f, 543f 
in vivo, 541-542 
Cell cycle control, 546-552 
checkpoints, Cdk inhibitors, and cellular 
responses, 550-552, 551f 
protein kinases in, 546-550 
Cdk inhibitors, 548 
Cdk phosphorylation/dephosphorylation, 
547-548, 548f 
controlled proteolysis, 548 
subcellular localization, 548-550, 549f 
Cell destruction, 621-624 
apoptosis, 621-624 
necroptosis, 624 
Cell division, 539-581 
cell cycle, 540-552 
cytokinesis, 552 
defined, 540 
in eukaryotic cells, 10, 12, 12f 
meiosis, 571-580 
mitosis, 552-571 
Cell-free systems, 263, 264f, 712 
Cell fusion, 133, 133f 
Cell interactions, 237-241 
cadherins, 238f, 239-241, 240f, 241f 
cell-cell adhesion, 237-238 
cell-cell recognition, 237-238, 237f 
immunoglobulin superfamily, 238f, 239 
selectins, 238-239, 238f 
Cell line, 710 
Cell-mediated immunity, 665, 685 
Cell membranes. See also Plasma membranes 
bioenergetics in, 82 
in broken-cell preparation, 595-596 
cholesterol in, 47, 47f 
ion channels in, 143-147, 143f 
lactose permease in, 128, 129f 
phospholipids of, 48, 596, 596f 
solute movement across, 139-140, 139f 
transport of substances across, 32 


Cell plate, 255, 570-571, 571f 1-5 
Cell polarity, 136, 136f 
Cell replacement therapy, 17-21HP 


adult stem cells, 18HP, 18HPf 
direct cell reprogramming, 21HP 
embryonic stem cells, 18-19HP, 
19HPf 
induced pluripotent stem cells, 19-21 HP, 
20HPf 
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components of, 21-23 
discovery of, 2-3, 2f 
cell theory, 2-3 
microscopy, 2 
eukaryotic (See Eukaryotes) 
fractionation of contents by differential 
centrifugation, 711-712, 711f 
hijacking, 257, 258 
interactome of, 72, 73 
intercellular interaction in plasma 
membranes, 116 
origin of, 3 
overflow of information through, 406-407, 
407f 
prokaryotic (See Prokaryotes) 
properties of, 3-7 
chemical reactions carried out by, 6 
complexity of, 3-5 
energy acquired and used by, 5-6, 6f 
evolution of, 7, 17f 
genetic information in, 5 
mechanical activities of, 6 
organization of, 3-5, 4f 
receptors of, 6 
reproduction of, 5, 5f 
response to stimuli, 6-7, 7f 
self-regulation of, 6-7, 6f 
robustness of, 6 
size of, 21-23, 22f 
viroids, 25 
viruses, 23-25 


Cell signaling 


autocrine signaling, 583, 583f 
basic elements of, 583-586 
to commit cell to self-destruction, 
621-624 
in apoptosis, 621-624 
in necroptosis, 624 
defined, 583 
endocrine signaling, 583, 583f 
extracellular messengers in, 583-584, 583f, 
584f, 586 
by insulin receptor, 611-613 
to maintain cell survival, 624-625 
paracrine signaling, 583, 583f 
pathways (See Signaling pathways) 


Cell spreading, 234, 234f 
Cell-surface signals, lymphocytes activated by, 
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Cell theory, 2-3 
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Cellular motility, 353-358 
actin and myosin in lamellipodial-based 
movement of fish keratocytes, 357, 
357f, 358 
actin-based motility without cells, 358- 
360EP, 359EPf, 360EPf 
directed cell motility, 354, 355f 
electron micrograph of mouse fibroblast, 
354, 354f 
lamellipodial extension, 356, 356f 
leading edge of motile cell, 354, 355f 
sequence of activities occurring as cell 
crawls over substratum, 354, 354f 
traction forces in migrating fibroblast, 
distribution of, 357, 357f 
Cellular src gene (c-src), 634EP 
Cellulose, 44f, 45-46, 45f 
Cellulose microfibrils, 313, 314f 
Cellulose synthase, 254, 255f 
Cell wall, 2, 8f, 9, 14, 254-255, 254f, 255f 
cell plate of, 255 
matrix of, 254-255, 255f 
hemicelluloses, 254-255 
pectins, 255 
proteins, 255 
primary, 255 
secondary, 255 
CENP-A, 477, 556 
Central Dogma, 407, 633EP 
Centralspindlin, 570 
Centrioles, 321 
Centromeres, 556-558, 557f, 558f, 574f 
Centromes, 477-478, 477f 
Centrosome cycle, 556, 557f 
Centrosomes, 321-322, 321f, 322f 
Cervical cancer, 631, 632, 637, 637f, 660 
CFH gene, 401HP 
CGMP, 595, 603, 620, 620f, 697 
Chalfie, Martin, 697 
Chang, Annie, 723 
Changeux, Jean-Pierre, 149EP 
Chaperonins, 68EP 
Chargaff, Erwin, 380EP 
Charge-coupled device (CCD), 696 
Charge density, 715 
Charged polar side chains, 50, 51f 
Charpentier, Emmanuelle, 737 
Chase, Martha, 380EP 
Checkpoints, 550 
in cell cycle control, 550-552, 551f 
Chk1 checkpoint kinase, 550, 551f 
Chk2 checkpoint kinase, 551, 551f 
spindle assembly checkpoint, 566-567, 566f 
Chemical basis of life, 31-80 
Chemical synthesis of DNA, 722-723 
Chemoautotrophs, 200 
Chemokines, 676-677 
Chemotherapy, 628, 637, 644, 648, 653, 654, 
655, 659 
Chiasmata/chiasma, 574f, 575 


Chimeric antigen receptor (CAR), 655 
Chitin, 44f, 45-46, 45f 
Chk1 checkpoint kinase, 550, 551f 
Chk2 checkpoint kinase, 551, 551f 
Chlamydomonas flagella, 334HP 
Chlorophyll, 204, 204f 
Chlorophyll dimer, 207 
Chloroplasts, 200 
posttranslational uptake of proteins into, 
306-307, 306f 
Rubisco in, 67-70EP 
structure, 201-202, 201f 
Choanoflagellates, 222 
Cholera, 592-593EP 
Cholesterol 
in cell membranes, 47, 47f 
in membrane lipids, 120, 120f 
in phospholipids, 120, 120f 
simple sugars and, 42 
as steroid, importance of, 47 
structure of, 39, 40f, 47f 
Cholesteryl ester transfer protein (CETP), 
303 
Chow, Louise, 420 
Chromatids, 554, 554f, 564 
Chromatin, 9f, 10, 11f, 20HP, 460, 460f 
epigenetic state of, 647 
remodeling complexes, 496 
structure 
DNA replication and, 530-531, 531f 
higher levels of, 467-468, 467f, 468f 
lowest levels of, 465-467, 466f 
Chromatin immunoprecipitation (ChIP), 494 
Chromatography, 712-713 
affinity, 713, 713f 
gel filtration, 712-713, 713f 
high performance liquid, 712 
ion-exchange, 712, 713f 
purification and characterization of proteins 
by, 712-713 
Chromodomains, 472 
Chromosomal (kinetochore) microtubules, 
560-561 
Chromosomal passenger complex (CPC), 570 
Chromosomes 
aberrations and human disorders, 
478-479HP 
as carriers of genetic information, 369-370 
compaction, 544EP, 545EP, 552, 554, 574 
premature, 542 
defined, 368 
discovery of, 368-369 
fertilization, process of, 368-369, 369f 
giant, 371-373, 372f 
heterochromatin, 469-473 
homologous, 369, 370f, 371, 371f 
mitotic, structure of, 473-478, 474f 
polytene, 373 
sex, 382, 469, 577-578HP 
X chromosome inactivation, 470 


Chromosome walking, 731 
Chronic lymphocytic leukemia (CLL), 
655-656 
Chronic myelogenous leukemia (CML), 75, 
75f, 76, 649, 656, 659 
Chymotrypsin, 94, 95f 
Cilia, 327-333, 3328f 
axoneme, 329-331, 330f, 331f 
in development and disease, 333-334HP, 
334HPf 
locomotion, mechanism of, 331-333, 332f, 
333f 
overview of, 327-329, 328f 
structure of, 329-331, 330f 
Ciliopathies, 333-334HP 
Cis cisternae, 276, 277f, 278, 279 
Cis Golgi network (CGN), 276, 277f, 278 
Cisternae, 265-266, 266f, 276, 277f, 278 
Cisternal maturation model, 278-279, 280f 
Citric acid cycle, 173 
Clamp loader, 523, 524f 
Class switching, 683 
Clathrin-coated vesicles, 281, 282f 
dynamin in formation of, 296, 296f 
at trans Golgi network, 286, 286f 
Claude, Albert, 262 
Claudins, 247 
Clonal expansion, 689 
Clonal selection theory applied to B cells, 
666-668, 666f, 667f, 668f 
Cloning of animals, 483-484, 484f 
Closed system, 89 
Clostridium tetani, 671 
Clustered Regularly Interspaced Short 
Palindromic Repeats (CRISPRs), 
435-436 
Coactivators, 496-499 
Coated vesicles, 281-284, 281f 
clathrin-coated vesicles, 281, 282f 
COPI-coated vesicles, 281, 281f, 283f, 284 
COPII-coated vesicles, 281-283, 281f, 282f, 
283f 
phosphoinositides in regulation of, 296, 297f 
studies of, 297-299EP 
Cockayne syndrome (CS), 536HP 
Codons 
decoding, role of tRNAs in, 439-442 
identifying, 437, 439f, 440 
wobble in interaction between codons and 
anticodons, 441f 
Coenzyme Q (or ubiquinone), 180, 181f 
Coenzymes, 90 
Cofactors, 90 
Cohen, Stanley, 604, 723 
Cohesin, 554, 555f 
Colchicine, 310 
Colchicum autumnale, 309 
Coley, William, 654 
Coley’s toxin, 654 
Collagen, 226-228, 227f, 228f 


Colon cancer 
cause of, 631, 632 
EGF receptor and, 655 
genetic mutations in, 642f, 645, 650, 650f, 
658 
NSAIDs and, 632 
research efforts, 628 
tumor-suppressor genes, 640t 
Common intermediate, 88 
Compartmentalization, 115 
Competitive inhibitors, 99, 99f, 100f 
Complement, 664 
Complementarity-determining regions 
(CDRs), 674-675EP 
Complementary DNAs (cDNAs), 485, 486f 
Complementary double helix, 375 
Complexes in electron-transport, 182-186, 
183f 
complex I (NADH dehydrogenase), 
184-185, 184f 
complex II (succinate dehydrogenase), 185 
complex III (cytochrome bc,), 185 
complex IV (cytochrome c oxidase), 
185-186, 186f 
Condenser lens, 693, 693f 
Condensin, 554 
Conditional knockouts, 736 
Conductance, 143, 143f, 144, 145 
Conformation, 53-54, 54f 
alpha (a) helix, 54, 127fn 
defined, 53-54 
of integrins, 232f 
voltage-gated potassium ions, 147, 147f 
Conformational changes 
dynamic, in proteins, 58 
in integral membrane protein, 128-130, 
129f, 130f 
in myosin, 58 
in nicotinic acetylcholine receptor, 150- 
151EP, 151EPf 
revealed in X-ray crystallography, 68EP 
Congenital agammaglobulinemia, 665 
Congenital Diseases of Glycosylation (CDGs), 
274 
Congenital nephrogenic diabetes insipidus, 
142 
Congression, 559 
Conjugate acid, 38 
Conjugate base, 38-39 
Conjugation in prokaryotic cells, 12, 12f 
Connective tissue, 233 
Connexins, 248-249 
Connexons, 249 
Consensus sequence, 411 
Conservative replication, 513, 514f 
Constitutive heterochromatin, 469 
Constitutive secretion, 259 
Contractile ring theory, 568, 569f 
Contrast, 695 
Controlled proteolysis, 548 


Conventional (type II) myosin, 341, 341f, 342f 
Convergence, in cell-signaling pathways, 618, 
618f, 619f 
Copaxone, 670HP 
COPI-coated vesicles, 281, 281f, 283f, 284 
COPII-coated vesicles, 281-283, 281f, 282f, 
283f 
Copper atoms, 180, 181f 
Coprecipitation of proteins, 714 
Copy number variation (CNV), 399-400, 536 
Corey, Robert, 54 
Corneal stroma, 227f 
Cortex, 568 
Corticosteroids, 670HP 
Costimulatory signal, 689 
Co-translational translocation, 268 
Cotransport, 155-156, 156f 
Coulson, A. R., 728-729 
Coupling endergonic and exergonic reactions, 
88 
Coupling factor 1 (F,), 188 
Coupling sites, 183 
Coupling transport 
to ATP hydrolysis, 152-154 
to existing ion gradients, 155-156, 156f 
Covalent bonds, 32-34 
Covalent modification, 109-110 
Cowpox, 671 
Coxiella burnetii, 303-304 
CREB (cAMP response element-binding 
protein), 600, 601f 
Crescentin (CreS), 364 
Creutzfeld-Jakob disease (CJD), 62-64HP 
Crick, Francis, 373, 374-377, 513, 513f, 518, 
633EP 
CRISPR, 737, 738f 
CRISPR RNA (crRNA), 435-436 
Cristae, 170 
Cristae junctions, 170 
Critical-point drying, 708 
Cross bridges, 313, 331, 335, 335f, 346, 347f, 
348-350 
Crossing over, 371, 371f, 372f. See also Genetic 
recombination 
during meiosis, 579-580, 579f 
unequal, 390, 390f 
Cross-linking proteins, 353 
Cross-talk, in cell-signaling pathways, 618, 
618f, 619f 
Cryoelectron tomography (cryo-ET), 704-705 
Cryofixation and use of frozen specimens, 703, 
704-705 
CSA gene, 536HP 
CSB gene, 536HP 
CSI1 (cellulose synthase interactive protein 1), 
313 
CTLA-4, 655, 689 
Currie, A. R., 622 
Cuvette, 716 
Cyanobacteria, 14, 14f, 200-201, 214 


Cyclic adenosine monophosphate. See CAMP 
Cyclic guanosine monophosphate. See CGMP 
Cyclic lipopeptides, 102HP 
Cyclic photophosphorylation, 213 
Cyclin-dependent kinases (Cdks), 546 
Cyclins, 542-543 
cyclin-Cdks in mammalian cell cycle, 549, 
549f 
cyclin-dependent kinases, 547 
levels, in MPF, 543f, 545EP, 545EPf, 546EP 
mitotic, 547 
Cyclosporin A, 670HP 
Cysteine, 50, 51f 
Cystic fibrosis (CF), 157-158HP, 157HPf, 275, 
449 
Cystic fibrosis transmembrane conductance 


regulator (CFTR), 157-158HP, 157HPf 


Cytochrome bc, (complex III), 185 
Cytochrome b,f, 210, 210f 
Cytochrome c, 182, 182f 
Cytochrome oxidase (cytochrome c oxidase), 
185-186, 186f 
activity, summary of, 186f 
as proton pump, 183, 184f 
Cytochrome P450 family, 267 
Cytochromes, 180, 181f 
Cytokine interferon « (IFN-«), 691 
Cytokines, 676-677, 677t 
pro-inflammatory, 670HP 
Cytokinesis 
abscission, 546f, 568 
contractile ring theory, 568, 569f 
defined, 552 
myosin in, 568, 569, 569f 
overview of, 567-570, 568f 
in plant cells, cell plate formation and, 
570-571, 571f 
timing of, 570, 570f 
Cytoplasm, 258, 258f 
Ca** concentration in living cells, 
visualizing, 614-615, 614f, 615f 
of eukaryotic cells, 10, 12f 
partitioning of, 558, 559f 
Cytoplasmic dynein, 317-319, 318f 
Cytoplasmic protein kinases, oncogenes that 
encode, 647 
Cytosine (C), 78, 78f, 373 
Cytoskeleton 
actin-binding proteins, 351-353 
actin-dependent processes during 
development, 361-362, 361f, 362f 
actin filaments, 338-344 
bacterial cytoskeleton, 362-364, 363f 
cellular motility, 353-358 
cilia, 327-333, 3328f 
flagella, 327-333, 329f 
functions, overview of, 310-311, 310t 
intermediate filaments, 335-338 
microtubule-organizing centers, 321-323 
microtubules, 311-312 
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Cytoskeleton (continued) 
motor proteins, 315-319 
muscle organization and contraction, 
344-351 
poisons, drugs and, 309, 310 
proteins, 622 
Cytosol, 8f, 9f, 10, 11f 
Cytotoxic T cells (CTLs), 672-674EP, 677, 
677t, 678, 686 
C, plants, 213-217, 214f 
C, pathway, 218 
C, plants, 218-219, 218f 


Dalton, 5, 23, 23fn 
Daptomycin, 102HP 
Dasatinib, 75f 
Dawson, Martin, 379EP 
DCVax, 655 
Deacylated tRNA, 447-448 
Deafness, myosin mutations and, 344 
Death by neglect, 688, 688f 
Death domain, 622 
De Duve, Christian, 262 
Deep-etching techniques, 707, 707f 
Defensins, 664 
Dehydrogenase, 105 
Delbriick, Max, 513 
Deletion mapping, 493 
Deletions, 479HP 
Dementia, 62HP, 65HP, 66HP, 67HP 
Denaturation, 60, 61f, 382, 383-384 
Dendrites, 159, 159f 
Dendritic cells (DCs), 676, 677f, 678, 685, 686, 
689 
Deoxyadenosine, 373fn 
Deoxyadenosine monophosphate (dAMP), 
373fn, 374f 
Deoxyadenosine triphosphate (dATP), 373fn, 
374f, 518, 729f 
Deoxycytidine, 373fn 
Deoxycytidine triphosphate (dCTP), 373fn, 
518, 729, 729f, 732 
Deoxyguanosine, 373fn 
Deoxyguanosine diphosphate (dGDP), 373fn 
Deoxyguanosine triphosphate (dGTP), 373fn, 
518, 729f, 732f 
Deoxyribonucleic acid (DNA). See DNA 
Deoxyribonucleoside triphosphate precursors 
(dNTPs), 729 
Deoxyribonucleoside triphosphates, 373fn, 
518, 633EP 
dATP, 373fn, 374f, 518, 729f 
dCTP, 373fn, 518, 729, 729f, 732f 
dGTP, 373fn, 518, 729f, 732f 
dTTP, 373fn, 518, 729f 
Deoxyribose, 373 
Deoxythymidine, 373fn 
Deoxythymidine triphosphate (dTTP), 373fn, 
518, 729f 


Depolarization, 160-162, 161f, 162f, 164, 164f, 
165 
Depolymerizing proteins, 353 
Desaturases, 131 
Desensitization, 589 
Desmocollins, 244 
Desmogleins, 244 
Desmotubule, 251 
Desomosomes, 244-245, 245f 
Detoxification in liver, 267 
Development, cilia in, 333-334HP, 334HPf 
Diabetes 
Hemoglobin Alc and, 42 
type 1, 613, 669HP 
type 2, 613 
Diacylglycerol (DAG), 47, 597f, 598 
Diakinesis, 574f, 576 
Diarrhea, osmosis and, 142 
Dicer, 431, 472f 
Dideoxyribonucleoside triphosphates 
(ddNTPs), 729-730, 729f 
Diener, T. O., 25 
Diet, cancer and, 632 
Diethylaminoethyl (DEAE) cellulose, 712 
Differential centrifugation, fractionation of 
cell contents by, 711-712, 711f 
Differential interference contrast (DIC), 696 
Differentiated cells 
mitochondrion in, 15, 16 
organelles in, 15-16 
of plasma membrane, 118, 136, 136f 
Differentiation, 15-16, 16f 
Cdk inhibitors in, 552 
in gametogenesis, 572, 573f 
Diffusion, 21 
defined, 139 
energetics of, 139-140 
facilitated, 151-152, 152f 
free-energy change for, of an electrolyte into 
the cell, 140 
of ions through membranes, 143-147 (See 
also Ion channels) 
of membrane proteins after cell fusion, 133, 
133f 
of potassium ions out of a cell, 140 
rates of membrane proteins, measuring by 
FRAP, 133-134, 134f 
of substances through membranes, 140-141 
through lipid bilayer, 140-143 
of water through membranes, 141-142, 
141f, 142f 
Dimers 
histone, 530, 531f 
pyrimidine, 532 
thymidine, 535 
Diploid, 389 
Diplotene, 574f, 575-576 
Direct cell reprogramming, enzymes in, 
21HP 
Directed cell motility, 354, 355f 


Direct immunofluorescence, 740 
Disassembly 
actin, 339-340, 340f 
intermediate filaments, 335-336, 336f 
Diseases/disorders 
autoimmune, 668-671HP 
cilia in, 333-334HP, 334HPf 
genetic 
cancer as, 636-638 
linked to cadherins, 240, 241f 
inherited 
defects in ion channels and transporters 
as cause of, 157-158HP, 158HPt 
mutations responsible for, 62, 64HP, 
66HP, 67HP 
Dislocation, 275 
Dispersive replication, 513, 514f 
Dissociation, 38 
Dissociation constant (K (> 86 
Dissolution of salt crystal, 35, 35f 
Distal promoter elements, 495 
Disulfide bridges, 50, 60 
Divergence, in cell-signaling pathways, 618, 
618f 
DNA, 77-78, 77f 
base composition, 374 
cDNA, 485, 486f 
chemical synthesis of, 722-723 
cloning, 724-726, 725f 
eukaryotic DNAs in bacterial plasmids, 
725-726, 725 
eukaryotic DNAs in phage genomes, 726, 
726f 
PCR in, 728 
synthesizing cDNAs for, in plasmid, 
731-732, 731f 
vectors in, 724, 731, 736-737 
denaturation, 382, 383-384 
double helix, 375, 376f, 377, 382, 384 
double-stranded, 377, 384f, 385f, 393f, 421, 
422f, 423, 435, 456, 460, 513, 518, 524f, 
669HP, 721, 723, 726, 728, 729-730, 
735 
enzymatic amplification of, by PCR, 
726-728 
for cloning or analysis, 728 
for comparing DNA molecules, 728 
for quantifying DNA or RNA templates, 
728 
for testing for presence of specific DNA 
sequences, 728 
epigenetics, 480 
fragmentation of, 723, 731 
functions of, 375, 376, 376f 
gene editing and silencing, 734-738 
as genetic material, evidence for, 405-406 
libraries, 730-732 
cDNA, 731-732, 732f 
genomic, 731 
microarrays, 485-487, 486f 


mutations in, 27EP 
nucleotide, 373, 373fn, 374, 374f 
oncogenic viruses, 633-635EP 
purines, 373 
pyrimidines, 373 
rDNA, 413 
recombinant (See Recombinant DNA) 
renaturation, 384-387, 384f 
separation of, by gel electrophoresis, 719, 
720f 
sequencing, 728-730, 729f 
single-stranded, 384, 385f, 386f, 421, 422f, 
518, 521, 521f, 528, 529, 530f, 550, 579, 
722 
sites involved in regulating transcription, 
492-494, 492f, 493f 
structure of, 373-377, 374f 
supercoiling, 381-382, 382f 
transfer into eukaryotic cells and 
mammalian embryos, 732-734 
transgenic animals, 733-734, 733f 
transgenic plants, 734, 734f 
underwound, 381, 382f 
Watson-Crick proposal, 374-377 
DnaA, 521, 529t 
DnaB, 521, 527, 529t 
DNA-binding domain, 489 
DnaC, 521, 529t 
DNA-dependent RNA polymerases. See RNA 
polymerase 
DNA fingerprinting, 385, 385f 
DNA footprinting, 493 
DNA glycosylase, 532-534, 534f 
DNA gyrase, 517f, 518, 529t 
DNA libraries, 730-732 
cDNA, 731-732, 732f 
genomic, 731 
DNA ligase, 520, 520f, 529t 
DNA melting, 383-384, 384f 
DNA methylation, 500-501, 501f, 647 
DNA methyltransferases, 500 
DNA polymerases, 523-526 
activity of, 519f 
exonuclease, 523, 525f 


templates and nontemplates for, 518, 518f 


base pairs, proper and mismatched, 
524-526, 525f 
clamp loader, 523, 524f 
properties of, 518-519, 518f, 519f 
structure and functions of, 523-526, 523f, 
524f 
ensuring high fidelity, 523-526, 525f 
exonuclease activities, 523, 525f 
3'5’ exonuclease of, activation of, 
524-525, 525f 
III holoenzyme, 523, 523f 
DNA polymerases I, 518, 523, 525, 525f 
DNA polymerases III holoenzyme, 523, 523f 
DNA polymerases a, 529, 529f 
DNA polymerases B, 529, 529f, 534 


DNA polymerases y, 529 
DNA polymerases 6, 529, 529f, 530, 533f, 535 
DNA polymerases ¢, 529, 530, 533f, 535 
DNA polymerases n, 535 
DNA rearrangements, 681-683, 682f, 683f 
DNA repair, 531-535 
base excision repair, 532-534, 533f, 534f 
deficiencies, consequences of, 536-537HP 
double-strand breakage repair, 534, 535f 
mismatch repair, 534 
mutant genes involved in, 649 
nucleotide excision repair, 532, 533f 


stalled replication fork and, 535-536, 535fn 


translesion synthesis and, 535 
xeroderma pigmentosum (XP) and, 535 
DNA-repair disorders, 536-537HP 
DNA replication, 512-531. See also DNA 
polymerases 
alternate schemes of, 514f 
in bacterial cells, 516-520 
bidirectional, 517, 517f 
chromatin structure and, 530-531, 531f 
in eukaryotic cells, 526-530 
fidelity during, 523-526, 525f 
Meselson-Stahl experiment, 513-515, 515f 
origin of, 517 
overview of, 513 
semiconservative, 513-515, 514f, 515f 
semidiscontinuous, 519-520, 519f 
in viruses, 526 
Watson-Crick proposal, 513, 513f, 518 
DNA sequencing, 728-730, 729f 
DNA tumor viruses, 631, 635EP, 641 
Docking, 288, 289, 289f 
Doherty, Peter, 672EP 
Dolichol phosphate, 274 
Dolly (cloned sheep), 483, 484f 
Domains 
of helices in Kv channel subunits, 145, 146f 
of life, 28EPf 
membrane 
cell polarity and, 136, 136f 
transmembrane, 127-128, 127f, 128f 
pore, in ion channel subunits, 145, 146f 
prokaryotic cell, 13-14 
protein, 57-58, 57f 
SH3, 60, 60f, 72, 73f 
taxonomic schemes, 27EP, 28EP 
voltage-sensing, in ion channel subunits, 
145, 146f 
Don Cassel and Zvi Selinger, 591EP, 592EP 
Dopamine transporter (DAT), 165 
Double-blind study, 65 
Double helix, 375, 376f, 377, 382, 384 


Double-stranded breaks (DSBs), 534, 535f, 737 


Double-stranded DNA, 377, 384f, 385f, 393f, 
421, 422f, 423, 435, 456, 460, 513, 518, 
524f, 669HP, 721, 723, 726, 728, 
729-730, 735 

Double-stranded RNA (dsRNA), 430-431 


Doudna, Jennifer, 693, 737 
Down syndrome (trisomy 21), 479HP 
Dreyer, William, 681 
Driesch, Hans, 6 
Drones, 552 
Drosophila, 16, 17f, 370-373, 370f, 663-664, 737 
Drosophila K* Shaker ion channel, 146f 
Drugs 
cytoskeleton and, 309, 310 
development of, 75, 111, 432HP 
Dual-label fluorescence, 697, 698f 
Duplication, 389-391, 479HP 
of DNA sequences, 390, 390f 
globin genes, evolution of, 390-391, 391f 
polyploidization, 389-390 
Dwarfism, 227, 536HP 
Dynamic changes with proteins, 58, 58f 
Dynamic instability, 326-327, 327f 
Dynamic nonequilibrium, 89 
Dynamic protein folding, 60-62 
Dynamin, 296, 296f 
Dynein, 317-319, 318f 


E. faecium, 101HP, 102HP 
Early endosomes, 300 
Earth ’s biogeologic clock, 9f 
Ebola virus, 661, 662 
Ectoplasmic face, 125f 
Edidin, Michael, 133 
Editing, gene, 734-738 
engineered nucleases in, 737-738, 738f 
knockout mice, 735-736, 736f 
RNA interference, 736-737, 737f 
in vitro mutagenesis, 735 
E (exit) site, 444f, 445 
Effector, 584 
Efflux pump, 102HP 
Eflux, 139, 140, 160 
EGF receptor (EGFR), 658 
Ehler-Danlos syndromes, 227 
Eicosanoids, 586 
EI Fa, 275 
18S rRNA, 27EP 
Electrochemical gradient, 140, 161, 186 
Electron cryomicroscopy (cryo-EM), 718-719, 
718f 
Electron crystallography, 719 
Electron density map, 150EP, 150EPf 
Electronegative atom, 32 
Electron micrographs 
of apoptotic cells, 621f 
of ATP synthesis, 188, 188f, 192 
of cell spreading, 234f 
of centrosome, 323f 
of coated pits, 294, 294f 
of coated vesicles, 281f 
of cristae, 170, 171f 
of cytokinesis, 571, 572f 
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of DNA double helix, 521, 521f 
electron microscopes used in, 701, 708, 708f 
of eukaryotic cells, 10, 11f 
of gap junctions, 248, 248f, 252EP 
of glycogen, 44f, 45 
of GroEL, 68EP, 69EPf 
of hnRNAs and mRNAs, 421f 
of intron in globin gene, 422f 
of lysosomes, 291, 291f 
of MAPs, 313, 313f 
of membranes of electric organs, 
149-150EP, 150EPf 
of movement of materials through nuclear 
pore, 462, 462f 
of nAChR, 149-150EP, 150EPf 
of negatively stained specimens, 705, 705f 
of nuclear pore complex, 462, 463f 
of plasma membrane, 115, 115f 
of polyribosomes, 449f 
of ribosome, 79f 
of sarcomere, 347f 
of snoRNAs, 415, 415f 
of supercoiled DNA, 382f 
of synaptonemal complex, 574-575, 575f 
of transport through Golgi complex, 
279-280, 280f 
of TRPV ion channel, 126f 
Electron paramagnetic resonance (EPR) 
spectroscopy, 129, 130f 
Electrons in photosystem II, 207-210, 208f 
flow of electrons from PSII to 
plastoquinone, 207, 208, 209f 
flow of electrons from water to PSII, 
208-210, 209f 
primary electron acceptor, 207 
Electron-transfer potential, 179 
Electron transport 
carriers, 180-182, 181f 
copper atoms, 180, 181f 
cytochromes, 180, 181f 
electron-tunneling pathways, 182, 182f 
flavoproteins, 180, 181f 
inhibitors to determine sequence of, 182, 
182f 
iron-sulfur proteins, 180, 181f 
ubiquinone (UQ, or coenzyme Q), 180, 
181f 
chain, 180-182, 182f 
complexes, 182-186, 183f 
complex I (NADH dehydrogenase), 
184-185, 184f 
complex II (succinate dehydrogenase), 
185 
complex III (cytochrome bc,), 185 
complex IV (cytochrome c oxidase), 
185-186, 186f 
in photosynthesis, 211-212, 212f 
Electron transport chain, 105, 180-182, 182f 
Electron-tunneling pathways, 182, 182f 


Electrophoresis, 715. See also Polyacrylamide 
gel electrophoresis (PAGE) 
Electroporation, 733 
Electrospray ionization (ESI), 717 
Electrotonic synapse, 251EP 
Embryonic development 
cadherins in, 239-240, 240f 
changes in cell shape during, 362, 363f 
Embryonic stem cells, 18-19HP, 19HPf, 
735-736 
Emerson, Robert, 205 
Empty magnification, 694 
Enantiomers. See Stereoisomerism 
ENCODE Project, 396 
Encoding genetic information, 436-439 
codons, identifying, 437, 439f, 440 
genetic code, 436-437, 437f, 438f 
Endergonic processes, 85 
Endergonic reactions, coupling, 88 
Endocannabinoids, 165 
Endocrine signaling, 583, 583f 
Endocytic pathway, 258f, 259, 300-303, 301f 
Endocytosis, 293-296 
phosphoinositides in regulation of coated 
vesicles, 296, 297f 
receptor-mediated, 294-296, 294f, 
297-299EP 
Endomembranes, study of, 260-265 
autoradiography, 260, 261f 
cell-free systems, 263, 264f 
green fluorescent protein (GFP), 260, 261, 
262f 
mutant phenotypes, 263-264, 264f 
RNA interference (RNAi), 265, 265f 
subcellular fractions, analysis of, 262, 
263f 
Endomembrane system 
biosynthetic pathway, 258, 258f 
cytoplasm, 258, 258f 
endocytic pathway, 258f, 259 
endomembranes (See Endomembranes, 
study of) 
endoplasmic reticulum (See Endoplasmic 
reticulum (ER)) 
organelles, 258-260, 259f 
overview of, 258-260 
secretory pathway, 258-259, 258f 
Endonuclease 
in Alu sequence, 393 
AP, 533-534 
caspase activated DNase, 622 
Dicer, 472f 
FEN-1, 529, 529f 
in processing of eukaryotic mRNAs, 424, 
424f 
ribonuclease P, 416 
in RNA interference, 431f 
Zucchini, 435 
Endoplasmic reticulum (ER), 8f, 9f, 10, 10t, 
11f, 27EP, 257, 265-276 


asymmetry of, 271, 271f 
ERAD, 275 
export signals, 281, 283 
Golgi complex, 276-284, 277f 
membrane biosynthesis in, 271-272, 272f, 
273f 
misfolded proteins, destruction of, 275, 275f 
overview of, 265 
partitioning during mitosis, 558 
protein transport across, 268, 268fn 
rough endoplasmic reticulum, 265-266, 
266f, 267-271, 267f 
smooth endoplasmic reticulum, 265-267, 
267f 
subcompartments, 265-271 
Endosomes, 300 
Endosymbiont theory, 26EP, 27EP 
Endothermic reactions, 83 
Energetics 
of diffusion, 139-140 
of filament sliding, 349-350, 349f 
of Na*/glucose cotransporter, 155 
of photosynthesis, 203, 203f, 206, 206f 
of solute movement, 139-140, 139f (See also 
Diffusion) 
Energy. See also Metabolism 
acquired and used by cells, 5-6, 6f 
change in internal, 83, 83f 
coupling endergonic and exergonic 
reactions, 88 
covalent bonds, 32-34 
defined, 82 
endothermic reactions, 83 
entropy, 83-84, 84f 
equilibrium vs. steady-state metabolism, 
88-89 
exothermic reactions, 83 
free, 85-88 
reducing power, 109 
spontaneous transformations of, 85-87 
Energy transduction 
defined, 82 
examples of, 82f 
in plasma membranes, 116 
Engelmann, T., 201 
Engineered nucleases in gene editing and 
silencing, 737-738, 738f 
Enhancers, 495 
Enthalpy, 85, 85f 
Entropy, 83-84, 84f 
when water freezes, 85, 85f 
Enzymatic amplification of DNA by PCR, 
726-728 
Enzyme replacement therapy, 287HP 
Enzymes. See also Substrates 
activation energy barrier, overcoming, 
90-92, 91f, 92f 
active site of, 92-93, 93f 
activity in metabolic regulation, altering, 
109-110 


altering activity 
by allosteric modulation, 110 
by covalent modification, 109-110 
as biological catalysts, 89-93 
catalytic activity, 90, 91t, 93-96 
changing substrate reactivity, 94, 95f 
inducing strain in substrate, 94-96, 95f 
substrate orientation, 94, 94f 
defined, 6, 90 
in direct cell reprogramming, 21HP 
inhibitors, 98-100, 99f, 100f 
competitive inhibitors, 99, 99f, 100f 
irreversible inhibitors, 98-99 
noncompetitive inhibition, 99-100, 100f 
kinetic activity, 97-100 
enzyme inhibitors, 98-100, 99f, 100f 
Michaelis-Menten model of, 97-98 
of metabolism, 6, 29EP 
properties of, 90 
replacement therapy, 287HP 
restriction, 723, 724f 
specificity of, 90, 93 
transcribing, in eukaryotic cells, 411, 412t 
Enzyme-substrate (ES) complex, 92, 93f 
Epidemiologists, 631 
Epidermal growth factor (EGF), 604, 609, 618f, 
619f 
Epidermolysis bullosa, 237 
Epigenetics, 480 
Epigenomes, 480 
Epinephrine, 587t, 590-592EP, 591EPf, 
591EPt, 595, 599, 600, 601, 601f, 601t 
Epithelialmesenchymal transition (EMT), 240 
Epithelial tissue, 233 
Epitope (antigenic determinant), 680 
Epstein-Barr virus, 631 
Equilibrium 
in coupling endergonic and exergonic 
reactions, 88 
enzymes and, 90 
in free-energy changes, 85-88 
glycolysis and, 105, 106f 
vs. steady state metabolism, 88-89, 89f 
Equilibrium centrifugation, 720-721, 721f 
Equilibrium constant (K,,)> 85-86, 87, 87t 
dissociation constant, 86 
ER-associated degradation (ERAD), 275 
ERGIC (endoplasmic reticulum Golgi 
intermediate compartment), 276 
Erikson, Ray, 634EP 
Erythroblastosis fetalis, 680-681 
Erythrocyte ghosts, 137, 138f 
Erythrocyte plasma membrane, 137-139, 138f 
asymmetric distribution of phospholipids 
in, 122f 
carbohydrates of glycolipids of, 122, 123f 
proteins of, 137, 138f 
skeleton, 137, 138f, 139 
Erythroleukemias, 635EP 
Escherichia coli, 13, 16, 17f, 22, 29EP, 59 


Escherichia coli replication 
B clamp, 523, 524f, 529, 529t 
eukaryotic replication fork, 528, 529-530, 
529f 
leading and lagging strands, 519-520, 519f, 
520f 
origin of, 517 
proteins required for, 516, 529t 
replication forks, 517, 518 
unwinding problem, 518 
ESCRT complexes, 301, 568 
Ester bonds, 40 
Estrogen, 47, 47f, 632 
Eubacteria. See Bacteria domain 
Eucarya, 28EP, 28EPf 
Euchromatin, 469 
Euglena, 27EP, 27EPn 
Eukaryotes, 8-13, 15-17 
cell division in, 10, 12, 12f 
characteristics that distinguish, 8-9f, 8-13 
cytoplasm of, 10, 12f 
DNA replication in, 526-530 
experimental demonstration of, 527, 527f 
initiation of, 526-527 
nuclear structure and, 530, 530f 
replication forks, 528, 529-530, 529f 
restricting to once per cell cycle, 527-528, 
528f 
DNA transfer into, 732-734 
transgenic animals, 733-734, 733f 
transgenic plants, 734, 734f 
flagella, 12, 13, 13f 
gene regulation in, overview of, 483-484, 
484f 
initiation of translation in, 443-444, 443f 
model organisms, 16, 17f 
origin of, 26-29EP, 26EPf 
plasma membranes of, carbohydrates in, 
122 
prokaryotic cells compared to, 9-13, 10t 
RNA procesing in, 411-412, 412f 
single-celled (unicellular), 15-16, 15f 
structure of, 10, 11f 
transcribing enzymes in, 411, 412t 
transcrition in, 410-412, 412f 
types of, 15-16 
Eukaryotic DNAs, cloning 
in bacterial plasmids, 725-726, 725f 
eukaryotic DNAs in phage genomes, 726, 
726f 
Eukaryotic genome, packaging of, 465-469 
higher levels of chromatin structure, 
467-468, 467f, 468f 
nucleosomes, 465-467, 466f 
Evans, Martin, 735 
Evolution 
of cells, 7, 17f 
disappearance of mitochondrion during, 
27EPn 
Evolutionary clock, 27EP 


Exchangers, 156, 157HP I-11 
Excitation-contraction coupling, 350-351, 
350f, 351f 
Excitation energy, 205-206, 206f 
Excited state, 204 
Executioner caspase, 623 
Exercise in aerobic metabolism, role of, 
177-178HP 
Exergonic processes, 85 
Exergonic reactions, coupling, 88 
Exocytosis, 290-291 
Exogenous antigens, 685, 686 
Exome, 367, 399 
Exonic splicing enhancers (ESEs), 425, 427f 
Exon-junction complex (EJC), 448 
Exons, 420 
Exon shuffling, 429 
Exonuclease, 524-525, 525f 
Exonuclease activities of DNA polymerases, 
523, 525f 
Exothermic reactions, 83 
Expansins, 255 
Experimental allergic encephalomyelitis, 
669HP 
Export signals 
ER, 281, 283 
nuclear (NES), 465, 548 
Extracellular interactions, overview of, 223, 
223f 
Extracellular matrix (ECM), 224, 225f, 226f 
basement membrane in, 224, 225f 
of cartilage cells, 224, 225f 
components of, 226-231 
collagen, 226-228, 227f, 228f 
fibronectin, 229, 230f, 231f 
laminin, 229, 231f 
proteoglycan, 228-229, 228f 
defined, 224 
dynamic properties of, 231 
glycocalyx in, 224, 224f 
integrins (See Integrins) 
macromolecular organization of, 
224, 226f 
Extracellular messengers, 583-584, 583f, 584f, 
586 
formation of cAMP in response to addition 
of, 595-598, 595f (See also Second 
messengers) 
insulin as, 611 
receptors, 586 
amino acids and amino acid derivatives, 
586 
eicosanoids, 586 
gases, 586 
polypeptides and proteins, 586 
steroids, 586 
Extremely drug resistant (XDR), 100HP 
Extremophiles, 13 
Extrinsic pathway in apoptosis, 622-623, 
623f 
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F 


Facilitated diffusion, 151-152, 152f 
Facilitative transporter, 151, 156f 
F-actin. See Actin filaments 
Factor Xa, 93f 
Facultative heterochromatin, 469, 469f 
FAK (focal adhesion kinase), 233, 235, 235f 
Familial adenomatous polyposis coli (FAP), 
645 
Familial hypercholesterolemia (FH), 
297-299EP, 298EPf 
Families (or superfamilies) of molecules, 76 
Fats, 46-47, 46f, 47f£ 
Fatty acids, 46-47, 46f, 47f, 48f 
Fawcett, Dan, 329 
Feedback inhibition, 110, 110f 
FEN-1, 529, 529f, 529t 
Fermentation, 108, 108f 
Fernandez-Moran, Humberto, 188 
Ferredoxin, 211 
Feulgen stain, 695, 695f 
Fibrillar collagens, 226 
Fibroblast 
electron micrograph of mouse, 354, 354f 
traction forces in migrating, distribution of, 
357, 357£ 
Fibronectin, 229, 230f, 231f 
Fibrous proteins, 56-57 
Fidelity during DNA replication, 523-526, 
525f 
Fields, Stanley, 714 
“Fight-or-flight” hormone. See Epinephrine 
Filaments 
actin (See Actin filaments) 
intermediate (See Intermediate filaments) 
microtubles (See Microtubules) 
sliding filament model of muscle 
contraction, 346, 347f, 348-351 
thick, 346, 347f 
thin, 346, 347f 
Filament-serving proteins, 353 
Filament sliding, energetics of, 349-350, 
349f 
Fire, Andrew, 430 
First eukaryotic common ancestor, 26EP, 
26EPf, 27EP 
First law of thermodynamics, 82-83 
Fission, mitochondrial, 169, 170f 
5’ end (five-prime end), 374, 374f 
5S rRNA, 415-416 
Fixative, 695 
Flagella, 8f, 9f, 12, 13, 13f, 327-333, 329f 
axoneme, 329-331, 330f, 331f 
intraflagellar transport, 331, 332f 
locomotion, mechanism of, 331-333, 332f, 
333f 
overview of, 327, 329 
structure of, 329-331, 329f 
Flavell, Richard, 420 
Flavoproteins, 180, 181f 


Flemming, Walther, 552 
Flippases, 132, 272 
Fluidity, membrane. See Membrane fluidity 
Fluorescence, 696 
Fluorescence in situ hybridization (FISH), 386, 
386f 
Fluorescence microscopy, 696-701 
dual-label fluorescence, 697, 698f 
fluorescence recovery after photobleaching, 
698, 698f, 699f 
fluorescence resonance energy transfer, 
697-698, 698f 
green fluorescent protein, 697f 
laser scanning confocal microscopy, 699, 
700f 
light sheet fluorescence microscopy, 
700-701, 701f 
super-resolution fluorescence microscopy, 
699-700, 701f 
Fluorescence recovery after photobleaching 
(FRAP), 133-134, 134f, 698, 698f, 
699f 
Fluorescence resonance energy transfer 
(FRET), 69-70EP, 129, 697-698, 698f 
Fluorophores, 696 
Flurizan, 66HP 
FMR1 allele, 389HP 
Focal adhesions, 234-235, 235f, 236f 
Folkman, Judah, 659 
Follicular B-cell lymphomas, 648 
F, portion of ATP synthase in ATP synthesis, 
192-193, 193f 
Forster resonance energy transfer (FRET), 
697-698, 698f 
Forward genetics, 734-735 
Fossil fuels, 199 
Fox, George, 27EP, 28EP 
FOXP2 gene, 397, 398 
FOXP3 gene, 678 
F pilus, 12f 
Fractionation 
of cell contents by differential 
centrifugation, 711-712, 711f 
of nucleic acids, 719-721 
separation of, by ultracentrifugation, 
719-721, 721f 
separation of DNAs by gel 
electrophoresis, 719, 720f 
Fraenkel-Conrat, Heinz, 79 
Fragile X syndrome, 389HPf 
Fragmentation 
of DNA, 723, 731 
in mass spectrometer, 717fn 
Francois Jacob, 406 
Free-energy change, 85-88 
in chemical reactions, 85-86 
in diffusion of an electrolyte into the cell, 
140 
in metabolic reactions, 86-88 
standard conditions in, 86-87, 
86fn, 87t 


Free radicals 
aging and, 34-35HP 
defined, 34 
Freeze etching, 706, 707, 707f 
Freeze-fracture replication, 124, 125f, 246, 
247f, 705, 706, 706f 
Fridovich, Irwin, 34HP 
Frozen specimens, cryofixation and use of, 
703, 704-705 
Fructose, 42, 42f 
D-fructose, 42f 
Fructose 1,6-bisphosphatase, 110-111, 111f 
Frye, Larry, 133 
FTD, 67HP 
Fukushima Daiishi nuclear disaster, 220HP 
Functional groups, 40, 40t 
Furchgott, Robert, 620 
Furshpan, Edwin, 251EP 
Fusion, mitochondrial, 169, 170f 


G 
Gain-of-function mutation, 388HP 
Gajdusek, D. Carleton, 63HP 
Gall, Joseph, 386 
y-secretase, 64-65HP, 65HPf, 66HP 
y-tubulin, 322-323, 323f 
y-tubulin ring complexes (y-TuRCs), 323, 323f 
Gamete, 368 
Gametic meiosis, 572, 573f, 574 
Gametophyte, 552, 573 
Gamma-aminobutyric acid (GABA), 164 
Gamma radiation, 709 
Gap-junction intercellular communication 
(GJIC), 249, 251-253EP, 251EPf, 
252EPf, 253EPf 
Gap junctions, 247-250, 248f 
Gases as extracellular carrier, 586 
Gated ion channels, 144-147 
defined, 144 
ligand-gated channels, 144 
mechano-gated channels, 144 
voltage-gated channels, 144-147 
Gaucher's disease, 287-288HP 
Gay, Leslie, 379EP 
GC box, 492-493 
Gel electrophoresis. See also Polyacrylamide 
gel electrophoresis (PAGE) 
separation of DNAs by, 719, 720f 
two-dimensional, 716, 716f 
Gel filtration chromatography, 712-713, 713f 
Gelles, J., 320EP 
Gemmata obscuriglobus, 26EP, 27EPf 
Gene duplication, 390, 390f 
Gene expression, 406 
in bacteria, control of, 456-460 
chromosomal aberrations and human 
disorders, 478-479HP 
epigenetics, 480 
eukaryotes, overview of gene regulation in, 
483-484 


eukaryotic genome, packaging of, 465-469 
gene activity, profiling, 485-488 
heterochromatin, 469-473 
mitotic chromosome, structure of, 
473-478 
nuclear envelope, structure of, 460-465 
nucleus as organized organelle, 480-483 
posttranlational control, determining 
protein stability, 509-510 
RNA processing control, 503-504 
transcription, DNA sites involved in 
regulating, 492-494 
transcriptional activation (See 
Transcriptional activation) 
transcriptional repression, 400f, 499-502 
transcription factors (See Transcription 
factors) 
translational control, 505-508 
Gene-expression analysis, 651-654, 652f, 653f, 
654f 
Gene number, 389, 390f, 394 
Gene regulatory protein, 457 
Genes 
bacterial operon for regulation of, 457, 458f 
chemical nature of, 377-381EP 
defined, 368 
discoveries on nature of, 367f 
duplication of, 390, 390f 
editing and silencing, 734-738 
engineered nucleases in, 737-738, 738f 
knockout mice, 735-736, 736f 
RNA interference, 736-737, 737f 
in vitro mutagenesis, 735 
globins (See Globin genes) 
molecular structure of, mutations and, 5 
profiling activity of, 485-488, 485f 
DNA microarrays, 485-487, 486f 
RNA sequencing, 487-488, 487f 
pseudogenes, 391 
split, 420-423 
as unit of inheritance, 367-368 
Gene therapy 
25, 158HP, 197HP, 732 
Genetically engineered mice, 34-35HP, 
65-66HP 
Genetic code, 436-437, 437f, 438f 
Genetic diseases/disorders 
cancer as, 636-638 
linked to cadherins, 240, 241f 
Genetic information 
in cells, 5 
decoding codons, role of tRNAs in, 439-442 
encoding, 436-439 
translation 
elongation and termination, 445-448 
initiation of, 442-445 
Genetic polymorphisms, 398, 400HP, 595HP 
Genetic recombination, 371, 371f, 372f 
in DNA repair, 534 
heterochromatin as inhibitor of, 469 
hot spots, 402HP, 580 


during meiosis, 572, 574, 575 579HP, 575f, 
579-580, 579f 
visible evidence of, 576f 
Genetics 
analysis in Drosophila, 370-373, 370f 
basis of being human, 397-398 
crossing over, 371, 371f, 372f 
mutagenesis, 371-373, 372f 
variation in human species population, 
398-400 
copy number variation, 399-400 
DNA sequence variation, 398-399 
structural variation, 399, 399f 
Genome analysis, 390 
Genomes 
of African populations, 385-386 
application of genomic analyses to medicine, 
400-403HP 
bacterial, organization of, 456 
comparative, 396 
comparisons, 395f 
complexity of, 382-387 
defined, 367 
duplication, 389-391 
dynamic nature of, 391-394 
mobile genetic elements in evolution of, 
393-394 
sequencing, 394-396 
Genome sequencing, 222, 223 
Genome-wide association studies (GWASs), 
400-401HP 
Genome-wide location analysis, 493-494, 493f 
Genome-wide RNA interference (RNAi) 
screens, 72 
Genomic imprinting, 501-502 
Genomic libraries, 731 
Genomic Sequencing and Newborn Screening 
Disorders Project, 402-403HP 
Gerhart, John, 545EP 
Germinal vesicle, 543EP 
Gey, George, 3, 627, 628 
Gey, Martha, 3 
GFP gene, 697 
Ghosts, erythrocyte, 137, 138f 
Giant chromosomes, 371-373, 372f 
Giardia, 27EPn 
Gibbs, J. Willard, 85 
Gilenya, 671HP 
Gilman, Alfred, 593EP 
Gleevec, 75, 75f, 76, 99, 656-657, 657f, 659 
Gliomas, 646, 655, 659. See also Brain cancer 
Global genomic pathway, 532 
Global warming, 219-220HP 
Globin genes 
B-globin, 721 
evolution of, 390-391, 391f 
intron in, electron micrographs of, 422f 
intron in, visualizing, 421, 422, 422f 
Globular proteins, 57 
Glucagon, 591EP, 591t, 592EPf, 595-596, 599, 
600, 601f, 601t, 611 


Glucocorticoid receptor (GR), 494, 495f 1-13 
Glucocorticoid response element (GRE), 494, 

495f 
Gluconeogenesis, 110-111, 111f Zz 
Glucose o 


catabolism of, 104, 105 
in cellulose, 44f, 45 
gluconeogenesis vs., 110-111, 111f 
in simple sugars, 42, 42f 
stability of, 90 
in starch, 44f, 45 
D-glucose, 42, 42f 
a-D-glucose, 42, 42f 
Glutamic acid, ionization reactions of, 50, 52f 
Glutathione-S-transferase (GST) tags, 713 
GLUT4, 612, 612f, 619 
Glycans. See Carbohydrates 
Glyceraldehyde, 42, 43f 
D-glyceraldehyde, 42, 43f 
L-glyceraldehyde, 42, 43f 
Glyceraldehyde 3-phosphate dehydrogenase, 
105 
Glyceraldehyde 3-phosphate (GAP), 214-215, 
214f 
Glycerol phosphate shuttle, 175, 176f 
Glycine, 50, 51f 
Glycocalyx, 224, 224f 
Glycogen and starch, 44f, 45 
Glycogen synthase kinase-3 (GSK-3), 
612-613 
Glycolipids in membrane lipids, 118f, 
119-120, 119f, 120f, 122, 286-288HP 
Glycolysis, 105-108 
in aerobic metabolism, 172-173, 174f 
ATP production in, 105-108 
catabolism of glucose, 104-105 
fermentation and, 108, 108f 
free-energy profile of, in cardiac muscle 
tissue, 106f 
steps of, 105, 106f 
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GTPase-activating, 608 
heat shock, 67EP, 68EP 
hemagglutinin, 74, 74f 
heterochromatic, 472 
homologous, 76-77 
identified by mass spectrometry, 71-72, 71f 
IFT, 334HP 
integral, 123-125, 123f 
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freeze-fracture replication, 124, 125f 
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dynein, 317-319, 318f 
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multiprotein complex, 59-60, 59f 
mutated, 655 
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novel, production of, 74-75 
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primary structure of, 53 
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656-658, 657t 
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protein-protein interactions, 59-60, 59f, 
60f 
Raf, 609 
ratio of lipid to, 118 
RecA, 579 


in RER synthesis 
of integral membrane proteins on ER- 
bound ribosomes, 270-271, 271f 


processing of newly synthesized proteins, 


270 
of proteins on membrane-bound vs. free 
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single-stranded DNA-binding, 521, 521f, 
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SR, 428 
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structure of, determining, 717-719 
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tertiary structure of, 55-58 
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protein domains, 57-58, 57f 
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56-57, 57f£ 
transmembrane, 123-124 
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X-ray crystallography, 58 
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protein crystals of TCR suitable for, 
674EP 
structures in, compared to proteins, 58 
Protein-targeting drug, development of, 75, 
75 
Protein-tyrosine kinases, 603-604 
non-receptor, 604 
receptor, 604, 618f 
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in signal transduction, 603-607 
downstream signaling pathways, 
activation of, 604, 606-607 
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Protein-tyrosine phosphorylation (continued) 
phosphotyrosine-dependent protein- 
protein interactions, 604 
protein kinase activation, 604 
receptor dimerization, 604 
Proteoglycan, 228-229, 228f 
Proteolysis, controlled, 548 
Proteome, 71, 72 
Proteomics, 71-72 
Protocells, 31 
Protofilaments, 312, 312f 
Proton gradient, 192-193 
F, portion of ATP synthase in ATP 
synthesis, role of, 192-193, 193f 
using, 192-193 
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Proto-oncogene, 638, 639, 639f 
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Protoplasts, 710, 711 
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P700, 207, 210-211, 210f 
P680, 207, 208f 
PTEN gene, 646 
P-type pumps, 153-154 
Pulse-chase, 260 
Purification of MPF, 545-546EP 
Purines, 78, 78f, 373 
Pyrimidine dimers, 532 
Pyrimidines, 78, 78f, 373 
Pyrimidine synthesis inhibitor, 671HP 
Pyruvate, anaerobic oxidation of, 108 
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Ras-MAP kinase pathway, 607-610 
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kinase pathway 
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Receptor-mediated endocytosis (RME), 
293-296, 294f, 297-299EP 
Receptor protein-tyrosine kinases (RTKs), 586, 
604 
signal transduction by, 586 
Receptors, 6, 116 
acetylcholine, 147-151EP, 163, 164f 
B-cell, 683-684, 689 
cannabinoid, 165 
cell adhesion, in transmembrane signaling, 
245 
in cell signaling, 583-584 
chimeric antigen, 655 
EGF, 658 
endocytosis, receptor-mediated, 294-296, 
294f, 297-299EP 
extracellular messengers, 586 
amino acids and amino acid derivatives, 
586 
eicosanoids, 586 
gases, 586 
polypeptides and proteins, 586 
steroids, 586 
GPCRs (See G protein-coupled receptors 
(GPCRs)) 
insulin 
diabetes mellitus, 613 
glucose transport, 612-613, 612f 
as protein-tyrosine kinase, 611 
signaling by, 611-613 
substrates, 611-612, 612f 
insulin, cell signaling by, 611-613 
nicotinic acetylcholine, 147-151EP 
conformational change in, 150-151EP, 
151EPf 
electron density map of, 150EP, 150EPf 
in electron micrographs of membranes of 
electric organs, 149-150EP, 150EPf 
SDS-PAGE following electrophoresis, 
148EP, 149EPf 


NMDA, 165 
odorant, 603 
pattern recognition, 663 
ryanodine, 614 
seven-transmembrane (7TM), 587 
in signal transduction 
by B- and T-cell receptors, 586 
by GPCRs, 587-588 
survey of receptors in, 586 
SRP, 269, 269f 
in transmembrane signaling, 245 
Recombinant DNA, 723-726 
DNA cloning, 724-726, 725f 
eukaryotic DNAs in bacterial plasmids, 
725-726, 725f 
eukaryotic DNAs in phage genomes, 726, 
726£ 
formation of, 723-724, 725f 
restriction endonucleases, 723, 724f 
Recombinants, 371 
Recombination nodules, 575, 575f 
Red blood cells. See Erythrocyte plasma 
membrane 
Redox control, in carbohydrate synthesis, 
215-216, 215f 
Reduced coenzymes, 175-177, 176f, 177f 
Reducing agent, 103 
Reducing power, 109 
Reductionist nature of cell and molecular 
biology, 2 
Refractory period, 160-161, 162f 
Regenerative ability, 539, 540 
Regenerative therapeutics, 540 
Regulated secretion, 259 
Regulatory gene, 457 
Regulatory T cells Tie cells), 678 
Renaturation, 384-387, 384f 
Replica, 706 
Replication foci, 530, 530f 
Replication forks, 517 
in bacterial cells, 517 
eukaryotic, 528, 529-530, 529f 
machinery operating at, 521-522, 521f, 522f 
stalled, 535-536, 535fn 
Replicons, 526-527 
Replisome, 522, 523f, 528, 529f 
Repressor, 457 
Reproduction of cells, 5, 5f 
Residual body, 292 
Residues, 50 
Resolution, in light microscope, 694, 694f 
Respiratory chain. See Electron transport 
chain 
Response elements, 494 
Resting potential, 159-160, 160f 
Restriction endonucleases, 723, 724f 
Restriction point, 547fn 
Retinoblastoma, 640-641, 641f 
Retrotransposons, 392-393, 393f 
Retroviruses, 393, 607, 631, 732 


Reuptake, 165 
Revel, Jean Paul, 252EP 
Reverse genetics, 735 
Reverse transcriptase (RT), 102HP, 392-393, 
526, 633EP 
REC, 529, 529f, 529t 
R groups. See Side chains 
Rheumatoid arthritis, 669HP 
Rho, 411 
RhoA protein, 569, 570 
Rh*/Rh” genotype, 680-681 
Ribonuclease 
folding of, 60-61, 61f 
ribbon model of, 54, 55f 
self-assembly of, 60 
Ribonuclease P, 416 
Ribonucleic acid (RNA). See RNA 
Ribonucleoproteins 
small nuclear (snRNPs), 426, 427f, 428, 428f 
small nucleolar (snoRNPs), 415 
Ribonucleoside monophosphates, 77 
Ribose, 77, 77f, 373fn, 414, 415 
methylated, 415, 416f, 423, 424f 
Ribosomal RNA (rRNA), 407, 407f 
nucleolus, 413, 413f 
pre-rRNAs (See 
Preribosomal RNA (pre-rRNA)) 
synthesis and processing of, 413-416, 414f 
5SrRNA, 415-416 
pre-rRNA, 413-415, 414f 
tRNAs, 416, 416f 
Ribosomal subunits, assembly of, 78, 79, 79f 
Ribosomes 
ER-bound, synthesis of integral membrane 
proteins on, 270-271, 271f 
in initiation of translation, 444-445, 444f 
macromolecular composition of 
mammalian, 413, 413f 
synthesis of proteins on membrane-bound 
vs. free, 268 
Riboswitches, 459-460 
Ribozymes, 78, 78f, 90 
creating new, in laboratory, 429-430 
discovery of, 425 
Ribulose bisphosphate carboxylase oxygenase 
(Rubisco), 214-216, 214f, 216f 
Ribulose 1,5-bisphosphate (RuBP), 213-214, 
214f 
Rienhoff, Hugh, 366, 367 
Ringer, Sydney, 350 
RISCs, 431 
Ritonavir, 99 
Ritossa, EF M., 67EP 
Rituxan, 655 
RNA, 77-78, 77f, 78f 
antisense, 436 
CRISPR (crRNA), 435-436 
double-stranded (dsRNA), 430-431 
5S rRNA, 415-416 
heterogeneous nuclear (hnRNA), 417, 417f 


mRNAs compared to, 420, 421f 
in RNA splicing, 426 
interference (RNAi), 265, 265f, 430-432, 
430f, 736-737, 737£ 
clinical applications of, 432-433HP 
long noncoding (IncRNA), 502, 502f 
messenger (mRNA), messenger RNA 
(mRNA) 
microRNA (miRNA), 407, 431f, 434-435, 
434f 
in cancer, 649 
in genetics of cancer, 649 
in translational control, 508-509, 508f, 
509f 
naturally occurring, 429 
noncoding meiotic (meiRNA), 574 
noncoding (ncRNA), 435-436 
piwi-interacting (piRNA), 407, 435 
preribosomal (pre-rRNA), 413-416 
primary transcript (pre-RNA), 412 
processing, in eukaryotic cells, 411-412, 412f 
ribosomal (rRNA) (See Ribosomal RNA 
(rRNA)) 
sequencing, 485, 487-488, 487f 
single-guide (sgRNA), 435-436 
small interfering (siRNA), 72, 265, 407, 431, 
431f, 432-433HP, 432fn, 736-737 
small nuclear (snRNA), 407, 426, 427f 
small nucleolar (snoRNA), 407, 415, 416f 
templates, quantifying, 728 
transport, 465 
trans (tRNA) (See Trans RNA (tRNA)) 
XIST, 470 
RNA editing, 504 
RNA enzymes, 78 
RNA interference (RNAi), 265, 265f, 430-432, 
430f, 736-737, 737£ 
clinical applications of, 432-433HP 
RNA polymerase, 408-410, 409f, 410f, 520, 
523-524, 532, 533f 
RNA sequencing (RNA-Seq), 485, 487-488, 
487f£ 
RNA silencing, 431 
RNA splicing, 429 
RNA tumor viruses, 631, 632EPf, 633-634EP, 
635EP, 638, 646 
“RNA World” hypothesis, 405 
Roberts, Richard, 420 
Robertson, J. D., 115 
Robustness of cells, 6 
Rodbell, Martin, 591EP, 592EP 
Rosenbaum, Joel, 331 
Rosenthal, Alan, 672EP 
Rotational catalysis, 190, 192f 
Roth, Thomas, 297 
Rothman, James, 279 
Rough endoplasmic reticulum (RER), 
265-266, 267-271, 267f 
cisternae, 265-266, 266f 
functions of, 268-271 


glycosylation in, 273-274, 273f, 274f 1-27 
secretory cells, 267-268, 267f 
synthesis 
of integral membrane proteins on ER- 
bound ribosomes, 270-271, 271f 
processing of newly synthesized proteins, 
270 
of proteins on membrane-bound vs. free 
ribosomes, 268 
of secretory, lysosomal, or plant vacuolar 
proteins, 268-270, 269f 
Rous, Peyton, 632-633EP 
RPA, 529, 529f, 529t 
RRNA. See Ribosomal RNA (rRNA) 
Rube Goldberg machine, 7f 
Ruben, Samuel, 202-203 
Rubisco, 67-70EP 
Ruderman, Joan, 545EP 
Ruvkun, Gary, 433-434 
Ryanodine receptors (RyRs), 614 
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Saccharomyces cerevisiae, 16, 17f, 28EP, 72, 546 

Sakmann, Bert, 143 

Salmonella, 13f, 29EP 

Saltatory conduction, 162, 162f 

Salt crystal, dissolution of, 35, 35f 

Sandhoff ’s disease, 287HPt 

Sanger, Frederick, 728-729 

Sanger-Coulson sequencing, 728-729 

Sarcomeres, 346, 347f 

Sarcoplasmic reticulum (SR), 267, 350-351, 
350f 

Sarin, 114 

Satellite cells, 13HP 

Satellite DNAs, 385, 386f 

Saturated fat, 46, 46f 

Saturation-type kinetics, 151, 152f 

Scanning electron microscopy (SEM), 701, 
707-708, 707f 

SCF complex, 562-564, 563f 

Schenk, Dale, 65HP 

Schizosaccharomyces pombe, 546 

Schleiden, Matthias, 2-3 

Schwann, Theodor, 3 

Scintillants, 709 

Scurvy, 226 

Scyllo-inositol, 66 

SDS-polyacrylamide gel electrophoresis (SDS- 
PAGE), 148EP, 149EPf, 297EP, 593EP 

Sea urchin embryos, MPF in, 544-545EP, 
545EPf 

Secondary active transport, 155-156, 156f 

Secondary cell culture, 710 

Secondary nondisjunction, 578-578HP, 
578HPf 

Secondary oocytes, 577 

Secondary spermatocytes, 577 

Secondary structure of proteins, 53-54, 54f 

Secondary walls, 255 


I-28 Second law of thermodynamics, 83-84 
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Second messengers, 584, 595-598 
cAMP as, 595-596, 595f 
discovery of, 595, 596 
formation of, 595, 595f 
hormonal responses mediated by, 
600-602, 601t 
in mobilization of blood glucose, 600, 
600f, 601f 
in signal amplification, 600 
signal transduction pathways, 600-602, 
602f 
phosphatidylinositol-derived, 596-597, 
596f, 597f 
phospholipase C, 597-598, 597f 
diacylglycerol, 597f, 598 
inositol 1,4,5-trisphosphate, 597f, 598, 
598t 
Secretory cells, in rough ER, 267-268, 267f 
Secretory granules, 259, 261f 
Secretory pathway, 258-259, 258f 
Secretory proteins, synthesis of, 268-270, 269f 
Section, 695 
Securin, 562 
Sedimentation velocity, 720, 721f 
Selectins, 238-239, 238f 
Selectivity filter, 144-145, 144f, 145fn, 151EPf 
Self, distinguishing from nonself, 686-688 
Self-assembly, 60, 62, 79 
Self-regulation of cells, 6-7, 6f 
Selinger, Zvi, 592-593EP 
Semagacestat, 66HP 
Semiconservative replication, 513-515, 514f, 
515f 
Semidiscontinuous replication, 519-520, 519f, 
520f 
Semipermeable, 141 
Senescence, 550, 638 
Sense RNA, 436 
Sensory perception, GPCRs in, 603 
Separase, 562 
Sequencing 
amino acid sequence of proteins 
predicting, by genome sequencing, 71, 73 
primary structure correlated with, 56 
in protein evolution and adaptation, 
76-77 
in protein folding, 61, 63HP, 63HPf 
in site-directed mutagenesis, 74-75 
tertiary structure correlated with, 56, 57f, 
61 
in threading, 61-62 
three-dimensional structure determined 
by, 70HP 
DNA sequence variation, 398-399 
genomes, 394-396 
of insulin, 53, 53f 
intervening sequences, 420 
nonsynonymous, 437 
RNA, 485, 487-488, 487f 
synonymous, 437 


Seven-transmembrane (7TM) receptors, 587 
Sex chromosomes, 382, 469, 577-578HP 
Sexual arousal, 620 
Shadow casting, 705, 705f 
Shaker ion channel, 146f 
Sharp, Phillip, 420 
Shimomura, Osamu, 697 
Shine-Dalgarno sequence, 506 
SH3 domain, 60, 60f, 72, 73f 
Sickle cell anemia, 53, 53f, 437 
Sicl protein, 548 
Side chains, 49f, 50-53, 51f 
cysteine, 50, 51f 
glycine, 50, 51f 
hydrophilic, 52, 52f 
hydrophobic, 52-53, 52f 
ionization reactions of glutamic acid and 
lysine, 50, 52f 
nonpolar, 50, 51f 
polar, charged, 50, 51f 
polar, uncharged, 50, 51f 
proline, 50, 51f 
Siekevitz, Philip, 263 
Sigler, Paul, 68EP 
Sigma factor (6), 411 
Signal amplification, cAMP in, 600 
Signal hypothesis, 268 
Signaling pathways 
of apoptosis, 621-624 
extrinsic pathway, 622-623, 623f 
intrinsic pathway, 623-624, 624f 
complexity of, 617-618 
convergence in, 618, 618f, 619f 
cross-talk in, 618, 618f, 619f 
defined, 584 
divergence in, 618, 618f 
downstream, activation of, 604, 606-607 
overview of, 583, 584f 
in plants, 613 
ras-MAP kinase pathway, 607-610 
Signaling proteins, 584, 585f. See also 
GTP-binding proteins (G proteins) 
actions of, 584, 585f 
binding sites for 
autophosphorylation sites, 604, 606, 
611-612 
cytoplasmic side of plasma membrane, 
587-588 
PTB domains, 604-606, 605f, 606f, 612 
SH2 domains, 604-606, 605f, 606f, 
611-612, 611f 
construction of, 584 
diversity of, 606, 606f 
interacting with activated RTKs, 605-607, 
606f 
adaptor proteins and scaffolding proteins, 
606, 607f 
signaling enzymes, 607 
transcription factors, 607 
mutations altering structure of, 594HP 


Signal recognition particle (SRP), 268-269, 
269f 
Signal sequence, 268 
Signal transduction, 116 
by B- and T-cell receptors, 586 
cAMP, 600-602, 602f 
by GPCRs, 586, 587-590 
bacterial toxins, 590 
GTP-binding proteins, 586, 588-589 
physiologic processes mediated by, 587t 
receptors, 587-588 
termination of response, 589-590 
by ligand-gated channels, 586 
in lymphocyte activation, 689-691 
phospholipase C in, 597-598 
protein-tyrosine phosphorylation in, 
603-607 
downstream signaling pathways, 
activation of, 604, 606-607 
ending response, 607 
phosphotyrosine-dependent protein- 
protein interactions, 604 
protein kinase activation, 604 
receptor dimerization, 604 
by RTKs, 586 
by steroids, 586 
survey of receptors in, 586 
Sildenafil, 620 
Silencing, gene, 734-738 
engineered nucleases in, 737-738, 738f 
knockout mice, 735-736, 736f 
RNA interference, 736-737, 737f 
in vitro mutagenesis, 735 
Simian virus 40 (SV40), 382f, 463, 528, 631, 
641 
Simple sugars, structure of, 42, 42f 
Single-celled eukaryotic cells, 15-16, 15f 
Single-guide RNA (sgRNA), 435-436 
Single nucleotide polymorphisms (SNPs), 
398-399, 401-402HP 
Single-nucleotide variants (SNVs), 399 
Single-particle reconstruction, 718-719, 718f 
Single-particle tracking (SPT), 134 
Single-stranded DNA, 384, 385f, 386f, 421, 
422f, 518, 521, 521f, 528, 529, 530f, 550, 
579, 722 
Single-stranded DNA-binding (SSB) proteins, 
521, 521f, 522f, 523f, 529f, 529t 
Sinoatrial node, 249 
Site-directed mutagenesis (SDM), 74-75, 735 
16S rRNA, 27EP 
Skeleton, membrane, 137, 138f, 139 
Skin cancer. See Melanoma 
Skou, Jens, 152-153 
Slevin, P. R., 320EP 
Sliding filament model of muscle contraction, 
346, 347f, 348-351 
composition and organization of thick and 
thin filaments, 348, 348f 
excitation-contraction coupling, 350-351, 
350f, 351f 


molecular basis of contraction, 348-350, 
349f 
Small interfering RNA (siRNA), 72, 265, 407, 
431, 431f, 432-433HP, 432fn, 736-737 
Small nuclear ribonucleoproteins (snRNPs), 
426, 427f, 428, 428f 
Small nuclear RNA (snRNA), 407, 426, 427f£ 
Small nucleolar ribonucleoproteins 
(snoRNPs), 415 
Small nucleolar RNA (snoRNA), 407, 415, 
416f 
Smallpox, 671 
Smell (olfaction), 603 
Smith, Dennis, 544EP 
Smithies, Oliver, 735 
Smoking, 148EPfn, 400HP, 631 
Smooth endoplasmic reticulum (SER), 
265-267, 267f 
functions of, 266-267 
Snake venom, 233 
SNAP-25, 288, 289, 289f, 290f 
SNARE motif, 288 
SNAREs, 288-290, 289f, 290f 
S-nitrosylation, 621 
Sodium dodecyl sulfate (SDS), 137 
polyacrylamide gel electrophoresis and, 716 
Sodium-potassium pump, 153-154, 153f, 154f 
Solanezumab, 66 
Solubilization of membrane proteins with 
detergents, 126, 126f 
Solute movement, 139-140, 139f 
electrochemical gradient, 140 
energetics of, 139-140 (See also Diffusion) 
Somatic cell nuclear transfer (SCNT), 19HP, 
19HPf 
Somatic hypermutation, 683 
Somatic mutation, 594HP 
Somavert, 75 
Song, Ok-kyu, 714 
Sorting signals, 260 
Southern blot, 722, 722f 
Sox2 gene, 20HP 
Specific activity, 712 
Specificity, 49 
of enzymes, 90, 93 
of GPCRs, 599 
host-cell, of viruses, 24 
Spectrin, 137, 138f, 139 
Spectrophotometer, 716 
Speech and language disorders, 397 
Spermatids, 572, 573f 
Spermatogonia, 572, 573f 
Spermatozoon, 572 
S phase, 541 
Sphingolipids, 119-120, 119f 
Sphingolipid storage diseases, 287HP, 287HPt 
Spindle assembly checkpoint (SAC), 566-567, 
566f 
Spleen, 231f, 662, 662f 
Spliceosome, 426 
Splice sites, 525 


Split genes, 420-423, 429 
Spongiform encephalopathy, 63HP 
Spongiform encephalopathy, 63HP 
Spontaneous energy transformations, 85-87 
Spontaneous events, 83 
Spontaneous mutation rate, 524 
Sporogenesis, 572, 573, 573f 
Sporophyte, 572, 573, 573f 
Spudich, J. A., 319EP 
Squamous cell carcinoma, 537HP 
SRC-homology 2 ( SH2 ), 604-606, 605f, 606f, 
611-612, 611f 
SRC oncogene, 633-634EP, 647 
SRC protein kinase, 235, 235f 
SRC tyrosine kinase, 690 
SRGAP2 gene, 397 
SRP receptor, 269, 269f 
SR proteins, 428 
Stahl, Franklin, 513-515, 515f 
Standard conditions in free-energy change, 
86-87, 86fn 
Standard free energy change (AG°’), 86-87, 
87t 
Stanley, Wendell, 23 
Staphylococcus aureus, 101HP 
Starch, 44f, 45 
START, 547 
Static enzyme, 190 
Statin drugs, 302 
Statins, 302 
Steady state metabolism, 9f, 88-89 
Stehelin, Dominique, 633EP 
Steitz, Thomas, 453EPf 
Stem cells 
adult, 18HP, 1SHPf 
cancer, 658-659 
defined, 18HP 
embryonic, 18-19HP, 19HPf 
hematopoietic, 665, 666f, 671HP 
induced pluripotent, 19-21HP, 20HPf 
mesenchymal, 18HP 
Stentor coeruleus, 21 
Stereoisomerism, 42-43 
of amino acids, 49, 49f 
of glyceraldehyde, 42-43, 43f 
Steroids 
cholesterol, 47 
as extracellular carrier, 586 
structure of, 47, 47f 
Steward, Frederick, 483 
Stimuli, response to, 6-7, 7f, 116 
STIM1, 615, 616f 
Stochastic events, 5 
Stomach cancer, 631f, 642f 
Store-operated calcium entry (SOCE), 615, 
616f 
STORM (stochastic optical reconstruction 
microscopy), 700 
Stroke, 233, 241HP 
Stroma, 202 


Stroma targeting domain, 306 I-29 
Stromatolites, 31 
Structural genes, 457 
Structural polysaccharides (cellulose, chitin, 
and glycosaminoglycans), 44f, 45-46, 
45f 
Structural variants, 399, 399f 
Structure-based drug design, 75-76 
Subcellular fractions, analysis of, 262, 263f 
Subcellular localization, 548-550, 549f 
Subcompartments of endoplasmic reticulum, 
265-271 
Substrate-level phosphorylation, 105-106, 107f 
Substrate orientation, 94, 94f 
Substrate reduction therapy, 288HP 
Substrates 
changing substrate reactivity, 94, 95f 
defined, 90 
inducing strain in substrate, 94-96, 95f 
substrate orientation, 94, 94f 
Subthreshold depolarization, 251EP 
Subunits 
ion channel subunits, helix/helices in 
domains of, 145, 146f 
KcsA channel subunits, 145, 145f 
Ky channel subunits, 145-146, 146f 
of proteins, 58-59 
ribosomal, assembly of, 78, 79, 79f 
of voltage-gated potassium ions, 145, 146f 
Succinate dehydrogenase (complex II), 185 
Sumner, James, 90 
Supercoiling, 381-382, 382f 
Superfamilies of molecules, 76 
Supergroups, 28EPf 
Superoxide dismutase (SOD), 34HP 
Super-resolution fluorescence microscopy, 
699-700, 701f 
Surveillance, mRNA, 448-449 
Sutherland, Earl, 590EP 
Sutton, Walter, 369, 370 
SV40, 382f, 463, 528, 631, 641 
Svoboda, Karel, 319EP 
Symport, 156, 156f 
Synapses, 162-165 
actions of drugs on, 165 
defined, 162-163 
neuromuscular junctions, 163, 163f 
synaptic plasticity, 165 
Synapsis, 574-575 
Synaptic cleft, 163, 163f, 164, 164f, 165 
Synaptic plasticity, 165 
Synaptic transmission, 162-165, 163f 
sequence of events during, 164, 164f 
Synaptic vesicles, 118f, 163, 163f, 164, 164f 
Synaptonemal complex (SC), 574-575, 575f 
Synonymous sequence, 437 
Syntelic attachment, 559f, 566 
Synthesis 
of mRNAs, 417-419 
hnRNA formation, 417, 417f 
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mRNA transcription, machinery for, 
417-419, 418f, 419f 
rough ER in 
processing of newly synthesized proteins, 
270 
synthesis of integral membrane proteins 
on ER-bound ribosomes, 270-271, 271f 
synthesis of proteins on membrane- 
bound vs. free ribosomes, 268 
synthesis of secretory, lysosomal, or plant 
vacuolar proteins, 268-270, 269f 
Synthetic biology, 22-23, 22f 
System 
closed vs. open, 89 
defined, 83 
energy of, 83, 83f 
entropy of, 83-84, 84f, 85, 94 
surroundings of, 83, 83f, 84, 85, 89 
Systemic lupus erythematosus (SLE), 669HP, 
669HPf 


T 


Tail-anchored proteins, 271 
TALENs, 737 
Tamoxifen, 631 
Tandem MS, 717 
Tandem repeats, 384-386 
TAP tag, 72, 73, 713 
TAP-tag mass spectrometry, 72 
Targeted therapies, 654 
Taste (gustation), 603 
TATA-binding protein (TBP), 418 
TATA box, 492-493 
Taxol, 310 
Taylor, J. Herbert, 514-515 
Tay-Sachs disease, 287HPt, 288HP 
TBP-associated factors (TAFs), 418f 
T-cell receptor (TCR), 655, 670HP, 675-676, 
683-684, 685, 686, 688, 689 
T-cell, 655, 670HP, 675-676, 683-684, 685, 
686, 688, 689 
toll-like, 663-664, 664f 
T cells, 621 
activation and mechanism of action, 
675-678 
antigen-presenting cells, 670HP, 672-673EP, 
676, 676f 
cytokines, 676-677, 677t 
dendritic cells, 676 
DNA rearrangements producing genes 
encoding antigen receptors of, 681-683, 
682f, 683f 
subclasses of, 677-678 
cytotoxic T cells (CTLs), 677 
helper T cells (T,, cells), 677-678, 677fn 
regulatory T cells (Ta cells), 678 
T-DNA region, 734 
T-DNA transformation, 734 
Tec tyrosine kinase, 690 
Telomerase, 475, 476f 


Telomeres, 473, 475-477, 475f, 476f, 574, 575f 
Telophase 
of meiosis, 572f, 577 
of mitosis, 553f, 567-571, 567f 
Temin, Howard, 633EP 
Temperature-sensitive (ts) mutants, 516 
Templates for DNA polymerase, 518, 518f 
Teprotide, 99 
Teratoma, 19HP, 20HP 
Terminal knob, 159, 159f, 163, 163f, 164, 164f 
Terminal meiosis, 572, 573f, 574f 
Termination, translational, 448, 448f 
Tertiary structure of proteins, 55-58 
dynamic changes with proteins, 58, 58f 
myoglobin, 55f, 56, 57f 
protein domains, 57-58, 57f 
similarities between proteins revealed by, 
56-57, 57f 
Testosterone, 47, 47f, 632 
Tetanus, 671 
Tethering, 288, 289f 
Tetrad, 574f, 575, 576 
Tetrahymena thermophila, 425-426, 
450-451EP, 451EPf 
Tetranucleotide sequences, 731 
Tetranucleotide theory, 374, 378EP, 380EP 
Tetroses, 42 
TFFI gene, 481 
TFUD, 418 
TFIIH, 532, 533f, 536HP 
TGF-beta pathway, 366, 367 
Tg737 gene, 334HP 
Thale cress, 17f 
Theriot, Julie, 358-359EP 
Thermal denaturation, 383-384, 384f 
Thermodynamics 
defined, 82 
laws of, 82-84 
first law of thermodynamics, 82-83 
second law of thermodynamics, 83-84 
of water-ice transformation, 85t 
Thermus thermophilus, 453EPf£ 
Thick filaments, 346, 347f 
Thin filaments, 346, 347f 
Third generation sequencers, 730 
Threading, 61-62 
Three-dimensional cell culture, 710, 711f 
3’ end (three-prime end), 374, 374f 
3’ poly(A) tails, 423, 424, 424f 
Threshold, 160-162, 161f, 164 
Thylakoids, 202, 202f 
Thylakoid transfer domain, 306-307 
Thymidine dimers, 535 
Thymine (T), 78, 78f, 373 
Thymus, 231f, 662, 662f, 665, 666f, 669HP, 
671HP, 686, 688, 688f 
Thymus-independent antigens, 666f, 667 
Thyroid adenoma, 594HP, 594HPt 
Thyroiditis, 669HP 
Thyroid-stimulating hormone (TSH), 669HP 


Tic complexes, 306 
Tight junctions, 244f, 245-247, 246f, 247f 
Tilghman, Shirley, 421 
Till, James, 18HP 
Tilney, Lew, 358EP 
TIM complex, 304, 305f 
Time-resolved crystallography, 96, 97f 
Ti plasmid, 734, 734f 
Tipranavir, 99 
TIRE (Total Internal Reflection Fluorescence), 
134 
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